Add `POST /tasks/tag` by PGijsbers · Pull Request #350 · openml/server-api

PGijsbers · 2026-06-25T14:22:08Z

Closes #26.

Description

Adds an endpoint for tagging a task.
Also refactors the /datasets/tag endpoint tests to be less dependent on database state.

Checklist

Please check all that apply. You can mark items as N/A if they don't apply to your change.

Always:

I have performed a self-review of my own pull request to ensure it contains all relevant information, and the proposed changes are minimal but sufficient to accomplish their task.

Required for code changes:

Tests pass locally
I have commented my code in hard-to-understand areas, and provided or updated docstrings as needed
Changes are already covered under existing tests
I have added tests that cover the changes (only required if not already under coverage)

If applicable:

I have made corresponding changes to the documentation pages (/docs)

Extra context:

This PR and the commits have been created autonomously by a bot/agent.

nb. If tests pass locally but fail on CI, please try to investigate the cause. If you are unable to resolve the issue, please share your findings.

coderabbitai · 2026-06-25T14:22:25Z

Warning

Review limit reached

@PGijsbers, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 51 minutes and 58 seconds. Learn how PR review limits work.

Your organization has used up its prepaid credits, and credit purchases are no longer available. Enable the review add-on in the billing tab to keep reviews running — you're only billed for reviews past your plan's rate limits ($0.25/file).

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based credits.

🚦 How do rate limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan review availability.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, additional reviews become available more gradually as earlier reviews age out of the rolling window.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 5365689a-897b-4092-af52-c40678f36eea

📥 Commits

Reviewing files that changed from the base of the PR and between e19bd0d and 46ae998.

📒 Files selected for processing (1)

tests/routers/openml/tag_test_helper.py

Walkthrough

Adds task-tag persistence and error mapping in the database layer, exposes a POST /tasks/tag route that returns updated task tags, and expands shared test fixtures plus task and dataset tag coverage.

Possibly related PRs

openml/server-api#322: Introduces the database-level tagging error types and related task-tag behavior used by this PR.

Suggested labels

tests, POST

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title is concise and clearly describes the main change: adding the task tagging endpoint.
Description check	✅ Passed	The description matches the changeset by describing the new task-tag endpoint and related test refactor.
Linked Issues check	✅ Passed	The PR implements the requested task tagging endpoint for issue `#26` and adds supporting tests and error handling.
Out of Scope Changes check	✅ Passed	The extra test fixtures, constants, and helper updates are all in service of the new task tagging feature.

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch post-tag

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands.}

sourcery-ai

Hey - I've left some high level feedback:

The new TaskFactory/DatasetFactory fixtures always insert with a fixed default ID (42_000), which can easily lead to primary key collisions if they are called more than once per test; consider generating unique IDs (e.g. via a counter or random range) to make the fixture safer to reuse.
The TaskNotFoundError mapping in tag_task uses a hard-coded numeric error code (472) which is then duplicated in tests; centralizing this code in a shared constant would reduce the risk of future mismatches.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The new TaskFactory/DatasetFactory fixtures always insert with a fixed default ID (42_000), which can easily lead to primary key collisions if they are called more than once per test; consider generating unique IDs (e.g. via a counter or random range) to make the fixture safer to reuse.
- The `TaskNotFoundError` mapping in `tag_task` uses a hard-coded numeric error code (472) which is then duplicated in tests; centralizing this code in a shared constant would reduce the risk of future mismatches.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

codecov · 2026-06-25T14:24:20Z

Codecov Report

❌ Patch coverage is 97.61905% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 95.21%. Comparing base (c88d9fc) to head (46ae998).

Files with missing lines	Patch %	Lines
src/database/tasks.py	86.66%	1 Missing and 1 partial ⚠️
tests/routers/openml/tag_test_helper.py	94.11%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #350      +/-   ##
==========================================
+ Coverage   95.00%   95.21%   +0.21%     
==========================================
  Files          72       74       +2     
  Lines        3580     3699     +119     
  Branches      243      244       +1     
==========================================
+ Hits         3401     3522     +121     
+ Misses        115      113       -2     
  Partials       64       64

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

coderabbitai

🧹 Nitpick comments (3)

tests/routers/openml/tag_test_helper.py (2)
50-51: 📐 Maintainability & Code Quality | 🔵 Trivial | 💤 Low value

Redundant status check.

already_tagged (lines 27-30) is only True when php_response.status_code == INTERNAL_SERVER_ERROR, so the extra status comparison on line 51 is always satisfied and can be dropped for clarity.
♻️ Simplify the condition
-    if php_response.status_code == HTTPStatus.INTERNAL_SERVER_ERROR and already_tagged:
+    if already_tagged:
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/routers/openml/tag_test_helper.py` around lines 50 - 51, The condition
in the tag conflict handling helper is redundant because already_tagged is only
true when php_response.status_code is INTERNAL_SERVER_ERROR. Simplify the check
in the tag_test_helper flow by removing the extra status comparison and relying
on already_tagged alone, keeping the logic around the php_response handling and
the helper’s conflict detection clear and easier to read.
32-33: 📐 Maintainability & Code Quality | 🔵 Trivial | 💤 Low value

Comment typo: "taskbase".

These comments read "persist this change to the taskbase" / "committed to the taskbase", which looks like a stray word (likely "database").
📝 Suggested wording
-        # undo the tag, because we don't want to persist this change to the taskbase
-        # Sometimes a change is already committed to the taskbase even if an error occurs.
+        # undo the tag, because we don't want to persist this change to the database
+        # Sometimes a change is already committed to the database even if an error occurs.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/routers/openml/tag_test_helper.py` around lines 32 - 33, The comments
in tag_test_helper.py use the stray word “taskbase” in the undo/persistence
note. Update those nearby comments to use the intended term consistently, likely
“database,” and keep the wording aligned with the surrounding logic in the tag
helper comments.
tests/routers/openml/task_tag_test.py (1)
90-98: 📐 Maintainability & Code Quality | 🔵 Trivial | 💤 Low value

Parity test parametrizes task_id with dataset-specific constants.

constants.SOME_DEACTIVATED_DATASET_ID and constants.DATASET_ID_THAT_DOES_NOT_EXIST are dataset identifiers used here as task_id values. They function as arbitrary integers for PHP/Python parity, but the names imply dataset semantics and obscure the intent (existing vs. non-existing task). Consider task-oriented constants or inline literals with explanatory ids.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/routers/openml/task_tag_test.py` around lines 90 - 98, The parity test
in task_tag_test.py is using dataset-named constants as task_id inputs, which
obscures the intent of the parametrization. Update the task_id cases in the
parametrized test near the task_id fixture to use task-oriented identifiers or
explicit integer literals with clear naming/comments, so it is obvious which
values represent existing versus non-existing tasks. Keep the change localized
to the test data setup around the pytest.mark.parametrize block.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@tests/routers/openml/tag_test_helper.py`:
- Around line 50-51: The condition in the tag conflict handling helper is
redundant because already_tagged is only true when php_response.status_code is
INTERNAL_SERVER_ERROR. Simplify the check in the tag_test_helper flow by
removing the extra status comparison and relying on already_tagged alone,
keeping the logic around the php_response handling and the helper’s conflict
detection clear and easier to read.
- Around line 32-33: The comments in tag_test_helper.py use the stray word
“taskbase” in the undo/persistence note. Update those nearby comments to use the
intended term consistently, likely “database,” and keep the wording aligned with
the surrounding logic in the tag helper comments.

In `@tests/routers/openml/task_tag_test.py`:
- Around line 90-98: The parity test in task_tag_test.py is using dataset-named
constants as task_id inputs, which obscures the intent of the parametrization.
Update the task_id cases in the parametrized test near the task_id fixture to
use task-oriented identifiers or explicit integer literals with clear
naming/comments, so it is obvious which values represent existing versus
non-existing tasks. Keep the change localized to the test data setup around the
pytest.mark.parametrize block.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: ad0693cf-3d9d-4aaf-b625-05ed55d0c9eb

📥 Commits

Reviewing files that changed from the base of the PR and between c88d9fc and 25992b2.

📒 Files selected for processing (6)

src/database/tasks.py
src/routers/openml/tasks.py
tests/conftest.py
tests/routers/openml/dataset_tag_test.py
tests/routers/openml/tag_test_helper.py
tests/routers/openml/task_tag_test.py

PGijsbers added 5 commits June 25, 2026 08:54

Add tag endpoint

f5bc641

Add tests

2af428f

Toward unifying tag tests

3db1974

Refactor the tag task tests

e43b177

Refactor dataset tag tests to not rely on database state

25992b2

sourcery-ai Bot reviewed Jun 25, 2026

View reviewed changes

PGijsbers added 2 commits June 25, 2026 16:28

indicate that the identifier does not matter to the test

3078682

generalize the name since it should be valid for all entities

0396bca

coderabbitai Bot reviewed Jun 25, 2026

View reviewed changes

PGijsbers added 4 commits June 26, 2026 09:12

remove dead code

d7a406a

Make task not found in tag error code a constant

c11d5ae

Make the dataset and task factories callable multiple times

e19bd0d

Simplify control flow, fix comments

46ae998

PGijsbers merged commit b61e974 into main Jun 26, 2026
8 of 9 checks passed

PGijsbers deleted the post-tag branch June 26, 2026 07:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add `POST /tasks/tag`#350

Add `POST /tasks/tag`#350
PGijsbers merged 11 commits into
mainfrom
post-tag

PGijsbers commented Jun 25, 2026

Uh oh!

coderabbitai Bot commented Jun 25, 2026 •

edited

Loading

Review limit reached

❌ Failed checks (1 warning)

Uh oh!

sourcery-ai Bot left a comment

Uh oh!

codecov Bot commented Jun 25, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Uh oh!

Conversation

PGijsbers commented Jun 25, 2026

Description

Checklist

Uh oh!

coderabbitai Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Walkthrough

Possibly related PRs

Suggested labels

❌ Failed checks (1 warning)

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

coderabbitai Bot commented Jun 25, 2026 •

edited

Loading

codecov Bot commented Jun 25, 2026 •

edited

Loading