Skip to content

Tolerate captioned-output schema in explorer datasets#77

Merged
max-rosenblattl merged 1 commit into
mainfrom
max-rosenblattl/explorer-on-captioned-datasets
May 7, 2026
Merged

Tolerate captioned-output schema in explorer datasets#77
max-rosenblattl merged 1 commit into
mainfrom
max-rosenblattl/explorer-on-captioned-datasets

Conversation

@max-rosenblattl

Copy link
Copy Markdown
Collaborator

Summary

  • Make MHCDataset._parse (daily) and WeeklyMHCDataset._parse (weekly) tolerant of the captioned-output schema (sensor-cap-{daily,weekly}-tiny on HF), which lacks a few raw-input fields and renames channel_units -> units.
  • Lets the existing explorer.py load either raw HF saved-disk dirs or the captioned shipped configs without code-path branching.

Test plan

  • MHC_DATASET_DIR=./sensor-cap-daily-tiny python explorer.py opens, navigates rows, runs detectors live.
  • MHC_WEEKLY_DATASET_DIR=./sensor-cap-weekly-tiny python explorer.py --weekly same.
  • No regression on raw daily / raw weekly explorer flow.

@coderabbitai

coderabbitai Bot commented May 7, 2026

Copy link
Copy Markdown
Contributor

Warning

Rate limit exceeded

@max-rosenblattl has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 16 minutes and 57 seconds before requesting another review.

To continue reviewing without waiting, purchase usage credits in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: c00fb642-e4e7-47ee-8132-d82a32fb2f0a

📥 Commits

Reviewing files that changed from the base of the PR and between 8515a38 and 2e6dcb5.

📒 Files selected for processing (2)
  • mhc/dataset.py
  • mhc_weekly/dataset.py
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch max-rosenblattl/explorer-on-captioned-datasets

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@max-rosenblattl max-rosenblattl merged commit db53907 into main May 7, 2026
2 of 3 checks passed
@max-rosenblattl max-rosenblattl deleted the max-rosenblattl/explorer-on-captioned-datasets branch May 7, 2026 09:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant