Skip to content

Support dataset-type HF repos in dataset materialization#398

Merged
MaxGhenis merged 2 commits into
mainfrom
populace-dataset-repo-support
Jun 11, 2026
Merged

Support dataset-type HF repos in dataset materialization#398
MaxGhenis merged 2 commits into
mainfrom
populace-dataset-repo-support

Conversation

@MaxGhenis

Copy link
Copy Markdown
Contributor

policyengine_core's download helper assumes model-type HF repos; certified data releases can live in dataset-type repos (the populace-us release, certified-track via policyengine-bundles#25, is one). Adds a dataset-repo retry in materialize_dataset_source before surfacing the original failure. Verified end-to-end against hf://policyengine/populace-us/populace_us_2024.h5 — resolution, download, and dataset-key creation all work through ensure_datasets.

🤖 Generated with Claude Code

Certified data releases can live in dataset-type repos (e.g.
policyengine/populace-us); the core helper assumes model-type. Retry
with repo_type=dataset before surfacing the original failure.

Verified: hf://policyengine/populace-us/populace_us_2024.h5 resolves
and downloads through ensure_datasets with this fallback.
@MaxGhenis MaxGhenis merged commit 86f1de1 into main Jun 11, 2026
11 checks passed
@MaxGhenis MaxGhenis deleted the populace-dataset-repo-support branch June 11, 2026 15:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant