Rollout annotation prompts#403
Conversation
|
Warning This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
This stack of pull requests is managed by Graphite. Learn more about stacking. |
Per-task / per-object prompt files used by `rollout.py --annotation-path` (or the `a <path>` intervention command). Each file holds a single natural-language instruction tokenized into the `Task: ...` block of the pi prompt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
PI's process_batch_for_training now owns prompt sampling, embodiment/ control-mode/State splicing, and tokenization (see PI._build_prompts / _tokenize_prompts). The collate's only job is to stack tensors and preserve list-valued keys. - Use annotation_collate instead of build_tokenized_collate. - Replace _build_collate_from_checkpoint_cfg with _apply_annotation_to_algo: override annotation_key="annotations", sampling_mode="first", default_prompt=<rollout annotation> on the loaded PI algo so rollout-time prompts deterministically flow through. - Drop _debug_print_proprio_norm and the post-collate sampled_prompt debug print -- both inspected fields the collate no longer produces. - Stop loading the hydra run snapshot in _load_policy; the only consumer was the removed collate builder. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Claude Code ReviewSummaryAdds per-task/object prompt text files under Key concerns
Suggestions
VerdictComment — text files are fine to merge; the rollout refactor is reasonable but the silent-failure paths around Reviewed by Claude · Review workflow |

Per-task / per-object prompt files used by
rollout.py --annotation-path(or the
a <path>intervention command). Each file holds a singlenatural-language instruction tokenized into the
Task: ...block ofthe pi prompt.
Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com