Parent
Spawned from #8 (M4.1 tracking issue).
Objective
Archive each session's trajectory (the sequence of pred commands the AI ran) alongside the results file, so any leaderboard number can be re-validated offline.
Definition of done
Dependencies
Depends on #4, #6.
Parent
Spawned from #8 (M4.1 tracking issue).
Objective
Archive each session's trajectory (the sequence of pred commands the AI ran) alongside the results file, so any leaderboard number can be re-validated offline.
Definition of done
run_mini.pysaves atrajectories/{safe_model}_{rule}.jsonlfile containing each pred call (args + stdout + returncode) during a sessionbuild_index.pyalready picks uptrajectory_file(done in M3.1 — Build the public leaderboard: dual metrics + certificate drilldown + trajectory #6) — no change needed theremake test-unitstill passesDependencies
Depends on #4, #6.