Parent
Spawned from #8 (M4.1 tracking issue).
Objective
Write a README that explains:
- What this benchmark measures and why (counterexample certificate format, three violation types)
- How to add a model (what runner to implement, what results file to produce)
- How to run a session locally (
make demo, env vars needed)
- How to read the metrics (bugs/Ktok vs bugs/$, why bugs/Ktok is the primary ranking)
Definition of done
README.md exists at repo root with all four sections above
- A
CONTRIBUTING.md documents the certificate JSON schema and the verification workflow
make test-unit still passes
Dependencies
Depends on #1–#7 (all prior milestones).
Parent
Spawned from #8 (M4.1 tracking issue).
Objective
Write a README that explains:
make demo, env vars needed)Definition of done
README.mdexists at repo root with all four sections aboveCONTRIBUTING.mddocuments the certificate JSON schema and the verification workflowmake test-unitstill passesDependencies
Depends on #1–#7 (all prior milestones).