Public repository scaffold for the paper "What Are We Actually Benchmarking in Robot Manipulation?"
Project website: https://ripl.github.io/manipulation_benchmark_audit/
This repository will host public code, diagnostics, result summaries, and artifact documentation for the paper. The current version is a minimal release scaffold while code and artifacts are prepared.
This repository is being prepared for public release. Code, benchmark diagnostics, result summaries, and artifact access instructions are coming soon.
The current repository intentionally contains only lightweight documentation scaffolding. It does not include model weights, datasets, simulator caches, rollout videos, generated result bundles, or private filesystem paths.
.
├── artifacts/ # Release artifact documentation; files coming soon.
├── diagnostics/ # Benchmark diagnostic descriptions; code coming soon.
├── results/ # Lightweight paper result summaries; files coming soon.
├── LICENSE
└── README.md
- Diagnostic definitions and minimal code needed to run public benchmark audit checks.
- Lightweight result summaries for paper-facing claims.
- Artifact documentation with download or reproduction instructions for data that is too large to store in git.
For questions, contact Tianchong Jiang via GitHub: https://github.com/Tianchong-Jiang.