Skip to content

Actions: CodingThrust/problem-reductions-benchmark

Actions

CI — unit tests and verifier calibration

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
49 workflow runs
49 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

leaderboard: aggregate-only public results; never publish found bugs
CI — unit tests and verifier calibration #50: Commit 3d2a526 pushed by isPANN
1m 11s main
skill: add run-benchmark for driving a run end-to-end (macOS/Linux)
CI — unit tests and verifier calibration #49: Commit 5581951 pushed by isPANN
1m 21s main
docs: add MIT LICENSE; rename the guide to CONTRIBUTING.md (#27)
CI — unit tests and verifier calibration #48: Commit 609c772 pushed by isPANN
1m 40s main
docs: collapse to README + SUBMISSION; drop SHOWCASE and CONTRIBUTING…
CI — unit tests and verifier calibration #46: Commit 9b5b7af pushed by isPANN
1m 15s main
docs(readme): submission is a GitHub PR (not the Space); pin Python 3.12
CI — unit tests and verifier calibration #44: Commit 02dac7f pushed by isPANN
1m 15s main
build(deps): bump mini-swe-agent from 2.2.8 to 2.4.4 in /benchmark (#19)
CI — unit tests and verifier calibration #43: Commit 5252b45 pushed by isPANN
1m 32s main
ci(dependabot): pin python base to 3.12, ignore its updates
CI — unit tests and verifier calibration #41: Commit dfe1a4e pushed by isPANN
1m 13s main
build(deps): update pytest requirement in /benchmark (#24)
CI — unit tests and verifier calibration #40: Commit 7684fcb pushed by isPANN
1m 38s main
build(deps): update pytest-timeout requirement in /benchmark (#20)
CI — unit tests and verifier calibration #37: Commit 030a26a pushed by isPANN
1m 40s main
build(deps): update pyyaml requirement in /benchmark (#21)
CI — unit tests and verifier calibration #36: Commit 2c67f71 pushed by isPANN
1m 12s main
chore: drop internal docs/superpowers planning notes
CI — unit tests and verifier calibration #32: Commit 4c9eb42 pushed by isPANN
1m 36s main
chore: remove dead run.py and its anthropic dependency (#25)
CI — unit tests and verifier calibration #31: Commit 7c6b117 pushed by isPANN
1m 9s main
build(deps): bump the actions group with 7 updates (#22)
CI — unit tests and verifier calibration #29: Commit 9d70dce pushed by isPANN
1m 12s main