feat(benchmark): add run_benchmark() with dataframe-first pipeline, metrics, and tests#3892

Open
anshul23102 wants to merge 13 commits into PecanProject:develop from anshul23102:feat/benchmark-runner
Conversation

@anshul23102
Contributor

@anshul23102 anshul23102 commented Mar 24, 2026

Description

Adds a new run_benchmark() function as a database-free entry point for the
benchmark module. Unlike calc_benchmark(), it requires no BETYdb
connection and takes validated dataframes directly as input.
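A minimal quickstart sketch of the new entry point, using the bundled sample data. The package name `PEcAn.benchmark` and the `metrics`/`plot` fields of the returned list are assumptions for illustration, not confirmed by this PR:

```r
# Package name and return-value fields are assumed here
library(PEcAn.benchmark)

model_df <- read.csv(system.file("testdata", "sample_model.csv",
                                 package = "PEcAn.benchmark"))
obs_df   <- read.csv(system.file("testdata", "sample_obs.csv",
                                 package = "PEcAn.benchmark"))

# No database connection needed: dataframes in, results out
result <- run_benchmark(model_df, obs_df)
result$metrics  # assumed: a summary of model-vs-observation agreement
result$plot     # assumed: a time-series overlay plot
```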

Files added:

  • R/run_benchmark.R — dataframe-first pipeline with bm_validate(),
    align_by_time(), compute_metrics(), and plot_time_series()
  • inst/testdata/sample_model.csv — sample model output
  • inst/testdata/sample_obs.csv — sample observations
  • tests/testthat/test-run_benchmark.R — unit tests for all four pipeline stages
  • README.md — updated with quickstart example

Motivation and Context

The existing calc_benchmark() requires a full database connection, which
makes it hard to test and to use standalone. This PR adds a lightweight,
dataframe-first entry point, run_benchmark(model_df, obs_df), for users
who want to quickly benchmark model output against observations without any
database setup. The pipeline follows a four-stage design: validate → align →
compute metrics → plot.
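The four-stage design above can be sketched as an orchestrator that chains the helper functions named in this PR (bm_validate(), align_by_time(), compute_metrics(), plot_time_series()). The exact signatures and the shape of the returned list are assumptions; only the stage names come from the PR:

```r
# Sketch only: helper signatures and return structure are assumed
run_benchmark <- function(model_df, obs_df) {
  # 1. Validate: check required columns and types in each input
  model_df <- bm_validate(model_df)
  obs_df   <- bm_validate(obs_df)

  # 2. Align: pair model and observation rows on a shared timestamp
  aligned <- align_by_time(model_df, obs_df)

  # 3. Compute metrics: summarize agreement between the paired series
  metrics <- compute_metrics(aligned)

  # 4. Plot: overlay model and observations through time
  p <- plot_time_series(aligned)

  list(aligned = aligned, metrics = metrics, plot = p)
}
```

Keeping each stage as a separate exported function lets tests target one stage at a time, which is how tests/testthat/test-run_benchmark.R is described as being organized.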

Review Time Estimate

  • Immediately
  • Within one week
  • When possible

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My change requires a change to the documentation.
  • My name is in the list of CITATION.cff
  • I agree that PEcAn Project may distribute my contribution under any or all of
    • the same license as the existing code,
    • and/or the BSD 3-clause license.
  • I have updated the CHANGELOG.md.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING document.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@anshul23102 anshul23102 changed the title feat(benchmark): add run_benchmark MVP with IO, alignment, metrics, and tests feat(benchmark): add run_benchmark() with dataframe-first pipeline, metrics, and tests Apr 1, 2026
@anshul23102
Contributor Author

Hi @dlebauer, just a quick update on PR #3892: all CI checks are now passing, and I've refactored the API to be dataframe-first (run_benchmark(model_df, obs_df)), which aligns directly with the GSoC proposal I submitted.
The pipeline now has four clean stages (validate, align, compute metrics, plot), each usable independently or through the orchestrator. Would love your feedback whenever you get a chance!
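For instance, the stages can be run independently when only the numbers are needed and no plot is wanted. This is a hypothetical usage sketch; the helper signatures are assumed from the function names in the PR description:

```r
# Assumed signatures: align_by_time(model_df, obs_df), compute_metrics(aligned)
aligned <- align_by_time(bm_validate(model_df), bm_validate(obs_df))
metrics <- compute_metrics(aligned)  # skip plot_time_series() entirely
```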
