|
9 | 9 | - `traitsview`: 43,532 records (traits + yields view) |
10 | 10 | - 15 support tables: species, sites, variables, citations, pfts, priors, etc. |
11 | 11 | * Data quality: |
12 | | - - Excludes [checked = -1] records (failed QA/QC) |
13 | | - - Public data only (`access_level >= 4`) |
14 | | -* Formats: |
15 | | - |
16 | | - - Primary: `.rda` files with xz compression |
17 | | - - Alternative: Parquet files in `inst/extdata/parquet/` |
| 12 | + - Excludes `checked = -1` records (failed QA/QC) |
| 13 | + - Converts `checked = NA` to `checked = 0` (unchecked) |
| 14 | + - Public data only (`access_level == 4`); build enforces this with a hard filter |
| 15 | +* Format: `.rda` files with xz compression (lazy-loaded) |
18 | 16 | * Metadata: |
19 | 17 | - Frictionless `datapackage.json` in `inst/metadata/` |
20 | 18 | - Full roxygen2 documentation for all datasets |
21 | 19 | * Vignettes: |
22 | | - - `orientation`: Getting started guide |
23 | | - - `manuscript`: Reproduction of key GCB Bioenergy figures |
24 | | - - `sql-analogs`: Common SQL queries as dplyr equivalents |
| 20 | + - `getting_started`: Package overview, data model, and key concepts |
| 21 | + - `manuscript`: Reproduction of key GCB bioenergy figures |
| 22 | + - `common_analyses`: Common SQL queries as dplyr equivalents |
25 | 23 | - `pfts-priors`: Working with PFTs and Bayesian priors |
26 | | -* GitHub issue templates for data corrections and verification |
| 24 | +* Source CSVs in `data-raw/csv/` under version control for community corrections |
| 25 | +* `inst/CITATION` for `citation("betydata")` support |
| 26 | +* `inst/STYLE_GUIDE.md` documenting package conventions |
| 27 | +* Data correction workflow: issue templates and PR template for corrections |
| 28 | +* Github actions for R CMD check and quarto site deployment |
0 commit comments