Content extraction: turn PDF, DOCX, XLSX, Markdown, and HTML into markdown text.
| Package | Description |
|---|---|
| @statewalker/content-extractors | PDF/DOCX/XLSX/Markdown/HTML extractors producing markdown text via a mime-aware registry. |
| App | Description |
|---|---|
| indexer-tests | Integration tests for the indexer stack. |
pnpm install
pnpm run build
pnpm run testReleases are managed via changesets:
pnpm changeset # describe the change
pnpm version-packages # roll versions + regenerate CHANGELOGs
pnpm release-packages # publish to npm