podtext

A command-line tool that turns podcast episodes into searchable, structured markdown. It downloads audio from RSS feeds, transcribes it using MLX-Whisper (optimized for Apple Silicon), and enriches the output with AI-generated summaries, topics, keywords, and advertisement detection via Claude.

Installation

Requires Apple Silicon, Python 3.13+, uv and a Anthropic API key.

Install directly from the repository:

uv tool install git+https://github.com/mickume/podtext.git

This installs the podtext command globally (in your active environment) without cloning the repo.

Install from a local clone

git clone https://github.com/mickume/podtext.git
cd podtext
uv venv
source .venv/bin/activate
uv pip install -e .

Usage

Set your API key

export ANTHROPIC_API_KEY="your-api-key-here"

Search for podcasts

Find podcasts by keyword. Results include the title and RSS feed URL.

podtext search "artificial intelligence"
podtext search "tech news" --limit 5       # default: 10 results

List episodes

Show recent episodes from a feed, sorted newest-first. Each episode gets an index number for use with transcribe.

podtext episodes "https://example.com/feed.xml"
podtext episodes "https://example.com/feed.xml" --limit 20

Transcribe episodes

Transcribe one or more episodes by index. Each produces a markdown file with metadata, transcript, and (if an API key is configured) AI analysis.

# Single episode
podtext transcribe "https://example.com/feed.xml" 3

# Multiple episodes in one command (processed sequentially)
podtext transcribe "https://example.com/feed.xml" 1 3 5

# Skip language detection (useful for non-English content)
podtext transcribe "https://example.com/feed.xml" 1 --skip-language-check

Batch runs show progress per episode and a summary at the end. If one episode fails, the rest continue.

Output Format

Each transcribed episode produces a markdown file:

---
title: "Episode Title"
pub_date: "2024-01-15"
podcast: "Podcast Name"
feed_url: "https://example.com/feed.xml"
media_url: "https://example.com/episode.mp3"
topics:
  - "Topic: brief description"
keywords:
  - keyword1
  - keyword2
---

## Summary

AI-generated summary...

## Show Notes

Show notes from the RSS feed (converted from HTML)...

## Transcription

Full transcript with paragraph formatting...

When AI analysis is unavailable (no API key or service issues), the file still contains the transcript and show notes -- the Summary section and metadata enrichment are simply omitted.

Detected advertisements are replaced with [ADVERTISEMENT WAS REMOVED] markers in the transcript.

Configuration

Podtext uses a TOML configuration file. On first run, it creates .podtext/config in the current directory with defaults. You can also place a global config at ~/.podtext/config -- the local file takes precedence.

API key

Set via environment variable:

export ANTHROPIC_API_KEY="your-key-here"

Full configuration reference

[storage]
media_dir = ".podtext/downloads/"     # Where downloaded audio is stored
output_dir = ".podtext/output/"       # Where markdown output is saved
temp_storage = false                  # Delete audio files after transcription

[whisper]
model = "base"                        # tiny, base, small, medium, large

Larger Whisper models produce more accurate transcriptions but are slower and use more memory. The base model is a good default for most podcasts.

Prompt customization

The AI analysis prompts can be customized by editing .podtext/prompts.md (or ~/.podtext/prompts.md). This file is auto-created on first run with defaults. It contains four sections that you can edit:

Advertisement Detection -- how ads are identified in the transcript
Content Summary -- how the episode summary is generated
Topic Extraction -- how topics are identified and described
Keyword Extraction -- how keywords are selected

If the file is missing or malformed, built-in defaults are used automatically.

Development

# Install with dev dependencies
uv sync --all-extras

# Run tests
uv run pytest

# Type checking
uv run mypy src/

# Linting
uv run ruff check src/ tests/

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.agent-fox		.agent-fox
.claude		.claude
.podtext		.podtext
src/podtext		src/podtext
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
prd.md		prd.md
pyproject.toml		pyproject.toml
stuff.md		stuff.md
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

podtext

Installation

Install from a local clone

Usage

Set your API key

Search for podcasts

List episodes

Transcribe episodes

Output Format

Configuration

API key

Full configuration reference

Prompt customization

Development

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

podtext

Installation

Install from a local clone

Usage

Set your API key

Search for podcasts

List episodes

Transcribe episodes

Output Format

Configuration

API key

Full configuration reference

Prompt customization

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages