Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Updated Apr 6, 2026 - Python
Transforms complex documents like PDFs into LLM-ready Markdown/JSON for your agentic workflows.
A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
MinerU one-click launch bundle that requires no installation or deployment.
🎨 Display system theme colors and their references easily with this Lua Plugin for GrandMA3, simplifying your color selection process.
PDF table extraction for RAG — convert to clean HTML. Fast, local, no GPU.
A small web app that finds relevant documents and produces query-focused summaries using Gemini. Supports PDF upload with one-time multimodal preprocessing into per-page Markdown + metadata.
🔄 Optimize model loading in ComfyUI with flexible node connections and controlled sequences for better performance and memory management.
🎨 Enhance video generation by syncing audio to visuals with ComfyUI-PainterAI2V. Create precise lip-syncing and seamless transitions using dual model workflows.
Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.
🖼️ Segment characters in images with ComfyUI using a Vision LLM agent, enhancing your projects with detailed and high-quality masks.
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs into Markdown and JSON formats.
🎶 Generate multilingual AI music with lyrics in English, Chinese, Japanese, Korean, and Spanish using ComfyUI's HeartMuLa model.
💡 Control 3D lighting angles effortlessly with ComfyUI - adjust direction, intensity, and color while relighting images in real-time.
Parse JSON quickly using a fast, recursive-descent parser designed for lightweight integration in C++ projects.
🤖 Process SCAIL-pose data with ComfyUI nodes, utilizing VitPose for accurate face and hand detection in an efficient, streamlined setup.
UE5 Server Emulator 2026 🎮 | Python Lyra Client & Replication Tools
📝 Manage your projects and notes locally with Ironpad, a file-based system that keeps your data safe in Markdown format without cloud reliance.
🎨 Build interactive Blazor applications with A2UI, a secure and portable protocol for rich UI rendered natively across platforms without code execution risks.
Implements Unreal Engine 5 network protocol in Python to connect, authenticate, and replicate actors with UE5 Lyra Starter Game servers.