A simple RAG Chat Bot with premade ingestion and execution scripts
-
Embedding = hf.co/CompendiumLabs/bge-base-en-v1.5-gguf
-
LLM = hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF
The following resources were helpful in building and understanding this project:
-
NVIDIA Developer Blog Finding the best chunking strategy for accurate AI responses https://developer.nvidia.com/blog/finding-the-best-chunking-strategy-for-accurate-ai-responses/
-
Medium (DevChris01) 3D embedding visualization with Python and ChromaDB https://medium.com/@DevChris01/3d-embedding-visualization-with-python-and-chromadb-8189f696c8a8
-
Hugging Face Demo (ngxson) Simple RAG implementation example https://huggingface.co/ngxson/demo_simple_rag_py/raw/main/demo.py
-
Hugging Face Cookbook RAG evaluation techniques and best practices https://huggingface.co/learn/cookbook/rag_evaluation