This is a production-style Retrieval-Augmented Generation (RAG) chatbot using 100% free & open-source tools.
- LLM: Ollama (LLaMA3 / Mistral) – FREE, local
- Embeddings: SentenceTransformers
- Vector DB: FAISS
- Backend: FastAPI
- Frontend: Streamlit
- MLOps: MLflow
- Orchestration: LangChain
- Containerization: Docker (optional)
curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3python -m venv venv
source venv/bin/activate # Linux/Mac
venv\Scripts\activate # Windowspip install -r requirements.txtPut your PDFs or TXTs inside data/ and run:
python ingest.pyuvicorn app.main:app --reloadAPI available at:
http://127.0.0.1:8000/docs
streamlit run ui/chat_ui.pymlflow uiOpen:
http://127.0.0.1:5000
rag_chatbot_project/
│── app/
│ ├── main.py
│ ├── rag.py
│── data/
│── vectorstore/
│── ui/
│ ├── chat_ui.py
│── ingest.py
│── requirements.txt
│── Dockerfile
- Internship / Job Portfolio
- Startup MVP
- Research demo
- Production extension (Auth, Redis, CI/CD)