An offline RAG (Retrieval-Augmented Generation) chatbot using FAISS for document retrieval and Microsoft Phi3 for response generation. Easily integrate via Flask API for private, fast Q&A from indexed documents. Includes a Jupyter notebook for setup, making it ideal for localized FAQ, customer support, and knowledge base applications.
- FAISS-powered Document Retrieval: Quickly indexes and searches through documents.
- Local LLM Integration: Uses Microsoft Phi3 for offline, context-aware responses.
- Prompt Engineering: Tailored prompts ensure relevant, concise answers.
- Flask API: Simple integration with other applications.
- Setup: Create a Conda environment and install dependencies (
faiss-cpu,flask,torch, etc.). - Document Indexing: Load documents with FAISS to enable retrieval.
- Run: Start the chatbot via the Flask server and begin querying.
Ideal for customer support, technical knowledge bases, and internal FAQs.
For more detailed instructions, refer to the included Jupyter notebook.