
---
title: Adaptive RAG Chat
emoji: 🧠
colorFrom: yellow
colorTo: red
sdk: streamlit
app_file: app.py
pinned: false
---

🚀 Adaptive-RAG – Intelligent Retrieval-Augmented Generation System


An adaptive Retrieval-Augmented Generation (RAG) system that dynamically routes queries between vector search and web search for accurate, grounded answers.

✨ Features • 🏗️ Architecture • 🚀 Quick Start • 🔬 How It Works • 📂 Project Structure


🎯 Overview

Adaptive-RAG is a production-oriented RAG pipeline built using LangChain and LangGraph that intelligently decides how to retrieve information before generating an answer.

Instead of relying on a single data source, the system:

  • Routes questions to a local FAISS vectorstore when internal knowledge is sufficient
  • Falls back to web search (Tavily) when information is missing or outdated
  • Grades retrieved documents and generated answers
  • Automatically rewrites queries when retrieval quality is poor

The result is answers that are more accurate, better grounded, and less prone to hallucination.


💡 Why I Built This

The Problem: Traditional RAG systems blindly retrieve documents from a vectorstore, even when:

  • The question is out of domain
  • Knowledge is outdated
  • Retrieval quality is poor

This leads to hallucinations or incomplete answers.

My Goal: Build a self-correcting RAG pipeline that can:

  • Decide where to retrieve from
  • Judge how good the retrieval is
  • Improve itself by rewriting queries when needed

Key Learning: LangGraph is ideal for building conditional, feedback-driven RAG workflows instead of linear chains.


✨ Key Features

  • Adaptive Routing: Automatically selects between vectorstore retrieval and web search.

  • Retrieval Grading: Evaluates whether retrieved documents are relevant to the query.

  • Hallucination Detection: Grades generated answers against retrieved context.

  • Query Rewriting: Reformulates weak questions to improve retrieval quality.

  • Graph-based Control Flow: Uses LangGraph for explicit decision-making and retry loops.

  • Interactive UI: Streamlit app for real-time querying and visualization.


๐Ÿ—๏ธ Architecture

High-Level Workflow

User Query
   │
   ▼
Router Node
(Vectorstore or Web?)
   │
   ├──► Vectorstore Retrieval (FAISS)
   │
   └──► Web Search (Tavily)
           │
           ▼
Document Grader
   │
   ├── Relevant → Answer Generator
   │
   └── Not Relevant → Query Rewriter
                            │
                            └── Loop Back to Router

LangGraph Advantage

LangGraph enables:

  • Conditional edges
  • Feedback loops
  • Explicit state transitions
  • Clean separation of logic

🔬 How It Works

1. Routing

A router node decides whether the query should go to:

  • Vectorstore (domain-specific questions)
  • Web Search (open-ended or current topics)
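In the repo this decision is made by an LLM; a keyword-based stand-in (with invented term lists) shows the shape of the decision:

```python
# Illustrative stand-in for the router node. The repo uses an LLM to make
# this decision; the keyword lists below are invented for demonstration.
import re

DOMAIN_TERMS = {"agent", "prompt", "rag", "embedding", "vectorstore"}
FRESHNESS_TERMS = {"today", "latest", "news", "current"}

def route_question(question: str) -> str:
    words = set(re.findall(r"[a-z0-9]+", question.lower()))
    if words & FRESHNESS_TERMS:
        return "web_search"      # current events -> Tavily
    if words & DOMAIN_TERMS:
        return "vectorstore"     # in-domain -> FAISS retrieval
    return "web_search"          # default to the open web

print(route_question("what is an embedding?"))  # -> vectorstore
print(route_question("latest AI news"))         # -> web_search
```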

2. Retrieval

  • Vectorstore uses FAISS embeddings
  • Web search uses Tavily API
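Vectorstore retrieval boils down to nearest-neighbor search over embeddings. A toy cosine-similarity version (hand-made 3-d vectors standing in for FAISS and the real embedding model) illustrates the idea:

```python
# Toy dense-retrieval sketch: cosine similarity over hand-made vectors,
# standing in for the FAISS index (real embeddings come from the embedding
# model the repo configures; these 3-d vectors are invented).
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

index = {
    "doc_langgraph": [0.9, 0.1, 0.0],
    "doc_faiss":     [0.1, 0.9, 0.0],
    "doc_tavily":    [0.0, 0.1, 0.9],
}

def retrieve(query_vec, k=1):
    # Rank documents by similarity to the query vector, return the top k.
    ranked = sorted(index, key=lambda d: cosine(query_vec, index[d]), reverse=True)
    return ranked[:k]

print(retrieve([0.8, 0.2, 0.1]))  # -> ['doc_langgraph']
```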

3. Grading

LLM-based graders check:

  • Document relevance
  • Answer grounding
  • Hallucination risk
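The graders in the repo are LLM judges; crude heuristic stand-ins (token overlap for relevance, "few new words" for grounding, with invented thresholds) make the contract concrete:

```python
# Stand-ins for the LLM graders: relevance via token overlap and a crude
# grounding check. The repo uses LLM judges for both; the thresholds here
# are invented for illustration.
import re

def tokens(text):
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def grade_relevance(question, doc, threshold=0.2):
    # Relevant if enough question tokens appear in the document.
    q = tokens(question)
    return len(q & tokens(doc)) / max(len(q), 1) >= threshold

def grade_grounding(answer, context, max_new_tokens=2):
    # An answer that introduces many words absent from the retrieved
    # context is treated as a hallucination risk.
    return len(tokens(answer) - tokens(context)) <= max_new_tokens

print(grade_relevance("what is adaptive rag", "Adaptive RAG routes queries"))  # True
```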

4. Query Rewriting

If retrieval is weak, the query is rewritten and re-routed automatically.
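The real rewriter is an LLM prompt; a sketch with a placeholder rewrite (appending a clarifying hint) makes the retry loop visible:

```python
# Sketch of the rewrite-and-retry loop. The real rewriter is an LLM prompt;
# here a placeholder rewrite appends a clarifying hint so the loop is visible.

def rewrite(question: str) -> str:
    return f"{question} (in the context of retrieval-augmented generation)"

def answer_with_retries(question, retrieve, is_good, max_retries=2):
    for _ in range(max_retries):
        docs = retrieve(question)
        if is_good(docs):
            return question, docs
        question = rewrite(question)      # reformulate, then re-route
    return question, retrieve(question)   # best effort after retries

q, docs = answer_with_retries(
    "grading",
    retrieve=lambda q: [q],                        # echo retriever, for illustration
    is_good=lambda docs: "generation" in docs[0],  # toy quality check
)
print(q)
```

Bounding the retries matters: without `max_retries` a persistently bad query would loop between the rewriter and the router forever.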


๐Ÿ› ๏ธ Tech Stack

Component       Technology
--------------  -----------------
Orchestration   LangGraph
RAG Framework   LangChain
Vector Store    FAISS
Web Search      Tavily
LLM             Groq (Llama 3.3)
UI              Streamlit
Language        Python

🚀 Quick Start

Prerequisites

  • Python 3.10+
  • Groq API key
  • Tavily API key

Environment Setup

Create a .env file:

GROQ_API_KEY=your_groq_key
TAVILY_API_KEY=your_tavily_key
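The app reads these keys at startup. A minimal fail-fast sketch (the variable names match the `.env` above; the helper and its error message are illustrative, not code from the repo):

```python
# Fail-fast loading of the two required API keys from the environment.
# (With python-dotenv installed, load_dotenv() would populate os.environ
# from the .env file first; that call is omitted here to stay stdlib-only.)
import os

def require_key(name: str) -> str:
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"Missing required environment variable: {name}")
    return value

# groq_key = require_key("GROQ_API_KEY")
# tavily_key = require_key("TAVILY_API_KEY")
```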

Install Dependencies

pip install -r requirements.txt

Run the App

streamlit run app.py

On first run:

  • A FAISS index is built
  • The index is cached under data/index/faiss_index
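The caching follows a standard build-or-load pattern. A stdlib sketch (pickle standing in for FAISS's own save/load; the path and builder are placeholders):

```python
# Build-or-load caching pattern used for the vectorstore: reuse the index
# on disk if present, otherwise build it once and persist it. Pickle is a
# stand-in for FAISS's native save/load; path and builder are placeholders.
import os
import pickle

def load_or_build_index(path, build):
    if os.path.exists(path):
        with open(path, "rb") as f:
            return pickle.load(f)        # warm start: reuse the cached index
    index = build()                      # first run: build from documents
    os.makedirs(os.path.dirname(path), exist_ok=True)
    with open(path, "wb") as f:
        pickle.dump(index, f)
    return index
```

The effect is exactly what the bullets above describe: the expensive build happens only on the first run, and later runs start instantly from the cache.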

📂 Project Structure (this repo)

.
├── app.py                     # Streamlit UI that invokes the LangGraph workflow
├── requirements.txt           # Python dependencies
├── src/
│   ├── graphs/graph_builder.py  # FAISS index + retriever setup
│   ├── llms/llm.py              # RAG prompt and LLM chain
│   ├── nodes/node_implementation.py # Router, retrieve, web_search, graders, transform
│   └── states/state.py          # Graph state + compile (app)
└── data/faiss_index/          # Vectorstore cache (created at runtime)

🚀 Deploy to Hugging Face Spaces

  1. Create a new Space: choose SDK "Streamlit".
  2. Push this repository to the Space (or connect via GitHub).
  3. In the Space Settings → Secrets, add:
     • GROQ_API_KEY: your Groq key
     • TAVILY_API_KEY: your Tavily key
  4. The app auto-builds using requirements.txt and runs app.py.
  5. Optional: users can also paste keys in the sidebar if Secrets are not set.

Notes:

  • First run may build a FAISS index and cache under src/data/faiss_index/.
  • If web search is disabled (missing Tavily key), queries will route to the vectorstore.

📊 What Makes This Project Strong

  • Not a basic RAG demo
  • Uses decision-making and self-correction
  • Demonstrates real LangGraph value
  • Clean separation of concerns
  • Directly aligned with industry GenAI systems

This is the kind of RAG system used in:

  • Enterprise knowledge assistants
  • AI support bots
  • Research copilots
  • Internal search tools

🔮 Future Improvements

  • Multi-vector routing (code, docs, FAQs)
  • Tool-augmented RAG
  • Caching and latency optimization
  • Confidence-based answer refusal
  • Evaluation dashboards

👨‍💻 Author

Vivek Kumar Gupta
AI Engineering Student | GenAI & Agentic Systems Builder


📄 License

MIT License © 2025 Vivek Kumar Gupta
