Hybrid RAG Agent - Clean Architecture

Modern and modular RAG (Retrieval-Augmented Generation) system designed under Clean Architecture principles. This system enables intelligent document retrieval with total independence from infrastructure providers (Database, LLM, or Embeddings).

🏛️ Architecture: Clean RAG Design

This project implements a decoupled architecture where business logic resides in the core, protected from changes in external services.

Design Principles

Provider Independence: Easily switch between MongoDB, Supabase, PostgreSQL, or any other DB by implementing its interface.
AI Abstraction: Support for multiple LLM and Embeddings providers (OpenAI, Anthropic, Local).
Testability: RAG logic verifiable without external connections.
CLI-First: Powerful terminal interface designed for technical workflows.

Layer Structure

Domain (Core): Pure schemas (Document, Chunk) and abstract interfaces (IRepository, ILLMProvider, IParser, IChunker).
Application (Services): AgentService (agentic entry point), RAGService (hybrid search), IngestService (modular ingestion).
Infrastructure: Concrete implementations (currently includes MongoDB, Supabase, and OpenAI).
Endpoints: User interface via CLI (Rich).

Architecture Diagram (C4 Clean Design)

flowchart TB
    User((User))

    subgraph Endpoints["Endpoints Layer"]
        CLI[CLI Rich]
    end

    subgraph Services["Application Layer"]
        Agent[AgentService]
        RAG[RAGService]
        Ingest[IngestService]
    end

    subgraph Core["Domain Layer - Interfaces"]
        direction LR
        IRepo([IRepository])
        ILLM([ILLMProvider])
        IEmb([IEmbedder])
        IPar([IParser])
        IChu([IChunker])
    end

    subgraph Infra["Infrastructure Layer"]
        direction TB
        subgraph DBs["Database Providers"]
            Mongo[(MongoRepository)]
            Supa[(SupabaseRepository)]
        end
        subgraph AI["AI Providers"]
            OAILLM[OpenAIProvider]
            OAIEmb[OpenAIEmbedder]
        end
        subgraph Ingestion["Ingestion Implementation"]
            DPar[DoclingParser]
            DChu[DoclingChunker]
        end
    end

    User --> CLI
    CLI --> Agent
    Agent --> RAG
    CLI --> Ingest

    RAG -.-> IRepo
    RAG -.-> ILLM
    Ingest -.-> IRepo
    Ingest -.-> IEmb
    Ingest -.-> IPar
    Ingest -.-> IChu

    IRepo -.-> Mongo
    IRepo -.-> Supa
    ILLM -.-> OAILLM
    IEmb -.-> OAIEmb
    IPar -.-> DPar
    IChu -.-> DChu

    %% Layer styling
    style Endpoints fill:#2d3436,stroke:#636e72,color:#dfe6e9
    style Services fill:#0984e3,stroke:#74b9ff,color:#fff
    style Core fill:#6c5ce7,stroke:#a29bfe,color:#fff
    style Infra fill:#00b894,stroke:#55efc4,color:#fff
    style DBs fill:#00cec9,stroke:#81ecec,color:#2d3436
    style AI fill:#e17055,stroke:#fab1a0,color:#fff
    style Ingestion fill:#fdcb6e,stroke:#f39c12,color:#2d3436

📂 Project Organization

src/
├── core/
│   ├── schemas/        # Base models: Document, Chunk, SearchHit
│   ├── dtos/           # Data Transfer Objects
│   └── interfaces/     # Abstract contracts (IRepository, IParser, etc.)
├── services/           # Business logic (Agent, RAG, Ingestion)
├── infrastructure/     # Concrete provider implementations (Supabase, OpenAI, Docling)
└── endpoints/          # Input adapters (CLI)

🚀 Quick Start Guide

1. Installation

Requires Python 3.10+ and UV Package Manager.

git clone https://github.com/FullFran/Hybrid-RAG-example.git
cd Hybrid-RAG-example
uv venv && uv sync

2. Configuration

Copy .env.example to .env and set your environment variables (Supabase URL/Key, OpenAI API Key, etc.).

3. Usage

# Ingest documents
uv run python -m src.endpoints.cli.ingest -d ./documents

# Start the intelligent chat
uv run python -m src.endpoints.cli.main

🛠️ Extensibility

Thanks to the clean architecture, adding a new database provider is as simple as:

Create a new class in src/infrastructure/database/.
Implement the IRepository interface.
Inject it into the service during application bootstrap.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
.agent		.agent
.agents		.agents
.claude		.claude
.github/workflows		.github/workflows
docs		docs
documents		documents
examples		examples
migrations		migrations
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
GEMINI.md		GEMINI.md
README.md		README.md
debug_db.py		debug_db.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hybrid RAG Agent - Clean Architecture

🏛️ Architecture: Clean RAG Design

Design Principles

Layer Structure

Architecture Diagram (C4 Clean Design)

📂 Project Organization

🚀 Quick Start Guide

1. Installation

2. Configuration

3. Usage

🛠️ Extensibility

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

FullFran/Hybrid-RAG-example

Folders and files

Latest commit

History

Repository files navigation

Hybrid RAG Agent - Clean Architecture

🏛️ Architecture: Clean RAG Design

Design Principles

Layer Structure

Architecture Diagram (C4 Clean Design)

📂 Project Organization

🚀 Quick Start Guide

1. Installation

2. Configuration

3. Usage

🛠️ Extensibility

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages