The Intelligent Brain behind NutriPlan. A high-performance, context-aware AI backend orchestrating interactions between users, nutritional data, and Large Language Models.
The NutriPlan LLM Server is a sophisticated AI microservice built with FastAPI. It acts as the cognitive engine for the NutriPlan ecosystem, enabling:
- 🗣️ Conversational Meal Planning: A state-of-the-art chat agent that understands dietary constraints, preferences, and nutritional goals.
- 🔍 Hybrid Search Engine: Merging Vector Search (Semantic) with Keyword Search (Precision) to find the perfect food items.
- 📚 Intelligent Manual Search: RAG-based support system to answer user questions about the application itself.
🔗 Live API Endpoint: https://star-terribly-sturgeon.ngrok-free.app/
📄 Interactive Docs: https://star-terribly-sturgeon.ngrok-free.app/docs
| Feature | Description |
|---|---|
| RAG Architecture | Retrieves real-time data from MongoDB Vector Stores to ground AI responses in fact, reducing hallucinations. |
| Hybrid Search | Combines Vector Search + MongoDB Filters + Cross-Encoder Reranking for top-tier relevance. |
| Streaming API | Full Server-Sent Events (SSE) support for ChatGPT-like real-time typing effects. |
| Adaptive Logic | Automatically switches between general chat fallback, specific tool execution, and database queries. |
| Personalized Context | Uses JWT-based authorization to tailor advice to the specific user's health profile. |
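As a rough sketch of how such a streaming route can be built in FastAPI (illustrative only; the route body, generator, and payload handling here are assumptions, not the actual llm_server.py implementation):

```python
# Illustrative SSE streaming route; not the actual llm_server.py code.
from fastapi import FastAPI
from fastapi.responses import StreamingResponse

app = FastAPI()

async def token_stream(prompt: str):
    # A real implementation would stream tokens from the LLM provider here.
    for token in ["Try", " a", " salmon", " and", " avocado", " salad."]:
        yield f"data: {token}\n\n"  # SSE frame: "data: <payload>\n\n"
    yield "data: [DONE]\n\n"        # sentinel marking end of stream

@app.post("/ai/chat")
async def chat(payload: dict):
    prompt = payload["messages"][-1]["content"]
    return StreamingResponse(token_stream(prompt), media_type="text/event-stream")
```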
The central nervous system of the server. It manages the lifecycle of:
- MealPlannerAgent: The reasoning engine communicating with the LLM provider (vLLM/OpenAI).
- FoodRAGPipeline: The specialized search subsystem for nutritional data.
- UserManualRAGPipeline: The support subsystem for app documentation.
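Conceptually, the orchestration looks like the sketch below. Only the three component names come from this repo; every constructor, method, and signature in the sketch is an assumption for illustration:

```python
# Conceptual wiring of the pipeline; all signatures below are assumed.
class MealPlannerAgent:                 # reasoning engine (vLLM/OpenAI)
    async def respond(self, messages, context): ...

class FoodRAGPipeline:                  # nutritional-data search subsystem
    def search(self, query, k=5): ...

class UserManualRAGPipeline:            # app-documentation search subsystem
    def search(self, query, k=3): ...

class Pipeline:
    """Owns the lifecycle of the agent and both RAG subsystems."""
    def __init__(self):
        self.agent = MealPlannerAgent()
        self.food_rag = FoodRAGPipeline()
        self.manual_rag = UserManualRAGPipeline()

    async def handle_chat(self, messages: list[dict]):
        query = messages[-1]["content"]
        context = self.food_rag.search(query)   # ground the reply in food data
        return await self.agent.respond(messages, context)
```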
We don't just "search keywords". We understand intent.
```mermaid
graph LR
    A[User Query] --> B(Embedding Model)
    B --> C{Vector Index}
    C -->|Top K Candidates| D[MongoDB Atlas]
    D -->|Initial Results| E[Cross-Encoder Reranker]
    E -->|Scored & Sorted| F[Final Response]
```
- Embed: Query converted to 384-dimensional vector.
- Retrieve: Semantic matches are found across millions of records.
- Filter: Hard constraints (e.g., "Gluten-Free") applied immediately.
- Rerank: A second, more powerful model re-reads the top results to ensure the best one is first.
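As a concrete sketch of the embed and rerank steps with sentence-transformers (the model names are common open-source defaults and an assumption, not confirmed from this codebase):

```python
# Sketch of steps 1 and 4 of the flow above; model names are assumptions.
from sentence_transformers import SentenceTransformer, CrossEncoder

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # emits 384-dim vectors
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

query = "low carb pasta alternatives"
query_vector = embedder.encode(query)  # step 1: embed (sent to Atlas in the real flow)

# Steps 2-3 (retrieve + filter) run inside MongoDB Atlas; pretend these
# are the candidate texts it returned.
candidates = ["zucchini noodles", "shirataki noodles", "whole wheat pasta"]

# Step 4: the cross-encoder re-reads each (query, candidate) pair and scores it.
scores = reranker.predict([(query, c) for c in candidates])
ranked = sorted(zip(candidates, scores), key=lambda p: p[1], reverse=True)
print(ranked[0][0])  # the best match comes first
```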
```text
llm-server/
├── 📄 llm_server.py           # 🚀 Entry Point: FastAPI App & Routes
├── 📄 .env.example            # 🔐 Config Template
├── 📦 requirements.txt        # 🐍 Dependencies
├── 📂 llm_agent/              # 🧠 Core Logic
│   ├── 📄 agent.py            # - Agent Reasoning & State Management
│   ├── 📄 pipeline.py         # - Master Orchestrator
│   ├── 📂 rag/                # - Retrieval Augmented Generation
│   │   ├── 📄 food_rag.py     # * Food Search Engine
│   │   └── 📄 manual_rag.py   # * Documentation Search Engine
│   ├── 📂 tools/              # - Functional Tools (API Calls)
│   └── 📂 utils/              # - Helpers (Logging, Prompts)
└── 📂 dataset/                # 💾 Static Data Resources
```

To run the server you will need:

- Python 3.9+
- MongoDB Atlas (Required for Vector Search)
- ngrok (For exposing the server publicly)
This project relies on Vector Search, which requires a specific setup in MongoDB Atlas.
- Create Cluster: Go to MongoDB Atlas and create a new free cluster.
- Create Database: Create a database named `nutriplan`.
  - Collection 1: `foods` (import your food data here).
  - Collection 2: `llm_documents` (import manual/help docs here).
- Create Vector Search Index:
  - Navigate to the Atlas Search tab > Vector Search.
  - Click Create Search Index.
  - JSON Editor Configuration:

```json
{
  "fields": [
    { "type": "vector", "path": "embedding", "numDimensions": 384, "similarity": "cosine" },
    { "type": "filter", "path": "nutrition.calories" },
    { "type": "filter", "path": "nutrition.proteins" },
    { "type": "filter", "path": "nutrition.carbs" },
    { "type": "filter", "path": "nutrition.fats" },
    { "type": "filter", "path": "property.isBreakfast" },
    { "type": "filter", "path": "property.isLunch" },
    { "type": "filter", "path": "property.isDinner" },
    { "type": "filter", "path": "property.isSnack" }
  ]
}
```

Note: Ensure your data objects have an `embedding` field containing the vector array.
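With the index in place, a retrieval query looks roughly like the pymongo aggregation below; the index name `vector_index` and the connection placeholders are assumptions:

```python
# Sketch of a $vectorSearch query against the index above.
# "vector_index" is an assumed name; use whatever you named your index.
from pymongo import MongoClient

client = MongoClient("mongodb+srv://<user>:<password>@<cluster>/")
foods = client["nutriplan"]["foods"]

query_vector = [0.0] * 384  # in practice: the 384-dim embedding of the user query

results = foods.aggregate([
    {"$vectorSearch": {
        "index": "vector_index",
        "path": "embedding",
        "queryVector": query_vector,
        "numCandidates": 100,
        "limit": 5,
        # hard constraints use the filter paths declared in the index
        "filter": {"property.isDinner": True},
    }},
    {"$project": {"name": 1, "nutrition": 1,
                  "score": {"$meta": "vectorSearchScore"}}},
])
for doc in results:
    print(doc)
```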
```bash
# Clone the repository
git clone <your-repo-url>
cd llm-server

# Virtual Env (Recommended)
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```

To allow the frontend or external services to access your local LLM server, you need to expose it.
- Install ngrok: Download here.
- Start ngrok:

  ```bash
  ngrok http 8000
  ```

- Copy the URL: You will get a forwarding URL like `https://star-terribly-sturgeon.ngrok-free.app`.
- Update Config: Use this URL in your frontend `.env` and this server's `.env` (if needed for callbacks).
```bash
uvicorn llm_server:app --host 0.0.0.0 --port 8000 --reload
```

Base URL: https://star-terribly-sturgeon.ngrok-free.app
POST /ai/chat
Streamed conversation endpoint.
```json
// Request
{
  "messages": [{"role": "user", "content": "I need a keto dinner idea."}],
  "stream": true
}
```
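A client can consume the stream with plain requests; this is a minimal sketch that assumes the server emits standard `data:` SSE frames (the exact frame payload format is an assumption):

```python
# Hypothetical SSE client for /ai/chat; assumes "data: ..." frames.
import requests

resp = requests.post(
    "https://star-terribly-sturgeon.ngrok-free.app/ai/chat",
    json={
        "messages": [{"role": "user", "content": "I need a keto dinner idea."}],
        "stream": True,
    },
    stream=True,  # keep the connection open and read the body incrementally
)
for line in resp.iter_lines(decode_unicode=True):
    if line and line.startswith("data: "):
        print(line[len("data: "):], end="", flush=True)  # print tokens as they arrive
```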
POST /search/food

Semantic search for ingredients/meals.
```json
// Request
{
  "query": "low carb pasta alternatives",
  "k": 5
}
```
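Calling the search endpoints is a plain JSON POST; in this sketch the shape of the response (a list of matching food documents) is an assumption:

```python
# Minimal client for /search/food; the response schema shown is assumed.
import requests

resp = requests.post(
    "https://star-terribly-sturgeon.ngrok-free.app/search/food",
    json={"query": "low carb pasta alternatives", "k": 5},
)
resp.raise_for_status()
for hit in resp.json():  # assumed: list of matching food documents
    print(hit)
```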
POST /search/manual

Find help topics.
```json
// Request
{
  "query": "how do I reset my password?",
  "k": 3
}
```