🤖 Manoo — Offline RAG-Powered Humanoid Assistant for Xibotix

Manoo is an offline, domain-specific humanoid assistant designed for Xibotix Private Limited. The system combines local large language model (LLM) inference via llama.cpp with Retrieval-Augmented Generation (RAG) to deliver concise, safety-aware answers about Xibotix, its founders, and its robotic hand–wrist rehabilitation devices (Gyrosphere, ExoFist, ExoCarp).

The project emphasizes edge deployment, low latency, privacy, and controlled generation, making it suitable for real-world humanoid and robotics demonstrations.

✨ Key Features

Fully offline LLM inference using llama.cpp
Lightweight Model: Qwen2.5-1.5B Instruct (GGUF Q4_K_M) optimized for edge devices
RAG Architecture: Retrieval-Augmented Generation using ChromaDB
Safety-First: Domain-restricted system prompt for factual consistency
API: OpenAI-compatible REST API via llama.cpp server
Metrics: Real-time latency measurement (retrieval, generation, end-to-end)
Integration Ready: Designed for humanoid robots and embedded systems

🧠 System Architecture

User Query
   |
   v
Python Client (rag_ollama.py)
   |
   +--> ChromaDB (semantic retrieval)
   |
   v
Prompt Assembly
(System Prompt + Retrieved Context + User Query)
   |
   v
llama.cpp Server (Qwen2.5-1.5B GGUF, Metal acceleration)
   |
   v
Generated Response
   |
   v
Latency Metrics + Final Answer

🛠 Technology Stack

LLM & Inference

llama.cpp (Metal backend on Apple Silicon)
Qwen2.5-1.5B Instruct (GGUF Q4_K_M)

Retrieval

ChromaDB (persistent vector database)
SentenceTransformers (all-MiniLM-L6-v2)

Backend

Python
Requests (OpenAI-style API client)

📁 Project Structure

manu_ai_offline/
│
├── rag_ollama.py          # Main RAG pipeline
├── xibotix_db/            # ChromaDB persistent store
├── llama.cpp/             # llama.cpp build + models
└── README.md

🚀 Quick Start

Prerequisites

Python 3.9+
Apple Silicon Mac (for Metal acceleration)
llama.cpp built with Metal support
Git LFS (for large model files)

Setup

[git clone https://github.com/your-repo/manu_ai_offline.git](https://github.com/TaherPanbiharwala/xibotix.git)
cd manu_ai_offline

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Start llama.cpp Server

cd llama.cpp/build
./bin/llama-server \
  -m ../models/qwen2.5-1.5b-instruct-q4_k_m.gguf \
  -ngl 99 \
  --port 8080

Run RAG Client

python rag_ollama.py "What is Gyrosphere?"

🔒 Safety Design

Manoo enforces strict guardrails through its system prompt:

No medical diagnosis or prescriptions.
No device parameter recommendations.
Mandatory clinician referral for therapy decisions.
Domain restriction to Xibotix and rehabilitation devices.
No speculative clinical claims.

These rules ensure patient-safe, investor-ready responses.

🎯 Use Cases

Humanoid assistant demonstrations
Rehab device explanation kiosks
Investor presentations
Patient-friendly educational interfaces
Edge AI benchmarking
RAG experimentation on Apple Silicon

📊 Performance Notes

Context Window: 4096 tokens
Hardware: Metal GPU acceleration enabled
Optimization: Prompt caching active in llama.cpp
Latency: ~1.5–4s on Apple M1 (Q4_K_M)

⚠️ Limitations

Text-only interaction
Context window limited to 4096 tokens
Knowledge restricted to indexed documents
No physical robot control yet

🔮 Future Work

Multimodal input (speech + vision)
On-device speech synthesis
Adaptive context sizing for latency optimization
Expanded RAG knowledge base
Integration with physical humanoid control systems

👨‍💻 Author

Taher Panbiharwala

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.vscode		.vscode
manu_ai_offline		manu_ai_offline
.gitignore		.gitignore
README.md		README.md
dataset.json		dataset.json
manoo.py		manoo.py
prompt_gen.py		prompt_gen.py
prompt_response.jsonl		prompt_response.jsonl
prompts.jsonl		prompts.jsonl
prompts_fl.jsonl		prompts_fl.jsonl
requirements.txt		requirements.txt
shuffle_jsonl.py		shuffle_jsonl.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Manoo — Offline RAG-Powered Humanoid Assistant for Xibotix

✨ Key Features

🧠 System Architecture

🛠 Technology Stack

LLM & Inference

Retrieval

Backend

📁 Project Structure

🚀 Quick Start

Prerequisites

Setup

Start llama.cpp Server

Run RAG Client

🔒 Safety Design

🎯 Use Cases

📊 Performance Notes

⚠️ Limitations

🔮 Future Work

👨‍💻 Author

About

Uh oh!

Releases

Packages

Languages

TaherPanbiharwala/xibotix

Folders and files

Latest commit

History

Repository files navigation

🤖 Manoo — Offline RAG-Powered Humanoid Assistant for Xibotix

✨ Key Features

🧠 System Architecture

🛠 Technology Stack

LLM & Inference

Retrieval

Backend

📁 Project Structure

🚀 Quick Start

Prerequisites

Setup

Start llama.cpp Server

Run RAG Client

🔒 Safety Design

🎯 Use Cases

📊 Performance Notes

⚠️ Limitations

🔮 Future Work

👨‍💻 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages