Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a demonstration application capable of full-text search, eDiscovery, and information governance.
Updated May 28, 2024
RAG-Ingest: A tool for converting PDFs to markdown and indexing them for enhanced Retrieval Augmented Generation (RAG) capabilities.
Self-hosted RAG engine for AI coding assistants. Ingests technical docs & code repositories locally with structure-aware chunking. Serves grounded context via MCP to prevent hallucinations in software development workflows.
A simple RAG toolkit.
Production-grade RAG chatbot with a FastAPI + LangGraph backend (Pinecone vector search + Groq LLM + Tavily web fallback) and a Streamlit chat UI, secured via API key and observable in LangSmith.
Production-grade RAG backend for document ingestion and semantic retrieval using embeddings and Pinecone.
An implementation of the GraphRAG pipeline (based on the 2024 paper "From Local to Global" by Edge et al.) for query-focused summarization of large text corpora.
Self-hosted RAG prototype that ingests PDFs and HTML and lets you chat with them via a local UI.
Enterprise Document Ingestion with AI Embeddings — Multi-source ingestion pipeline with pgvector, GDPR compliance, and MCP server
An AI analytics dashboard for research labs, covering analytics, collaboration, and email workflows, built with React and FastAPI.
Store millions of text chunks inside ultra-compact MP4 files, index them with local embeddings, and retrieve answers instantly for fully offline RAG with any LLM.
AI-powered RAG assistant for parents to get instant, context-aware answers on Brainwonders’ career counseling programs, pricing, and services. Built with Streamlit, LangChain, ChromaDB, and Google Gemma LLM for fast, multi-document retrieval and conversational Q&A.
Agentic RAG Chatbot using multi-agent architecture and Streamlit. Ingests PDFs, DOCX, PPTX, CSV, TXT, and Markdown files to provide contextually accurate answers with a persistent knowledge base. Supports multi-turn conversations, source citations, and dynamic document uploads.
Async document watcher that keeps your RAG index hot. Automatically ingests new or changed documents into a live RAG pipeline with built-in observability.
ScriptumAI is an advanced Retrieval-Augmented Generation platform designed for document ingestion and query processing.
📊 Streamline query-focused summarization by constructing knowledge graphs and extracting insights from document corpora with the GraphRAG pipeline.
🗂️ Build a knowledge graph for global query-focused summarization from document corpora using the GraphRAG pipeline, enhancing information synthesis.