Research-grade RAG benchmarking platform with hybrid retrieval, sentence-level grounding, hallucination analysis, and quantitative model comparison.
python render openai custom-metrics faiss streamlit youtube-transcript-api yt-dlp langchain hybrid-retrieval rouge-score bm25-retriever embedding-based-similarity unstructured-url-loader
-
Updated
Feb 9, 2026 - Python