This repo is to summarize relevant pages and repos that are related to RAG and Agent.
- data preparation
- framework
- prompt engineering
- deployment and operations
- learning materials
Components of RAG and Agent
- Retrieval
- Data preparation (parsing, extraction from all kinds of data)
- Generation
- Evaluation
- Deployment
- Monitoring
- Workflow
Enhance LLM with external data
- Data
- text, image, video, audio, etc.
- pdf, docx, txt, json, csv, etc.
- Algorithm, i.e. how to do the retrival and generation
- e.g. graphRAG, generate a knowledge graph to help answer global questions
- Deployment
- LLM (API or local deployment)
- database
- monitoring and evaluation
Autonomous system that uses an underlying large language model (LLM) to perform complex, multi-step tasks to achieve a specific goal.
- Key characteritics
- goal oriented
- reasoning and planning
- tool use
- execution and iteration
- e.g. Deep research, use a series of iterative and systematic steps to autonomously conduct in-depth research on a given topic.
- planning and question generation
- information retrieval
- synthesis and analyze (including evaluation, if the retrieval is not satisfied, the agent may go to step 1 again)
- report generation
- Deployment is similar to RAG