diff --git a/docs/en-US/design/document_upload_design.md b/docs/en-US/design/document_upload_design.md index 9bf4dc43..5de9cbaf 100644 --- a/docs/en-US/design/document_upload_design.md +++ b/docs/en-US/design/document_upload_design.md @@ -1,1077 +1,710 @@ -# ApeRAG Document Upload Architecture Design +--- +title: Document Upload Design +description: Complete process and core design of ApeRAG document upload +keywords: Document Upload, Multi-format Support, Document Parsing, Smart Indexing +--- -## Overview +# Document Upload Design -This document details the complete architecture design of the document upload module in the ApeRAG project, covering the full pipeline from file upload, temporary storage, document parsing, format conversion to final index construction. +## 1. What is Document Upload -**Core Design Philosophy**: Adopts a **two-phase commit** pattern, separating file upload (temporary storage) from document confirmation (formal addition), providing better user experience and resource management capabilities. +Document upload is the entry point of ApeRAG, allowing you to add various formats of documents to your knowledge base. The system automatically processes, indexes, and makes this knowledge searchable and conversational. -## System Architecture +### 1.1 What Can You Upload -### Overall Architecture +ApeRAG supports 20+ document formats, covering virtually all file types used in daily work: +```mermaid +flowchart LR + subgraph Input[📁 Your Documents] + A1[PDF Reports] + A2[Word Docs] + A3[Excel Sheets] + A4[Screenshots] + A5[Meeting Recordings] + A6[Markdown Notes] + end + + subgraph Process[🔄 ApeRAG Auto Processing] + B[Recognize Format
Extract Content
Build Indexes]
+  end
+
+  subgraph Output[✚ Searchable Knowledge]
+    C[Answer Questions&#13;
Find Information
Analyze Relationships] + end + + A1 --> B + A2 --> B + A3 --> B + A4 --> B + A5 --> B + A6 --> B + + B --> C + + style Input fill:#e3f2fd + style Process fill:#fff59d + style Output fill:#c8e6c9 ``` -┌─────────────────────────────────────────────────────────────┐ -│ Frontend │ -│ (Next.js) │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1: Upload │ Step 2: Confirm - │ POST /documents/upload │ POST /documents/confirm - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ View Layer: aperag/views/collections.py │ -│ - HTTP request handling │ -│ - JWT authentication │ -│ - Parameter validation │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ document_service.upload_document() │ document_service.confirm_documents() - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Service Layer: aperag/service/document_service.py │ -│ - Business logic orchestration │ -│ - File validation (type, size) │ -│ - SHA-256 hash deduplication │ -│ - Quota checking │ -│ - Transaction management │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1 │ Step 2 - â–Œ â–Œ -┌────────────────────────┐ ┌────────────────────────────┐ -│ 1. Create Document │ │ 1. Update Document status │ -│ status=UPLOADED │ │ UPLOADED → PENDING │ -│ 2. Save to ObjectStore│ │ 2. Create DocumentIndex │ -│ 3. Calculate hash │ │ 3. Trigger indexing tasks │ -└────────┬───────────────┘ └────────┬───────────────────┘ - │ │ - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Storage Layer │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ ┌─────────────┐ │ -│ │ PostgreSQL │ │ Object Store │ │ Vector DB │ │ -│ │ │ │ │ │ │ │ -│ │ - document │ │ - Local/S3 │ │ - Qdrant │ │ -│ │ - document_ │ │ - Original files │ │ - Vectors │ │ -│ │ index │ │ - Converted files│ │ │ │ -│ └───────────────┘ └──────────────────┘ └─────────────┘ │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ │ -│ │ Elasticsearch │ │ Neo4j/PG │ │ -│ │ │ │ │ │ -│ │ - Full-text │ │ - Knowledge Graph│ │ -│ └───────────────┘ └──────────────────┘ │ -└─────────────────────────────────────────────────────────────┘ - │ - â–Œ - ┌───────────────────┐ - │ Celery Workers │ - │ │ - │ - Doc parsing │ - │ - Format convert │ - │ - Content extract│ - │ - Doc chunking │ - │ - Index building │ - └───────────────────┘ + +**Document Types**: + +| Category | Formats | Typical Use | +|----------|---------|-------------| +| **Office Docs** | PDF, Word, PPT, Excel | Annual reports, meeting minutes, data sheets | +| **Text Files** | TXT, MD, HTML, JSON | Technical docs, notes, config files | +| **Images** | PNG, JPG, GIF | Product screenshots, designs, charts | +| **Audio** | MP3, WAV, M4A | Meeting recordings, interviews | +| **Archives** | ZIP, TAR, GZ | Batch document packages | + +### 1.2 What Happens After Upload + +```mermaid +flowchart TB + A[You upload a PDF] --> B{System Auto Recognizes} + + B --> C[Extract text content] + B --> D[Identify table structure] + B --> E[Extract images] + B --> F[Recognize formulas] + + C --> G[Build indexes] + D --> G + E --> G + F --> G + + G --> H1[Vector Index
Semantic search] + G --> H2[Full-text Index
Keyword search] + G --> H3[Graph Index
Relationship query] + + H1 --> I[Done! Can retrieve] + H2 --> I + H3 --> I + + style A fill:#e1f5ff + style B fill:#fff59d + style G fill:#ffe0b2 + style I fill:#c8e6c9 ``` -### Layered Architecture +**Simply put**: You just upload files, the system automatically handles everything! + +## 2. Practical Applications + +See how document upload works in real scenarios. + +### 2.1 Enterprise Knowledge Base + +**Scenario**: Company building internal knowledge base. + +**Upload Content**: +- 📋 Policy documents: Employee handbook, attendance policies, reimbursement procedures +- 📊 Business materials: Product introductions, sales data, financial reports +- 🔧 Technical docs: System architecture, API documentation, deployment guides +- 📁 Project materials: Project proposals, meeting records, retrospectives + +**Results**: ``` -┌─────────────────────────────────────────────┐ -│ View Layer (views/collections.py) │ HTTP handling, auth, validation -└─────────────────┬───────────────────────────┘ - │ calls -┌─────────────────▌───────────────────────────┐ -│ Service Layer (service/document_service.py)│ Business logic, transaction, permission -└─────────────────┬───────────────────────────┘ - │ calls -┌─────────────────▌───────────────────────────┐ -│ Repository Layer (db/ops.py, objectstore/) │ Data access abstraction -└─────────────────┬───────────────────────────┘ - │ accesses -┌─────────────────▌───────────────────────────┐ -│ Storage Layer (PG, S3, Qdrant, ES, Neo4j) │ Data persistence -└─────────────────────────────────────────────┘ +Employee asks: "What's the business trip reimbursement process?" +System: Finds reimbursement process section from "Finance Policy.pdf" + +New hire asks: "What products does the company have?" +System: Extracts product list from "Product Manual.pptx" + +Developer: "How to call this API?" +System: Finds calling example from "API Docs.md" ``` -## Core Process Details +### 2.2 Research Material Organization -### Phase 0: API Interface Definition +**Scenario**: Graduate student organizing papers and study materials. -The system provides three main interfaces: +**Upload Content**: +- 📖 Academic papers (PDF) +- 📝 Reading notes (Markdown) +- 🎓 Course slides (PPT) +- 📊 Experiment data (Excel) -1. **Upload File** (Two-phase mode - Step 1) - - Endpoint: `POST /api/v1/collections/{collection_id}/documents/upload` - - Function: Upload file to temporary storage, status `UPLOADED` - - Returns: `document_id`, `filename`, `size`, `status` +**Results**: -2. **Confirm Documents** (Two-phase mode - Step 2) - - Endpoint: `POST /api/v1/collections/{collection_id}/documents/confirm` - - Function: Confirm uploaded documents, trigger index building - - Parameters: `document_ids` array - - Returns: `confirmed_count`, `failed_count`, `failed_documents` +``` +Q: "What research exists on Graph RAG?" +A: Finds relevant content from multiple papers -3. **One-step Upload** (Legacy mode, backward compatible) - - Endpoint: `POST /api/v1/collections/{collection_id}/documents` - - Function: Upload and directly add to knowledge base, status directly to `PENDING` - - Supports batch upload +Q: "What are an author's main contributions?" +A: Analyzes papers, summarizes research directions +``` + +### 2.3 Personal Knowledge Management -### Phase 1: File Upload and Temporary Storage +**Scenario**: Developer accumulating technical notes. 
-#### 1.1 Upload Flow +**Upload Content**: +- 💻 Study notes (Markdown) +- 📞 Technical screenshots (PNG) +- 🎬 Tutorial audio +- 📚 Technical books (PDF) + +**Results**: ``` -User selects files - │ - â–Œ -Frontend calls upload API - │ - â–Œ -View layer validates identity and params - │ - â–Œ -Service layer processes business logic: - │ - ├─► Verify collection exists and active - │ - ├─► Validate file type and size - │ - ├─► Read file content - │ - ├─► Calculate SHA-256 hash - │ - └─► Transaction processing: - │ - ├─► Duplicate detection (by filename + hash) - │ ├─ Exact match: Return existing doc (idempotent) - │ ├─ Same name, different content: Throw conflict error - │ └─ New document: Continue creation - │ - ├─► Create Document record (status=UPLOADED) - │ - ├─► Upload to object store - │ └─ Path: user-{user_id}/{collection_id}/{document_id}/original{suffix} - │ - └─► Update document metadata (object_path) +Q: "How did I solve Redis connection issues before?" +A: Finds solution from "Redis Troubleshooting.md" + +Q: "What are best practices for this tech?" +A: Summarizes best practices from multiple documents ``` -#### 1.2 File Validation +### 2.4 Multimodal Content Processing -**Supported File Types**: -- Documents: `.pdf`, `.doc`, `.docx`, `.ppt`, `.pptx`, `.xls`, `.xlsx` -- Text: `.txt`, `.md`, `.html`, `.json`, `.xml`, `.yaml`, `.yml`, `.csv` -- Images: `.png`, `.jpg`, `.jpeg`, `.gif`, `.bmp`, `.tiff`, `.tif` -- Audio: `.mp3`, `.wav`, `.m4a` -- Archives: `.zip`, `.tar`, `.gz`, `.tgz` +**Scenario**: Product team's design materials. -**Size Limits**: -- Default: 100 MB (configurable via `MAX_DOCUMENT_SIZE` environment variable) -- Extracted total size: 5 GB (`MAX_EXTRACTED_SIZE`) +**Upload Content**: +- 🎚 UI designs (images) +- 📋 Product PRDs (Word) +- 🎀 User interview recordings +- 📊 Data analysis reports (Excel) -#### 1.3 Duplicate Detection Mechanism +**System Processing**: +- Designs → OCR extract text + Vision understand design intent +- PRD → Extract product requirements and features +- Recordings → Transcribe to text, extract user feedback +- Reports → Extract key metrics -Uses **filename + SHA-256 hash** dual detection: +**Result**: All content integrated, searchable together! -| Scenario | Filename | Hash | System Behavior | -|----------|----------|------|-----------------| -| Exact match | Same | Same | Return existing document (idempotent) | -| Name conflict | Same | Different | Throw `DocumentNameConflictException` | -| New document | Different | - | Create new document record | +## 3. Upload Experience -**Advantages**: -- ✅ Supports idempotent upload: Network retries won't create duplicates -- ✅ Prevents content conflicts: Same name with different content prompts user -- ✅ Saves storage space: Same content stored only once +### 3.1 Batch Upload is Simple -### Phase 2: Temporary Storage Configuration +Suppose you need to upload 50 company documents: -#### 2.1 Object Storage Types +**Step 1: Select Files (10 seconds)** -System supports two object storage backends, switchable via environment variables: +``` +Click "Upload Documents" → Select 50 PDFs → Click "Start Upload" +``` -**1. Local Storage (Local filesystem)** +**Step 2: Quick Upload (30 seconds)** -Use cases: -- Development and testing environments -- Small-scale deployments -- Single-machine deployments +``` +Progress: 1/50, 2/50, 3/50... 
50/50 ✅ +All files uploaded to staging in seconds, no wait for processing +``` -Configuration: -```bash -# Development environment -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=.objects +**Step 3: Preview and Confirm (1 minute)** -# Docker environment -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=/shared/objects ``` +View uploaded file list: +- ✅ annual_report.pdf (5.2 MB) +- ✅ product_manual.pdf (3.1 MB) +- ❌ personal_notes.pdf (shouldn't upload) → Uncheck +- ✅ technical_docs.pdf (2.8 MB) +... -Storage path example: -``` -.objects/ -└── user-google-oauth2-123456/ - └── col_abc123/ - └── doc_xyz789/ - ├── original.pdf # Original file - ├── converted.pdf # Converted PDF - ├── processed_content.md # Parsed Markdown - ├── chunks/ # Chunked data - │ ├── chunk_0.json - │ └── chunk_1.json - └── images/ # Extracted images - ├── page_0.png - └── page_1.png +Click "Save to Knowledge Base" ``` -**2. S3 Storage (Compatible with AWS S3/MinIO/OSS, etc.)** +**Step 4: Background Processing (5-30 minutes)** -Use cases: -- Production environments -- Large-scale deployments -- Distributed deployments -- High availability and disaster recovery needs +``` +System auto processes: +- Parse document content +- Build multiple indexes +- You can continue other work, no need to wait +``` + +**Step 5: Completion Notification** -Configuration: -```bash -OBJECT_STORE_TYPE=s3 -OBJECT_STORE_S3_ENDPOINT=http://127.0.0.1:9000 # MinIO/S3 address -OBJECT_STORE_S3_REGION=us-east-1 # AWS Region -OBJECT_STORE_S3_ACCESS_KEY=minioadmin # Access Key -OBJECT_STORE_S3_SECRET_KEY=minioadmin # Secret Key -OBJECT_STORE_S3_BUCKET=aperag # Bucket name -OBJECT_STORE_S3_PREFIX_PATH=dev/ # Optional path prefix -OBJECT_STORE_S3_USE_PATH_STYLE=true # Set to true for MinIO ``` +Notification: "49 documents processed, ready for retrieval" +``` + +### 3.2 Processing Time Reference + +Different sized documents have different processing speeds: + +| Document Type | Size | Upload Time | Processing Time | Example | +|--------------|------|-------------|-----------------|---------| +| 🏃 Small | < 5 pages | < 1 sec | 1-3 minutes | Notices, emails | +| 🚶 Medium | 10-50 pages | < 3 sec | 3-10 minutes | Reports, manuals | +| 🐌 Large | 100+ pages | < 10 sec | 10-30 minutes | Books, paper collections | -#### 2.2 Object Storage Path Rules +**Key Points**: +- ✅ Upload always fast (seconds) +- ⏳ Processing happens in background (non-blocking) +- 📊 Can view processing progress in real-time + +### 3.3 Real-time Progress Tracking + +After upload, you can check document status anytime: -**Path Format**: ``` -{prefix}/user-{user_id}/{collection_id}/{document_id}/{filename} +Document List: + +📄 annual_report.pdf + Status: Processing (60%) + ├─ ✅ Document Parsing: Complete + ├─ ✅ Vector Index: Complete + ├─ 🔄 Full-text Index: In Progress + └─ ⏳ Graph Index: Waiting + +📄 product_manual.pdf + Status: Complete ✅ + Can retrieve + +📄 meeting_notes.pdf + Status: Failed ❌ + Error: File corrupted + Action: Re-upload ``` -**Components**: -- `prefix`: Optional global prefix (S3 only) -- `user_id`: User ID (`|` replaced with `-`) -- `collection_id`: Collection ID -- `document_id`: Document ID -- `filename`: Filename (e.g., `original.pdf`, `page_0.png`) +## 4. Core Features -**Multi-tenancy Isolation**: -- Each user has an independent namespace -- Each collection has an independent storage directory -- Each document has an independent folder +ApeRAG document upload has unique features making it more convenient. 
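+
+If you script your uploads, the staging-area workflow described in 4.1 below maps onto two API calls: upload files to staging, then confirm the ones you want to keep. Below is a minimal sketch; the endpoints and field names follow the upload API definition, while the base URL, auth header, multipart field name, and exact response shape are illustrative assumptions:
+
+```python
+import requests
+
+BASE = "http://localhost:8000/api/v1"          # assumed deployment URL
+HEADERS = {"Authorization": "Bearer <token>"}  # assumed auth scheme
+COLLECTION_ID = "col_abc123"                   # placeholder collection ID
+
+# Step 1: upload files into the staging area (each document gets status UPLOADED)
+doc_ids = []
+for path in ["annual_report.pdf", "product_manual.pdf"]:
+    with open(path, "rb") as f:
+        resp = requests.post(
+            f"{BASE}/collections/{COLLECTION_ID}/documents/upload",
+            headers=HEADERS,
+            files={"file": f},  # multipart field name is assumed
+        )
+    resp.raise_for_status()
+    doc_ids.append(resp.json()["document_id"])
+
+# Step 2: confirm only the documents you want to keep
+# (status UPLOADED -> PENDING; index building starts in the background)
+resp = requests.post(
+    f"{BASE}/collections/{COLLECTION_ID}/documents/confirm",
+    headers=HEADERS,
+    json={"document_ids": doc_ids},
+)
+resp.raise_for_status()
+print(resp.json()["confirmed_count"], "documents confirmed")
+```
+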
-### Phase 3: Document Confirmation and Index Building +### 4.1 Staging Area Design -#### 3.1 Confirmation Flow +**Core Idea**: Upload first, select later - gives you a chance to "regret". + +**Like online shopping**: ``` -User clicks "Save to Collection" - │ - â–Œ -Frontend calls confirm API - │ - â–Œ -Service layer processes: - │ - ├─► Validate collection configuration - │ - ├─► Check Quota (deduct quota at confirmation stage) - │ - └─► For each document_id: - │ - ├─► Verify document status is UPLOADED - │ - ├─► Update document status: UPLOADED → PENDING - │ - ├─► Create index records based on collection config: - │ ├─ VECTOR (Vector index, required) - │ ├─ FULLTEXT (Full-text index, required) - │ ├─ GRAPH (Knowledge graph, optional) - │ ├─ SUMMARY (Document summary, optional) - │ └─ VISION (Vision index, optional) - │ - └─► Return confirmation result - │ - â–Œ -Trigger Celery task: reconcile_document_indexes - │ - â–Œ -Background async index building +Shopping process: +1. Add to cart (staging) +2. Review cart, remove unwanted items +3. Submit order (confirm) + +Document upload: +1. Upload to staging (quick upload) +2. Review list, cancel unneeded ones +3. Save to knowledge base (confirm addition) ``` -#### 3.2 Quota Management +**Benefits**: -**Check Timing**: -- ❌ Not checked during upload phase (temporary storage doesn't consume quota) -- ✅ Checked during confirmation phase (formal addition consumes quota) +- ✅ **Fast Upload**: 20 files uploaded in 5 seconds, no wait for processing +- ✅ **Selective Addition**: Upload 100, save only the 80 needed +- ✅ **Save Quota**: Staging files don't consume quota +- ✅ **Easy Correction**: Found error? Cancel directly, no need to delete -**Quota Types**: +### 4.2 Smart Processing -1. **User Global Quota** - - `max_document_count`: Total document count limit per user - - Default: 1000 (configurable via `MAX_DOCUMENT_COUNT`) +**Auto Format Recognition**: -2. **Per-Collection Quota** - - `max_document_count_per_collection`: Document count limit per collection - - Excludes `UPLOADED` and `DELETED` status documents +System auto recognizes file type and selects appropriate processing: -**Quota Exceeded Handling**: -- Throws `QuotaExceededException` -- Returns HTTP 400 error -- Includes current usage and quota limit information +- 📄 PDF → Extract text, tables, images, formulas +- 📋 Word → Convert format, extract content +- 📊 Excel → Recognize table structure +- 🎚 Images → OCR text + understand content +- 🎀 Audio → Transcribe to text -### Phase 4: Document Parsing and Format Conversion +**No extra operations needed**, system handles automatically! 
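+
+Under the hood, this "recognize and route" behavior is a parser chain with fallback (detailed in section 5.1): candidate parsers are tried in order, and when one cannot handle a file the next one takes over. A minimal sketch of the idea — the extension table and the `run_parser` stub are illustrative, not the actual logic in `aperag/docparser/doc_parser.py`:
+
+```python
+from pathlib import Path
+
+# Illustrative candidate chains per extension, based on the parser support
+# described in section 5.1; the real chains may differ.
+PARSER_CHAINS = {
+    ".pdf":  ["MinerUParser", "DocRayParser", "MarkItDownParser"],
+    ".docx": ["DocRayParser", "MarkItDownParser"],
+    ".png":  ["ImageParser", "MarkItDownParser"],
+    ".mp3":  ["AudioParser", "MarkItDownParser"],
+}
+
+class FallbackError(Exception):
+    """Raised when a parser cannot handle a file, so the next one is tried."""
+
+def run_parser(name: str, path: str) -> str:
+    # Stub standing in for the real parser classes: pretend only MarkItDown
+    # is available here, so everything else falls back to it.
+    if name != "MarkItDownParser":
+        raise FallbackError(f"{name} not configured")
+    return f"# markdown extracted from {path}"
+
+def parse(path: str) -> str:
+    chain = PARSER_CHAINS.get(Path(path).suffix.lower(), ["MarkItDownParser"])
+    for name in chain:
+        try:
+            return run_parser(name, path)
+        except FallbackError:
+            continue  # degrade to the next parser in the chain
+    raise RuntimeError(f"no parser could handle {path}")
+
+print(parse("annual_report.pdf"))
+```
+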
-#### 4.1 Parser Architecture +### 4.3 Background Processing -System uses a **multi-parser chain invocation** architecture, where each parser handles specific file types: +After upload, system auto processes in background: -``` -DocParser (Main Controller) - │ - ├─► MinerUParser - │ └─ Function: High-precision PDF parsing (commercial API) - │ └─ Supports: .pdf - │ - ├─► DocRayParser - │ └─ Function: Document layout analysis and content extraction - │ └─ Supports: .pdf, .docx, .pptx, .xlsx - │ - ├─► ImageParser - │ └─ Function: Image content recognition (OCR + vision understanding) - │ └─ Supports: .jpg, .png, .gif, .bmp, .tiff - │ - ├─► AudioParser - │ └─ Function: Audio transcription (Speech-to-Text) - │ └─ Supports: .mp3, .wav, .m4a - │ - └─► MarkItDownParser (Fallback) - └─ Function: Universal document to Markdown conversion - └─ Supports: Almost all common formats +```mermaid +sequenceDiagram + participant U as You + participant S as System + + U->>S: Upload file + S-->>U: Second-level return ✅ + Note over U: Continue work, no wait + + S->>S: Parse document... + S->>S: Build indexes... + S-->>U: Processing complete notification 🔔 ``` -#### 4.2 Parser Configuration +**Advantages**: +- No wait, upload then do other things +- System auto retries failed documents +- Real-time view processing progress -**Configuration Method**: Dynamically controlled via Collection Config +### 4.4 Auto Cleanup -```json -{ - "parser_config": { - "use_mineru": false, // Enable MinerU (requires API Token) - "use_doc_ray": false, // Enable DocRay - "use_markitdown": true, // Enable MarkItDown (default) - "mineru_api_token": "xxx" // MinerU API Token (optional) - } -} -``` +Staging area files not confirmed in 7 days are auto cleaned, preventing storage waste. -**Environment Variable Configuration**: -```bash -USE_MINERU_API=false # Globally enable MinerU -MINERU_API_TOKEN=your_token # MinerU API Token +## 5. Document Parsing Principles + +After upload, system needs to "understand" the document. Different formats have different processing methods. + +### 5.1 Parser Workflow + +System has multiple parsers, auto selects most suitable: + +```mermaid +flowchart TD + File[Upload PDF] --> Try1{Try MinerU} + Try1 -->|Success| Result[Parsing Complete] + Try1 -->|Fail/Not Configured| Try2{Try DocRay} + Try2 -->|Success| Result + Try2 -->|Fail/Not Configured| Try3[Use MarkItDown] + Try3 --> Result + + style File fill:#e1f5ff + style Result fill:#c5e1a5 + style Try1 fill:#fff3e0 + style Try2 fill:#fff3e0 + style Try3 fill:#c5e1a5 ``` -#### 4.3 Parsing Flow +**Parser Priority**: + +1. **MinerU**: Most powerful, commercial API, paid + - Good at: Complex PDFs, academic papers, documents with formulas + +2. **DocRay**: Open source, free, strong layout analysis + - Good at: Tables, charts, multi-column layouts + +3. **MarkItDown**: Generic, fallback, supports all formats + - Good at: Simple documents, text files + +**Auto degradation benefits**: +- Try best parser first +- Auto switch to next if fails +- Always one succeeds + +### 5.2 Specific Examples + +**Example 1: Complex PDF** ``` -Celery Worker receives indexing task - │ - â–Œ -1. Download original file from object store - │ - â–Œ -2. Select Parser based on file extension - │ - ├─► Try first matching Parser - │ ├─ Success: Return parsing result - │ └─ Failure: FallbackError → Try next Parser - │ - └─► Final fallback: MarkItDownParser - │ - â–Œ -3. Parsing result (Parts): - │ - ├─► MarkdownPart: Text content - │ └─ Contains: headings, paragraphs, lists, tables, etc. 
- │ - ├─► PdfPart: PDF file - │ └─ For: linearization, page rendering - │ - └─► AssetBinPart: Binary resources - └─ Contains: images, embedded files, etc. - │ - â–Œ -4. Post-processing: - │ - ├─► PDF pages to images (required for Vision index) - │ └─ Each page rendered as PNG image - │ └─ Saved to {document_path}/images/page_N.png - │ - ├─► PDF linearization (speed up browser loading) - │ └─ Use pikepdf to optimize PDF structure - │ └─ Saved to {document_path}/converted.pdf - │ - └─► Extract text content (plain text) - └─ Merge all MarkdownPart content - └─ Saved to {document_path}/processed_content.md - │ - â–Œ -5. Save to object store +Upload: annual_report.pdf (50 pages, with tables and charts) + ↓ +DocRay parser auto: +- 📝 Extract all text content +- 📊 Recognize tables, maintain structure +- 🎚 Extract images and charts +- 📐 Recognize LaTeX formulas + ↓ +Get: +- Complete Markdown document +- 50 page screenshots (if vision index needed) ``` -#### 4.4 Format Conversion Examples +**Example 2: Image Screenshot** -**Example 1: PDF Document** ``` -Input: user_manual.pdf (5 MB) - │ - â–Œ -Parser selection: DocRayParser / MarkItDownParser - │ - â–Œ -Output Parts: - ├─ MarkdownPart: "# User Manual\n\n## Chapter 1\n..." - └─ PdfPart: - │ - â–Œ -Post-processing: - ├─ Render 50 pages to images → images/page_0.png ~ page_49.png - ├─ Linearize PDF → converted.pdf - └─ Extract text → processed_content.md +Upload: product_screenshot.png + ↓ +ImageParser auto: +- 📞 OCR recognize text in image +- 👁 Vision AI understand image content + ↓ +Get: +- Text: "Product name: ApeRAG, Version: 2.0..." +- Description: "This is a product intro page with name, version, and feature list" ``` -**Example 2: Image File** +**Example 3: Meeting Recording** + ``` -Input: screenshot.png (2 MB) - │ - â–Œ -Parser selection: ImageParser - │ - â–Œ -Output Parts: - ├─ MarkdownPart: "[OCR extracted text]" - └─ AssetBinPart: (vision_index=true) - │ - â–Œ -Post-processing: - └─ Save original image copy → images/file.png +Upload: meeting.mp3 (30 minutes) + ↓ +AudioParser auto: +- 🎀 Speech-to-text (STT) +- 📝 Generate meeting transcript + ↓ +Get: +- "Meeting starts. Host John: Hello everyone, today we discuss product planning..." +- Complete meeting text transcript ``` -**Example 3: Audio File** +### 5.3 Duplicate File Handling + +System auto detects duplicate uploads: + ``` -Input: meeting_record.mp3 (50 MB) - │ - â–Œ -Parser selection: AudioParser - │ - â–Œ -Output Parts: - └─ MarkdownPart: "[Transcribed meeting content]" - │ - â–Œ -Post-processing: - └─ Save transcription text → processed_content.md +First upload report.pdf → Create new document ✅ +Second upload report.pdf (same content) → Return existing document ✅ +Third upload report.pdf (different content) → Conflict warning, need rename ⚠ ``` -### Phase 5: Index Building +**Advantages**: +- Avoid duplicate documents +- Network retries don't create multiple documents +- Save storage space -#### 5.1 Index Types and Functions +## 6. 
Index Building -| Index Type | Required | Function Description | Storage Location | -|-----------|----------|---------------------|------------------| -| **VECTOR** | ✅ Required | Vector retrieval, semantic search | Qdrant / Elasticsearch | -| **FULLTEXT** | ✅ Required | Full-text search, keyword search | Elasticsearch | -| **GRAPH** | ❌ Optional | Knowledge graph, entity & relation extraction | Neo4j / PostgreSQL | -| **SUMMARY** | ❌ Optional | Document summary, LLM generated | PostgreSQL (index_data) | -| **VISION** | ❌ Optional | Vision understanding, image content analysis | Qdrant (vectors) + PG (metadata) | +After document parsing, system auto builds multiple indexes for different retrieval methods. -#### 5.2 Index Building Flow +### 6.1 Why Multiple Indexes Needed + +Different questions need different retrieval methods: ``` -Celery Worker: reconcile_document_indexes task - │ - â–Œ -1. Scan DocumentIndex table, find indexes needing processing - │ - ├─► PENDING status + observed_version < version - │ └─ Need to create or update index - │ - └─► DELETING status - └─ Need to delete index - │ - â–Œ -2. Group by document, process one by one - │ - â–Œ -3. For each document: - │ - ├─► parse_document (parse document) - │ ├─ Download original file from object store - │ ├─ Call DocParser to parse - │ └─ Return ParsedDocumentData - │ - └─► For each index type: - │ - ├─► create_index (create/update index) - │ │ - │ ├─ VECTOR index: - │ │ ├─ Document chunking - │ │ ├─ Generate vectors using Embedding model - │ │ └─ Write to Qdrant - │ │ - │ ├─ FULLTEXT index: - │ │ ├─ Extract plain text content - │ │ ├─ Chunk by paragraph/section - │ │ └─ Write to Elasticsearch - │ │ - │ ├─ GRAPH index: - │ │ ├─ Extract entities using LightRAG - │ │ ├─ Extract entity relationships - │ │ └─ Write to Neo4j/PostgreSQL - │ │ - │ ├─ SUMMARY index: - │ │ ├─ Generate summary using LLM - │ │ └─ Save to DocumentIndex.index_data - │ │ - │ └─ VISION index: - │ ├─ Extract image Assets - │ ├─ Understand image content using Vision LLM - │ ├─ Generate image description vectors - │ └─ Write to Qdrant - │ - └─► Update index status - ├─ Success: CREATING → ACTIVE - └─ Failure: CREATING → FAILED - │ - â–Œ -4. Update document overall status - │ - ├─ All indexes ACTIVE → Document.status = COMPLETE - ├─ Any index FAILED → Document.status = FAILED - └─ Some indexes still processing → Document.status = RUNNING -``` +Q: "How to optimize database performance?" +→ Need: Vector index (semantic similarity search) -#### 5.3 Document Chunking +Q: "Where is PostgreSQL config file?" +→ Need: Full-text index (exact keyword search) -**Chunking Strategy**: -- Recursive character splitting (RecursiveCharacterTextSplitter) -- Prioritize splitting by natural paragraphs and sections -- Maintain context overlap +Q: "What's the relationship between John and Mike?" +→ Need: Graph index (relationship query) -**Chunking Parameters**: -```json -{ - "chunk_size": 1000, // Max characters per chunk - "chunk_overlap": 200, // Overlap characters - "separators": ["\n\n", "\n", " ", ""] // Separator priority -} -``` +Q: "What's this document mainly about?" +→ Need: Summary index (quick overview) -**Chunking Result Storage**: -``` -{document_path}/chunks/ - ├─ chunk_0.json: {"text": "...", "metadata": {...}} - ├─ chunk_1.json: {"text": "...", "metadata": {...}} - └─ ... +Q: "What's in this image?" 
+→ Need: Vision index (image content search) ``` -## Database Design - -### Table 1: document (Document Metadata) - -**Table Structure**: - -| Field | Type | Description | Index | -|-------|------|-------------|-------| -| `id` | String(24) | Document ID, primary key, format: `doc{random_id}` | PK | -| `name` | String(1024) | Filename | - | -| `user` | String(256) | User ID (supports multiple IDPs) | ✅ Index | -| `collection_id` | String(24) | Collection ID | ✅ Index | -| `status` | Enum | Document status (see table below) | ✅ Index | -| `size` | BigInteger | File size (bytes) | - | -| `content_hash` | String(64) | SHA-256 hash (for deduplication) | ✅ Index | -| `object_path` | Text | Object store path (deprecated, use doc_metadata) | - | -| `doc_metadata` | Text | Document metadata (JSON string) | - | -| `gmt_created` | DateTime(tz) | Creation time (UTC) | - | -| `gmt_updated` | DateTime(tz) | Update time (UTC) | - | -| `gmt_deleted` | DateTime(tz) | Deletion time (soft delete) | ✅ Index | - -**Unique Constraint**: -```sql -UNIQUE INDEX uq_document_collection_name_active - ON document (collection_id, name) - WHERE gmt_deleted IS NULL; -``` -- Within the same collection, active document names cannot be duplicated -- Deleted documents are excluded from uniqueness check - -**Document Status Enum** (`DocumentStatus`): - -| Status | Description | When Set | Visibility | -|--------|-------------|----------|------------| -| `UPLOADED` | Uploaded to temporary storage | `upload_document` API | Frontend file selection UI | -| `PENDING` | Waiting for index building | `confirm_documents` API | Document list (processing) | -| `RUNNING` | Index building in progress | Celery task starts processing | Document list (processing) | -| `COMPLETE` | All indexes completed | All indexes become ACTIVE | Document list (available) | -| `FAILED` | Index building failed | Any index fails | Document list (failed) | -| `DELETED` | Deleted | `delete_document` API | Not visible (soft delete) | -| `EXPIRED` | Temporary document expired | Scheduled cleanup task | Not visible | - -**Document Metadata Example** (`doc_metadata` JSON field): -```json -{ - "object_path": "user-xxx/col_xxx/doc_xxx/original.pdf", - "converted_path": "user-xxx/col_xxx/doc_xxx/converted.pdf", - "processed_content_path": "user-xxx/col_xxx/doc_xxx/processed_content.md", - "images": [ - "user-xxx/col_xxx/doc_xxx/images/page_0.png", - "user-xxx/col_xxx/doc_xxx/images/page_1.png" - ], - "parser_used": "DocRayParser", - "parse_duration_ms": 5420, - "page_count": 50, - "custom_field": "value" -} -``` +### 6.2 Five Index Types -### Table 2: document_index (Index Status Management) - -**Table Structure**: - -| Field | Type | Description | Index | -|-------|------|-------------|-------| -| `id` | Integer | Auto-increment ID, primary key | PK | -| `document_id` | String(24) | Related document ID | ✅ Index | -| `index_type` | Enum | Index type (see table below) | ✅ Index | -| `status` | Enum | Index status (see table below) | ✅ Index | -| `version` | Integer | Index version number | - | -| `observed_version` | Integer | Processed version number | - | -| `index_data` | Text | Index data (JSON), e.g., summary content | - | -| `error_message` | Text | Error message (on failure) | - | -| `gmt_created` | DateTime(tz) | Creation time | - | -| `gmt_updated` | DateTime(tz) | Update time | - | -| `gmt_last_reconciled` | DateTime(tz) | Last reconciliation time | - | - -**Unique Constraint**: -```sql -UNIQUE CONSTRAINT uq_document_index - ON document_index (document_id, 
index_type); -``` -- Each document has only one record per index type - -**Index Type Enum** (`DocumentIndexType`): - -| Type | Value | Description | External Storage | -|------|-------|-------------|------------------| -| `VECTOR` | "VECTOR" | Vector index | Qdrant / Elasticsearch | -| `FULLTEXT` | "FULLTEXT" | Full-text index | Elasticsearch | -| `GRAPH` | "GRAPH" | Knowledge graph | Neo4j / PostgreSQL | -| `SUMMARY` | "SUMMARY" | Document summary | PostgreSQL (index_data) | -| `VISION` | "VISION" | Vision index | Qdrant + PostgreSQL | - -**Index Status Enum** (`DocumentIndexStatus`): - -| Status | Description | When Set | -|--------|-------------|----------| -| `PENDING` | Waiting for processing | `confirm_documents` creates index record | -| `CREATING` | Creating | Celery Worker starts processing | -| `ACTIVE` | Ready for use | Index building successful | -| `DELETING` | Marked for deletion | `delete_document` API | -| `DELETION_IN_PROGRESS` | Deleting | Celery Worker is deleting | -| `FAILED` | Failed | Index building failed | - -**Version Control Mechanism**: -- `version`: Expected index version (incremented on document update) -- `observed_version`: Processed version number -- When `version > observed_version`, triggers index update - -**Reconciler**: -```python -# Query indexes needing processing -SELECT * FROM document_index -WHERE status = 'PENDING' - AND observed_version < version; - -# Update after processing -UPDATE document_index -SET status = 'ACTIVE', - observed_version = version, - gmt_last_reconciled = NOW() -WHERE id = ?; +```mermaid +flowchart TB + Doc[Your Document] --> Auto[System Auto Builds] + + Auto --> V[Vector Index
Find Similar Content] + Auto --> F[Full-text Index
Find Keywords] + Auto --> G[Graph Index
Find Relationships] + Auto --> S[Summary Index
Quick Overview] + Auto --> I[Vision Index
Find Images] + + V --> Q1[Q: How to optimize performance?] + F --> Q2[Q: Config file path?] + G --> Q3[Q: A and B's relationship?] + S --> Q4[Q: What's doc about?] + I --> Q5[Q: What's in image?] + + style Doc fill:#e1f5ff + style Auto fill:#fff59d + style V fill:#bbdefb + style F fill:#c5e1a5 + style G fill:#ffccbc + style S fill:#e1bee7 + style I fill:#fff9c4 ``` -### Table Relationship Diagram +**Index Comparison**: -``` -┌─────────────────────────────────┐ -│ collection │ -│ ───────────────────────────── │ -│ id (PK) │ -│ name │ -│ config (JSON) │ -│ status │ -│ ... │ -└────────────┬────────────────────┘ - │ 1:N - â–Œ -┌─────────────────────────────────┐ -│ document │ -│ ───────────────────────────── │ -│ id (PK) │ -│ collection_id (FK) │◄──── Unique constraint: (collection_id, name) -│ name │ -│ user │ -│ status (Enum) │ -│ size │ -│ content_hash (SHA-256) │ -│ doc_metadata (JSON) │ -│ gmt_created │ -│ gmt_deleted │ -│ ... │ -└────────────┬────────────────────┘ - │ 1:N - â–Œ -┌─────────────────────────────────┐ -│ document_index │ -│ ───────────────────────────── │ -│ id (PK) │ -│ document_id (FK) │◄──── Unique constraint: (document_id, index_type) -│ index_type (Enum) │ -│ status (Enum) │ -│ version │ -│ observed_version │ -│ index_data (JSON) │ -│ error_message │ -│ gmt_last_reconciled │ -│ ... │ -└─────────────────────────────────┘ -``` +| Index | Required | Suitable Questions | Speed | +|-------|----------|-------------------|-------| +| Vector | ✅ | Semantic similarity | Fast | +| Full-text | ✅ | Exact keywords | Fast | +| Graph | ❌ | Relationship queries | Slow | +| Summary | ❌ | Quick overview | Medium | +| Vision | ❌ | Image content | Medium | -## State Machine and Lifecycle +**Recommended Config**: -### Document State Transitions +- 💰 Save cost: Only enable vector + full-text +- ⚡ Prioritize speed: Disable graph (slowest) +- 🎯 Full features: Enable all + +### 6.3 Parallel Building + +Multiple indexes can build simultaneously, saving time: ``` - ┌─────────────────────────────────────────────┐ - │ │ - │ â–Œ - [Upload] ──► UPLOADED ──► [Confirm] ──► PENDING ──► RUNNING ──► COMPLETE - │ │ - │ â–Œ - │ FAILED - │ │ - │ â–Œ - └──────► [Delete] ──────────────► DELETED - │ - ┌───────────────────────────────────┘ - │ - â–Œ - EXPIRED (Scheduled cleanup of unconfirmed docs) +Document parsing complete + ↓ +5 indexes start building simultaneously: +- Vector index: 1 minute +- Full-text index: 30 seconds +- Graph index: 10 minutes ⏱ (slowest) +- Summary index: 3 minutes +- Vision index: 2 minutes + ↓ +Total time: 10 minutes (the slowest one) +If serial: 16.5 minutes + +Saved: 40% time! ``` -**Key Transitions**: -1. **UPLOADED → PENDING**: User clicks "Save to Collection" -2. **PENDING → RUNNING**: Celery Worker starts processing -3. **RUNNING → COMPLETE**: All indexes successful -4. **RUNNING → FAILED**: Any index fails -5. 
**Any status → DELETED**: User deletes document +### 6.4 Auto Retry -### Index State Transitions +If an index build fails, system auto retries: ``` - [Create index record] ──► PENDING ──► CREATING ──► ACTIVE - │ - â–Œ - FAILED - │ - â–Œ - ┌──────────► PENDING (retry) - │ - [Delete request] ────────┌──────────► DELETING ──► DELETION_IN_PROGRESS ──► (record deleted) - │ - └──────────► (directly delete record, if PENDING/FAILED) +1st retry: After 1 minute +2nd retry: After 5 minutes +3rd retry: After 15 minutes +Still fails → Mark as failed, notify user ``` -## Async Task Scheduling (Celery) - -### Task Definitions +Most temporary errors (network issues, service restarts) auto recover! -**Main Task**: `reconcile_document_indexes` -- Trigger timing: - - After `confirm_documents` API call - - Scheduled task (every 30 seconds) - - Manual trigger (admin interface) -- Function: Scan `document_index` table, process indexes needing reconciliation +## 7. Technical Implementation -**Sub-tasks**: -- `parse_document_task`: Parse document content -- `create_vector_index_task`: Create vector index -- `create_fulltext_index_task`: Create full-text index -- `create_graph_index_task`: Create knowledge graph index -- `create_summary_index_task`: Create summary index -- `create_vision_index_task`: Create vision index +> 💡 **Reading Tip**: This chapter contains technical details, mainly for developers and ops. General users can skip. -### Task Scheduling Strategy +### 7.1 Storage Architecture -**Concurrency Control**: -- Each Worker processes at most N documents simultaneously (default 4) -- Multiple indexes of each document can be built in parallel -- Use Celery's `task_acks_late=True` to ensure tasks aren't lost +**File Storage Location**: -**Failure Retry**: -- Maximum 3 retries -- Exponential backoff (1 min → 5 min → 15 min) -- Marked as `FAILED` after 3 failures - -**Idempotency**: -- All tasks support repeated execution -- Use `observed_version` mechanism to avoid duplicate processing -- Same input produces same output +``` +Local storage (dev): +.objects/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png -## Design Features and Advantages +Cloud storage (production): +s3://bucket/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png +``` -### 1. Two-Phase Commit Design +**Configuration**: -**Advantages**: -- ✅ **Better User Experience**: Fast upload response, doesn't block user operations -- ✅ **Selective Addition**: Can selectively confirm partial files after batch upload -- ✅ **Reasonable Resource Control**: Unconfirmed documents don't build indexes, don't consume quota -- ✅ **Failure Recovery Friendly**: Temporary documents can be periodically cleaned up without affecting business +```bash +# Local storage +export OBJECT_STORE_TYPE=local -**Status Isolation**: -``` -Temporary status (UPLOADED): - - Not counted in quota - - Doesn't trigger indexing - - Can be automatically cleaned up - -Formal status (PENDING/RUNNING/COMPLETE): - - Counted in quota - - Triggers index building - - Won't be automatically cleaned up +# Cloud storage (S3/MinIO) +export OBJECT_STORE_TYPE=s3 +export OBJECT_STORE_S3_BUCKET=aperag ``` -### 2. 
Idempotency Design +### 7.2 Parser Configuration -**File-Level Idempotency**: -- SHA-256 hash deduplication -- Same file uploaded multiple times returns same `document_id` -- Avoids storage space waste +**Enable Different Parsers**: -**API-Level Idempotency**: -- `upload_document`: Repeated upload returns existing document -- `confirm_documents`: Repeated confirmation doesn't create duplicate indexes -- `delete_document`: Repeated deletion returns success (soft delete) +```bash +# DocRay (recommended, free, good performance) +export USE_DOC_RAY=true +export DOCRAY_HOST=http://docray:8639 -### 3. Multi-Tenancy Isolation +# MinerU (optional, paid, highest precision) +export USE_MINERU_API=false +export MINERU_API_TOKEN=your_token -**Storage Isolation**: -``` -user-{user_A}/... # User A's files -user-{user_B}/... # User B's files +# MarkItDown (default enabled, fallback) +export USE_MARKITDOWN=true ``` -**Database Isolation**: -- All queries filter by `user` field -- Collection-level permission control (`collection.user`) -- Soft delete support (`gmt_deleted`) +**Selection Recommendations**: +- 💰 Free solution: DocRay + MarkItDown +- 🎯 High precision: MinerU + DocRay + MarkItDown -### 4. Flexible Storage Backend +### 7.3 Index Configuration -**Unified Interface**: -```python -AsyncObjectStore: - - put(path, data) - - get(path) - - delete_objects_by_prefix(prefix) +Control which indexes to enable in Collection config: + +```json +{ + "enable_vector": true, // Vector index (required) + "enable_fulltext": true, // Full-text index (required) + "enable_knowledge_graph": true, // Graph index (optional) + "enable_summary": false, // Summary index (optional) + "enable_vision": false // Vision index (optional) +} ``` -**Runtime Switching**: -- Switch between Local/S3 via environment variables -- No need to modify business code -- Supports custom storage backends (just implement the interface) +### 7.4 Performance Tuning -### 5. Transaction Consistency +**File Size Limits**: -**Two-Phase Commit for Database + Object Store**: -```python -async with transaction: - # 1. Create database record - document = create_document_record() - - # 2. Upload to object store - await object_store.put(path, data) - - # 3. Update metadata - document.doc_metadata = json.dumps(metadata) - - # All operations succeed to commit, any failure rolls back +```bash +export MAX_DOCUMENT_SIZE=104857600 # 100 MB +export MAX_EXTRACTED_SIZE=5368709120 # 5 GB ``` -**Failure Handling**: -- Database record creation fails: Don't upload file -- File upload fails: Rollback database record -- Metadata update fails: Rollback previous operations +**Concurrency Settings**: + +```bash +export CELERY_WORKER_CONCURRENCY=16 # Process 16 docs concurrently +export CELERY_TASK_TIME_LIMIT=3600 # Single task timeout 1 hour +``` -### 6. Observability +**Quota Settings**: -**Audit Logging**: -- `@audit` decorator records all document operations -- Includes: user, time, operation type, resource ID +```bash +export MAX_DOCUMENT_COUNT=1000 # Max 1000 docs per user +export MAX_DOCUMENT_COUNT_PER_COLLECTION=100 # Max 100 docs per collection +``` -**Task Tracking**: -- `gmt_last_reconciled`: Last processing time -- `error_message`: Failure reason -- Celery task ID: Link log tracing +## 8. Common Questions -**Monitoring Metrics**: -- Document upload rate -- Index building duration -- Failure rate statistics +### 8.1 File Upload Failed? -## Performance Optimization +**Possible Causes and Solutions**: -### 1. 
Async Processing +| Issue | Cause | Solution | +|-------|-------|----------| +| File too large | Over 100 MB | Compress or split file | +| Format not supported | Special format | Convert to PDF or other common format | +| Name conflict | Same name different content exists | Rename file | +| Quota full | Reached document count limit | Delete old docs or upgrade quota | -**Upload Doesn't Block**: -- Returns immediately after file upload to object store -- Index building executes asynchronously in Celery -- Frontend gets progress via polling or WebSocket +### 8.2 Document Processing Failed? -### 2. Batch Operations +System auto retries 3 times, if still fails: -**Batch Confirmation**: -```python -confirm_documents(document_ids=[id1, id2, ..., idN]) ``` -- Process multiple documents in one transaction -- Batch create index records -- Reduce database round-trips - -### 3. Caching Strategy - -**Parsing Result Cache**: -- Parsed content saved to `processed_content.md` -- Subsequent index rebuilds can read directly without re-parsing - -**Chunking Result Cache**: -- Chunking results saved to `chunks/` directory -- Vector index rebuilds can reuse chunking results - -### 4. Parallel Index Building - -**Multiple Indexes in Parallel**: -```python -# VECTOR, FULLTEXT, GRAPH can be built in parallel -await asyncio.gather( - create_vector_index(), - create_fulltext_index(), - create_graph_index() -) +View error message → Fix based on prompt → Re-upload → System auto retries ``` -## Error Handling - -### Common Exceptions +Common errors: +- File corrupted → Recreate file +- Content unrecognizable → Try converting format +- Temporary network issues → System auto retries -| Exception Type | HTTP Status | Trigger Scenario | Handling Suggestion | -|---------------|-------------|------------------|---------------------| -| `ResourceNotFoundException` | 404 | Collection/document doesn't exist | Check if ID is correct | -| `CollectionInactiveException` | 400 | Collection not active | Wait for collection initialization | -| `DocumentNameConflictException` | 409 | Same name, different content | Rename file or delete old document | -| `QuotaExceededException` | 429 | Quota exceeded | Upgrade plan or delete old documents | -| `InvalidFileTypeException` | 400 | Unsupported file type | Check supported file type list | -| `FileSizeTooLargeException` | 413 | File too large | Split file or compress | +### 8.3 How to Speed Up Processing? -### Exception Propagation +**Method 1**: Disable unneeded indexes -``` -Service Layer throws exception - │ - â–Œ -View Layer catches and converts - │ - â–Œ -Exception Handler unified handling - │ - â–Œ -Return standard JSON response: +```json { - "error_code": "QUOTA_EXCEEDED", - "message": "Document count limit exceeded", - "details": { - "limit": 1000, - "current": 1000 - } + "enable_knowledge_graph": false // Graph slowest, can disable } ``` -## Related Files Index - -### Core Implementation +**Method 2**: Use faster LLM models -- **View Layer**: `aperag/views/collections.py` - HTTP interface definition -- **Service Layer**: `aperag/service/document_service.py` - Business logic -- **Database Models**: `aperag/db/models.py` - Document, DocumentIndex table definitions -- **Database Operations**: `aperag/db/ops.py` - CRUD operation encapsulation +Select faster responding models in Collection config. -### Object Storage +### 8.4 Will Staging Files Be Lost? 
-- **Interface Definition**: `aperag/objectstore/base.py` - AsyncObjectStore abstract class -- **Local Implementation**: `aperag/objectstore/local.py` - Local filesystem storage -- **S3 Implementation**: `aperag/objectstore/s3.py` - S3-compatible storage +- ✅ Within 7 days: Won't be lost, can confirm anytime +- ⚠ After 7 days: Auto cleanup (save storage) +- 💡 Recommendation: Confirm promptly after upload -### Document Parsing +## 9. Summary -- **Main Controller**: `aperag/docparser/doc_parser.py` - DocParser -- **Parser Implementations**: - - `aperag/docparser/mineru_parser.py` - MinerU PDF parsing - - `aperag/docparser/docray_parser.py` - DocRay document parsing - - `aperag/docparser/markitdown_parser.py` - MarkItDown universal parsing - - `aperag/docparser/image_parser.py` - Image OCR - - `aperag/docparser/audio_parser.py` - Audio transcription -- **Document Processing**: `aperag/index/document_parser.py` - Parsing flow orchestration +ApeRAG document upload makes it easy to add various format documents to your knowledge base. -### Index Building +### Core Advantages -- **Index Management**: `aperag/index/manager.py` - DocumentIndexManager -- **Vector Index**: `aperag/index/vector_index.py` - VectorIndexer -- **Full-text Index**: `aperag/index/fulltext_index.py` - FulltextIndexer -- **Knowledge Graph**: `aperag/index/graph_index.py` - GraphIndexer -- **Document Summary**: `aperag/index/summary_index.py` - SummaryIndexer -- **Vision Index**: `aperag/index/vision_index.py` - VisionIndexer +1. ✅ **Supports 20+ formats**: PDF, Word, Excel, images, audio, etc. +2. ✅ **Second-level upload response**: No wait, immediate return +3. ✅ **Staging area design**: Upload first, select later, avoid mistakes +4. ✅ **Smart parsing**: Auto recognize format, select best parser +5. ✅ **Multi-index building**: Build 5 indexes simultaneously, meet different retrieval needs +6. ✅ **Background processing**: Async execution, non-blocking +7. ✅ **Auto retry**: Failures auto retry, improve success rate +8. ✅ **Quota management**: Only consume on confirmation, reasonable resource control -### Task Scheduling +### Performance -- **Task Definitions**: `config/celery_tasks.py` - Celery task registration -- **Reconciler**: `aperag/tasks/reconciler.py` - DocumentIndexReconciler -- **Document Tasks**: `aperag/tasks/document.py` - DocumentIndexTask +| Operation | Time | +|-----------|------| +| Upload 100 files | < 1 minute | +| Confirm addition | < 1 second | +| Small doc processing (< 10 pages) | 1-3 minutes | +| Medium doc (10-50 pages) | 3-10 minutes | +| Large doc (100+ pages) | 10-30 minutes | -### Frontend Implementation +### Suitable Scenarios -- **Document List**: `web/src/app/workspace/collections/[collectionId]/documents/page.tsx` -- **Document Upload**: `web/src/app/workspace/collections/[collectionId]/documents/upload/document-upload.tsx` +- 📚 Enterprise knowledge base building +- 🔬 Research material organization +- 📖 Personal note management +- 🎓 Learning material archiving -## Summary +The system is both **simple to use** and **powerful**, suitable for various scales of knowledge management needs. -ApeRAG's document upload module adopts a **two-phase commit + multi-parser chain invocation + parallel multi-index building** architecture design: +--- -**Core Features**: -1. ✅ **Two-Phase Commit**: Upload (temporary storage) → Confirm (formal addition), providing better user experience -2. ✅ **SHA-256 Deduplication**: Prevents duplicate documents, supports idempotent upload -3. 
✅ **Flexible Storage Backend**: Local/S3 configurable switching, unified interface abstraction -4. ✅ **Multi-Parser Architecture**: Supports MinerU, DocRay, MarkItDown and other parsers -5. ✅ **Automatic Format Conversion**: PDF→images, audio→text, images→OCR text -6. ✅ **Multi-Index Coordination**: Five index types: vector, full-text, graph, summary, vision -7. ✅ **Quota Management**: Quota deducted at confirmation stage, reasonable resource control -8. ✅ **Async Processing**: Celery task queue, doesn't block user operations -9. ✅ **Transaction Consistency**: Two-phase commit for database + object store -10. ✅ **Observability**: Audit logs, task tracking, complete error information recording +## Related Documentation -This design ensures both high performance and scalability, supports complex document processing scenarios (multi-format, multi-language, multi-modal), while maintaining good fault tolerance and user experience. +- 📋 [System Architecture](./architecture.md) - ApeRAG overall architecture design +- 📖 [Graph Index Creation Process](./graph_index_creation.md) - Graph index details +- 🔗 [Index Pipeline Architecture](./indexing_architecture.md) - Complete indexing process diff --git a/docs/zh-CN/design/document_upload_design.md b/docs/zh-CN/design/document_upload_design.md index 307d77d0..8224383c 100644 --- a/docs/zh-CN/design/document_upload_design.md +++ b/docs/zh-CN/design/document_upload_design.md @@ -1,1077 +1,708 @@ -# ApeRAG 文档䞊䌠架构讟计 +--- +title: 文档䞊䌠讟计 +description: ApeRAG 文档䞊䌠的完敎流皋䞎栞心讟计 +keywords: 文档䞊䌠, 倚栌匏支持, 文档解析, 智胜玢匕 +--- -## 抂述 +# 文档䞊䌠讟计 -本文档诊细诎明 ApeRAG 项目䞭文档䞊䌠暡块的完敎架构讟计涵盖从文件䞊䌠、䞎时存傚、文档解析、栌匏蜬换到最终玢匕构建的党铟路流皋。 +## 1. 文档䞊䌠是什么 -**栞心讟计理念**采甚**䞀阶段提亀**暡匏将文件䞊䌠䞎时存傚和文档确讀正匏添加分犻提䟛曎奜的甚户䜓验和资源管理胜力。 +文档䞊䌠是 ApeRAG 的入口功胜让䜠可以把各种栌匏的文档添加到知识库䞭系统䌚自劚倄理、玢匕让这些知识可以被检玢和对话。 -## 系统架构 +### 1.1 胜䞊䌠什么 -### 敎䜓架构囟 +ApeRAG 支持 20+ 种文档栌匏基本涵盖了日垞工䜜䞭的所有文件类型 +```mermaid +flowchart LR + subgraph Input[📁 䜠的文档] + A1[PDF 报告] + A2[Word 文档] + A3[Excel 衚栌] + A4[囟片截囟] + A5[䌚议圕音] + A6[Markdown 笔记] + end + + subgraph Process[🔄 ApeRAG 自劚倄理] + B[识别栌匏
提取内容
构建玢匕]
+  end
+
+  subgraph Output[✚ 可检玢的知识]
+    C[回答问题&#13;
查扟信息
分析关系] + end + + A1 --> B + A2 --> B + A3 --> B + A4 --> B + A5 --> B + A6 --> B + + B --> C + + style Input fill:#e3f2fd + style Process fill:#fff59d + style Output fill:#c8e6c9 ``` -┌─────────────────────────────────────────────────────────────┐ -│ Frontend │ -│ (Next.js) │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1: Upload │ Step 2: Confirm - │ POST /documents/upload │ POST /documents/confirm - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ View Layer: aperag/views/collections.py │ -│ - HTTP请求倄理 │ -│ - JWT身仜验证 │ -│ - 参数验证 │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ document_service.upload_document() │ document_service.confirm_documents() - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Service Layer: aperag/service/document_service.py │ -│ - 䞚务逻蟑猖排 │ -│ - 文件验证类型、倧小 │ -│ - SHA-256 哈垌去重 │ -│ - Quota 检查 │ -│ - 事务管理 │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1 │ Step 2 - â–Œ â–Œ -┌────────────────────────┐ ┌────────────────────────────┐ -│ 1. 创建 Document 记圕 │ │ 1. 曎新 Document 状态 │ -│ status=UPLOADED │ │ UPLOADED → PENDING │ -│ 2. 保存到 ObjectStore │ │ 2. 创建 DocumentIndex 记圕│ -│ 3. 计算 content_hash │ │ 3. 觊发玢匕构建任务 │ -└────────┬───────────────┘ └────────┬───────────────────┘ - │ │ - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Storage Layer │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ ┌─────────────┐ │ -│ │ PostgreSQL │ │ Object Store │ │ Vector DB │ │ -│ │ │ │ │ │ │ │ -│ │ - document │ │ - Local/S3 │ │ - Qdrant │ │ -│ │ - document_ │ │ - 原始文件 │ │ - 向量玢匕 │ │ -│ │ index │ │ - 蜬换后的文件 │ │ │ │ -│ └───────────────┘ └──────────────────┘ └─────────────┘ │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ │ -│ │ Elasticsearch │ │ Neo4j/PG │ │ -│ │ │ │ │ │ -│ │ - 党文玢匕 │ │ - 知识囟谱 │ │ -│ └───────────────┘ └──────────────────┘ │ -└─────────────────────────────────────────────────────────────┘ - │ - â–Œ - ┌───────────────────┐ - │ Celery Workers │ - │ │ - │ - 文档解析 │ - │ - 栌匏蜬换 │ - │ - 内容提取 │ - │ - 文档分块 │ - │ - 玢匕构建 │ - └───────────────────┘ + +**文档类型** + +| 类别 | 栌匏 | 兞型甚途 | +|------|------|---------| +| **办公文档** | PDF, Word, PPT, Excel | 幎床报告、䌚议纪芁、数据衚栌 | +| **文本文件** | TXT, MD, HTML, JSON | 技术文档、笔记、配眮文件 | +| **囟片** | PNG, JPG, GIF | 产品截囟、讟计皿、囟衚 | +| **音频** | MP3, WAV, M4A | 䌚议圕音、采访圕音 | +| **压猩包** | ZIP, TAR, GZ | 批量文档打包 | + +### 1.2 䞊䌠后发生什么 + +```mermaid +flowchart TB + A[䜠䞊䌠䞀䞪 PDF] --> B{系统自劚识别} + + B --> C[提取文字内容] + B --> D[识别衚栌结构] + B --> E[提取囟片] + B --> F[识别公匏] + + C --> G[构建玢匕] + D --> G + E --> G + F --> G + + G --> H1[向量玢匕
支持语义搜玢] + G --> H2[党文玢匕
支持关键词搜玢] + G --> H3[囟谱玢匕
支持关系查询] + + H1 --> I[完成可以检玢] + H2 --> I + H3 --> I + + style A fill:#e1f5ff + style B fill:#fff59d + style G fill:#ffe0b2 + style I fill:#c8e6c9 ``` -### 分层架构 +**简单来诎**䜠只管䞊䌠文件系统自劚垮䜠倄理奜䞀切 + +## 2. 实际应甚场景 + +看看文档䞊䌠圚实际工䜜䞭的应甚。 + +### 2.1 䌁䞚知识库建讟 + +**场景**公叞芁建立内郚知识库。 + +**䞊䌠内容** +- 📋 制床文档员工手册、考勀制床、报销流皋 +- 📊 䞚务资料产品介绍、销售数据、莢务报衚 +- 🔧 技术文档系统架构、API 文档、郚眲指南 +- 📁 项目资料项目方案、䌚议记圕、倍盘总结 + +**䜿甚效果** ``` -┌─────────────────────────────────────────────┐ -│ View Layer (views/collections.py) │ HTTP 倄理、讀证、参数验证 -└─────────────────┬───────────────────────────┘ - │ 调甚 -┌─────────────────▌───────────────────────────┐ -│ Service Layer (service/document_service.py)│ 䞚务逻蟑、事务猖排、权限控制 -└─────────────────┬───────────────────────────┘ - │ 调甚 -┌─────────────────▌───────────────────────────┐ -│ Repository Layer (db/ops.py, objectstore/) │ 数据访问抜象、对象存傚接口 -└─────────────────┬───────────────────────────┘ - │ 访问 -┌─────────────────▌───────────────────────────┐ -│ Storage Layer (PG, S3, Qdrant, ES, Neo4j) │ 数据持久化 -└─────────────────────────────────────────────┘ +员工提问"出差报销流皋是什么" +系统从《莢务制床.pdf》扟到报销流皋章节 + +新人提问"公叞的产品有哪些" +系统从《产品手册.pptx》提取产品列衚 + +技术同孊"这䞪 API 怎么调甚" +系统从《API文档.md》扟到调甚瀺䟋 ``` -## 栞心流皋诊解 +### 2.2 研究资料敎理 -### 阶段 0: API 接口定义 +**场景**研究生敎理论文和孊习资料。 -系统提䟛䞉䞪䞻芁接口 +**䞊䌠内容** +- 📖 孊术论文 PDF +- 📝 读乊笔记 Markdown +- 🎓 诟皋讲义 PPT +- 📊 实验数据 Excel -1. **䞊䌠文件**䞀阶段暡匏 - 第䞀步 - - 接口`POST /api/v1/collections/{collection_id}/documents/upload` - - 功胜䞊䌠文件到䞎时存傚状态䞺 `UPLOADED` - - 返回`document_id`、`filename`、`size`、`status` +**䜿甚效果** -2. **确讀文档**䞀阶段暡匏 - 第二步 - - 接口`POST /api/v1/collections/{collection_id}/documents/confirm` - - 功胜确讀已䞊䌠的文档觊发玢匕构建 - - 参数`document_ids` 数组 - - 返回`confirmed_count`、`failed_count`、`failed_documents` +``` +问"Graph RAG 盞关的研究有哪些" +答从倚篇论文䞭扟到盞关内容 -3. **䞀步䞊䌠**䌠统暡匏兌容旧版 - - 接口`POST /api/v1/collections/{collection_id}/documents` - - 功胜䞊䌠并盎接添加到知识库状态盎接䞺 `PENDING` - - 支持批量䞊䌠 +问"某䞪䜜者的䞻芁莡献是什么" +答分析论文总结䜜者的研究方向 +``` + +### 2.3 䞪人知识管理 -### 阶段 1: 文件䞊䌠䞎䞎时存傚 +**场景**皋序员积环技术笔记。 -#### 1.1 䞊䌠流皋 +**䞊䌠内容** +- 💻 孊习笔记 Markdown +- 📞 技术截囟 PNG +- 🎬 教皋圕屏蜬的音频 +- 📚 技术乊籍 PDF + +**䜿甚效果** ``` -甚户选择文件 - │ - â–Œ -前端调甚 upload API - │ - â–Œ -View 层验证身仜和参数 - │ - â–Œ -Service 层倄理䞚务逻蟑 - │ - ├─► 验证集合存圚䞔激掻 - │ - ├─► 验证文件类型和倧小 - │ - ├─► 读取文件内容 - │ - ├─► 计算 SHA-256 哈垌 - │ - └─► 事务倄理 - │ - ├─► 重倍检测按文件名+哈垌 - │ ├─ 完党盞同返回已存圚文档幂等 - │ ├─ 同名䞍同内容抛出冲突匂垞 - │ └─ 新文档继续创建 - │ - ├─► 创建 Document 记圕status=UPLOADED - │ - ├─► 䞊䌠到对象存傚 - │ └─ 路埄user-{user_id}/{collection_id}/{document_id}/original{suffix} - │ - └─► 曎新文档元数据object_path +问"之前怎么解决过 Redis 连接问题" +答从笔记《Redis问题排查.md》扟到解决方案 + +问"某䞪技术的最䜳实践是什么" +答从倚䞪文档䞭总结最䜳实践 ``` -#### 1.2 文件验证 +### 2.4 倚暡态内容倄理 -**支持的文件类型** -- 文档`.pdf`, `.doc`, `.docx`, `.ppt`, `.pptx`, `.xls`, `.xlsx` -- 文本`.txt`, `.md`, `.html`, `.json`, `.xml`, `.yaml`, `.yml`, `.csv` -- 囟片`.png`, `.jpg`, `.jpeg`, `.gif`, `.bmp`, `.tiff`, `.tif` -- 音频`.mp3`, `.wav`, `.m4a` -- 压猩包`.zip`, `.tar`, `.gz`, `.tgz` +**场景**产品团队的讟计资料。 -**倧小限制** -- 默讀100 MB可通过 `MAX_DOCUMENT_SIZE` 环境变量配眮 -- 解压后总倧小5 GB`MAX_EXTRACTED_SIZE` +**䞊䌠内容** +- 🎚 UI 讟计皿囟片 +- 📋 产品 PRDWord +- 🎀 甚户访谈圕音 +- 📊 数据分析报告Excel -#### 1.3 重倍检测机制 +**系统倄理** +- 讟计皿 → OCR 提取文字 + Vision 理解讟计意囟 +- PRD → 提取产品需求和功胜点 +- 圕音 → 蜬文字提取甚户反銈 +- 数据报告 → 提取关键指标 -采甹**文件名 + SHA-256 哈垌**双重检测 +**结果**所有内容融合圚䞀起可以绌合检玢 -| 场景 | 文件名 | 哈垌倌 | 系统行䞺 | -|------|--------|--------|----------| -| 完党盞同 | 盞同 | 盞同 | 返回已存圚文档幂等操䜜 | -| 文件名冲突 | 盞同 | 䞍同 | 抛出 `DocumentNameConflictException` | -| 新文档 | 䞍同 | - | 创建新文档记圕 | +## 3. 
䞊䌠䜓验 -**䌘势** -- ✅ 支持幂等䞊䌠眑络重䌠䞍䌚创建重倍文档 -- ✅ 避免内容冲突同名䞍同内容䌚提瀺甚户 -- ✅ 节省存傚空闎盞同内容只存傚䞀次 +### 3.1 批量䞊䌠埈简单 -### 阶段 2: 䞎时存傚配眮 +假讟䜠芁䞊䌠 50 䞪公叞文档 -#### 2.1 对象存傚类型 +**Step 1选择文件10 秒** -系统支持䞀种对象存傚后端可通过环境变量切换 +``` +点击"䞊䌠文档" → 选择 50 䞪 PDF → 点击"匀始䞊䌠" +``` -**1. Local 存傚本地文件系统** +**Step 2快速䞊䌠30 秒** -适甚场景 -- 匀发测试环境 -- 小规暡郚眲 -- 单机郚眲 +``` +进床条1/50, 2/50, 3/50... 50/50 ✅ +所有文件秒䌠到暂存区䞍需芁等埅倄理 +``` -配眮方匏 -```bash -# 匀发环境 -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=.objects +**Step 3预览确讀1 分钟** -# Docker 环境 -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=/shared/objects ``` +查看䞊䌠的文件列衚 +- ✅ 幎床报告.pdf (5.2 MB) +- ✅ 产品手册.pdf (3.1 MB) +- ❌ 䞪人笔记.pdf (䞍该䞊䌠的) → 取消募选 +- ✅ 技术文档.pdf (2.8 MB) +... -存傚路埄瀺䟋 -``` -.objects/ -└── user-google-oauth2-123456/ - └── col_abc123/ - └── doc_xyz789/ - ├── original.pdf # 原始文件 - ├── converted.pdf # 蜬换后的 PDF - ├── processed_content.md # 解析后的 Markdown - ├── chunks/ # 分块数据 - │ ├── chunk_0.json - │ └── chunk_1.json - └── images/ # 提取的囟片 - ├── page_0.png - └── page_1.png +点击"保存到知识库" ``` -**2. S3 存傚兌容 AWS S3/MinIO/OSS 等** +**Step 4后台倄理5-30 分钟** -适甚场景 -- 生产环境 -- 倧规暡郚眲 -- 分垃匏郚眲 -- 需芁高可甚和容灟 +``` +系统自劚倄理 +- 解析文档内容 +- 构建倚种玢匕 +- 䜠可以继续其他工䜜䞍需芁等埅 +``` + +**Step 5完成通知** -配眮方匏 -```bash -OBJECT_STORE_TYPE=s3 -OBJECT_STORE_S3_ENDPOINT=http://127.0.0.1:9000 # MinIO/S3 地址 -OBJECT_STORE_S3_REGION=us-east-1 # AWS Region -OBJECT_STORE_S3_ACCESS_KEY=minioadmin # Access Key -OBJECT_STORE_S3_SECRET_KEY=minioadmin # Secret Key -OBJECT_STORE_S3_BUCKET=aperag # Bucket 名称 -OBJECT_STORE_S3_PREFIX_PATH=dev/ # 可选的路埄前猀 -OBJECT_STORE_S3_USE_PATH_STYLE=true # MinIO 需芁讟眮䞺 true ``` +通知"49 䞪文档倄理完成现圚可以检玢了" +``` + +### 3.2 倄理时闎参考 + +䞍同倧小的文档倄理速床䞍同 + +| 文档类型 | 倧小 | 䞊䌠时闎 | 倄理时闎 | 瀺䟋 | +|---------|------|---------|---------|------| +| 🏃 小文档 | < 5 页 | < 1 秒 | 1-3 分钟 | 通知、邮件 | +| 🚶 䞭型文档 | 10-50 页 | < 3 秒 | 3-10 分钟 | 报告、手册 | +| 🐌 倧型文档 | 100+ 页 | < 10 秒 | 10-30 分钟 | 乊籍、论文集 | -#### 2.2 对象存傚路埄规则 +**关键点** +- ✅ 䞊䌠总是埈快秒级 +- ⏳ 倄理圚后台进行䞍阻塞 +- 📊 可以实时查看倄理进床 + +### 3.3 实时进床查看 + +䞊䌠后可以随时查看文档状态 -**路埄栌匏** ``` -{prefix}/user-{user_id}/{collection_id}/{document_id}/{filename} +文档列衚 + +📄 annual_report.pdf + 状态倄理䞭 (60%) + ├─ ✅ 文档解析完成 + ├─ ✅ 向量玢匕完成 + ├─ 🔄 党文玢匕进行䞭 + └─ ⏳ 囟谱玢匕等埅䞭 + +📄 product_manual.pdf + 状态已完成 ✅ + 可以检玢 + +📄 meeting_notes.pdf + 状态倱莥 ❌ + 错误文件损坏 + 操䜜重新䞊䌠 ``` -**组成郚分** -- `prefix`可选的党局前猀仅 S3 -- `user_id`甚户 ID`|` 替换䞺 `-` -- `collection_id`集合 ID -- `document_id`文档 ID -- `filename`文件名劂 `original.pdf`、`page_0.png` +## 4. 栞心特性 -**倚租户隔犻** -- 每䞪甚户有独立的呜名空闎 -- 每䞪集合有独立的存傚目圕 -- 每䞪文档有独立的文件倹 +ApeRAG 的文档䞊䌠有䞀些独特的特性让䜿甚曎加方䟿。 -### 阶段 3: 文档确讀䞎玢匕构建 +### 4.1 暂存区讟计 -#### 3.1 确讀流皋 +**栞心理念**先䌠后选给䜠"后悔"的机䌚。 + +**就像眑莭** ``` -甚户点击"保存到集合" - │ - â–Œ -前端调甚 confirm API - │ - â–Œ -Service 层倄理 - │ - ├─► 验证集合配眮 - │ - ├─► 检查 Quota确讀阶段才扣陀配额 - │ - └─► 对每䞪 document_id - │ - ├─► 验证文档状态䞺 UPLOADED - │ - ├─► 曎新文档状态UPLOADED → PENDING - │ - ├─► 根据集合配眮创建玢匕记圕 - │ ├─ VECTOR向量玢匕必选 - │ ├─ FULLTEXT党文玢匕必选 - │ ├─ GRAPH知识囟谱可选 - │ ├─ SUMMARY文档摘芁可选 - │ └─ VISION视觉玢匕可选 - │ - └─► 返回确讀结果 - │ - â–Œ -觊发 Celery 任务reconcile_document_indexes - │ - â–Œ -后台匂步倄理玢匕构建 +眑莭流皋 +1. 加入莭物蜊暂存 +2. 查看莭物蜊删陀䞍想芁的 +3. 提亀订单确讀 + +文档䞊䌠 +1. 䞊䌠到暂存区快速䞊䌠 +2. 查看列衚取消䞍需芁的 +3. 保存到知识库确讀添加 ``` -#### 3.2 Quota配额管理 +**奜倄** -**检查时机** -- ❌ 䞍圚䞊䌠阶段检查䞎时存傚䞍占甚配额 -- ✅ 圚确讀阶段检查正匏添加才消耗配额 +- ✅ **快速䞊䌠**20 䞪文件 5 秒䌠完䞍甚等倄理 +- ✅ **选择性添加**䞊䌠 100 䞪只保存需芁的 80 䞪 +- ✅ **节省配额**暂存区的文件䞍占配额 +- ✅ **纠错方䟿**发现错误盎接取消䞍甚删陀 -**配额类型** +### 4.2 智胜倄理 -1. 
**甚户党局配额** - - `max_document_count`甚户总文档数量限制 - - 默讀1000可通过 `MAX_DOCUMENT_COUNT` 配眮 +**自劚识别栌匏** -2. **单集合配额** - - `max_document_count_per_collection`单䞪集合文档数量限制 - - 䞍计入 `UPLOADED` 和 `DELETED` 状态的文档 +系统䌚自劚识别文件类型选择最合适的倄理方匏 -**配额超限倄理** -- 抛出 `QuotaExceededException` -- 返回 HTTP 400 错误 -- 包含圓前甚量和配额䞊限信息 +- 📄 PDF → 提取文字、衚栌、囟片、公匏 +- 📋 Word → 蜬换栌匏、提取内容 +- 📊 Excel → 识别衚栌结构 +- 🎚 囟片 → OCR 文字 + 理解内容 +- 🎀 音频 → 蜬圕成文字 -### 阶段 4: 文档解析䞎栌匏蜬换 +**䜠䞍需芁做任䜕额倖操䜜**系统自劚倄理 -#### 4.1 Parser 架构 +### 4.3 后台倄理 -系统采甚**倚 Parser 铟匏调甚**架构每䞪 Parser 莟莣特定类型的文件解析 +䞊䌠完成后系统圚后台自劚倄理 -``` -DocParser䞻控制噚 - │ - ├─► MinerUParser - │ └─ 功胜高粟床 PDF 解析商䞚 API - │ └─ 支持.pdf - │ - ├─► DocRayParser - │ └─ 功胜文档垃局分析和内容提取 - │ └─ 支持.pdf, .docx, .pptx, .xlsx - │ - ├─► ImageParser - │ └─ 功胜囟片内容识别OCR + 视觉理解 - │ └─ 支持.jpg, .png, .gif, .bmp, .tiff - │ - ├─► AudioParser - │ └─ 功胜音频蜬圕Speech-to-Text - │ └─ 支持.mp3, .wav, .m4a - │ - └─► MarkItDownParser兜底 - └─ 功胜通甚文档蜬 Markdown - └─ 支持几乎所有垞见栌匏 +```mermaid +sequenceDiagram + participant U as 䜠 + participant S as 系统 + + U->>S: 䞊䌠文件 + S-->>U: 秒级返回 ✅ + Note over U: 继续工䜜䞍甚等 + + S->>S: 解析文档... + S->>S: 构建玢匕... + S-->>U: 倄理完成通知 🔔 ``` -#### 4.2 Parser 配眮 +**䌘势** +- 䞍甚等埅䞊䌠完就胜干别的 +- 系统自劚重试倱莥的文档 +- 实时查看倄理进床 -**配眮方匏**通过集合配眮Collection Config劚态控制 +### 4.4 自劚枅理 -```json -{ - "parser_config": { - "use_mineru": false, // 是吊启甚 MinerU需芁 API Token - "use_doc_ray": false, // 是吊启甚 DocRay - "use_markitdown": true, // 是吊启甚 MarkItDown默讀 - "mineru_api_token": "xxx" // MinerU API Token可选 - } -} -``` +暂存区的文件 7 倩没确讀䌚自劚枅理防止占甚存傚空闎。 -**环境变量配眮** -```bash -USE_MINERU_API=false # 党局启甚 MinerU -MINERU_API_TOKEN=your_token # MinerU API Token +## 5. 文档解析原理 + +䞊䌠后系统需芁把文档"读懂"。䞍同栌匏有䞍同的倄理方匏。 + +### 5.1 解析噚工䜜流皋 + +系统有倚䞪解析噚䌚自劚选择最合适的 + +```mermaid +flowchart TD + File[䞊䌠 PDF] --> Try1{尝试 MinerU} + Try1 -->|成功| Result[解析完成] + Try1 -->|倱莥/未配眮| Try2{尝试 DocRay} + Try2 -->|成功| Result + Try2 -->|倱莥/未配眮| Try3[䜿甚 MarkItDown] + Try3 --> Result + + style File fill:#e1f5ff + style Result fill:#c5e1a5 + style Try1 fill:#fff3e0 + style Try2 fill:#fff3e0 + style Try3 fill:#c5e1a5 ``` -#### 4.3 解析流皋 +**解析噚䌘先级** + +1. **MinerU**最区倧商䞚 API需芁付莹 + - 擅长倍杂 PDF、孊术论文、垊公匏的文档 + +2. **DocRay**匀源免莹垃局分析区 + - 擅长衚栌、囟衚、倚列排版 + +3. **MarkItDown**通甚兜底支持所有栌匏 + - 擅长简单文档、文本文件 + +**自劚降级**的奜倄 +- 䌘先甚最奜的解析噚 +- 䞍行就自劚换䞋䞀䞪 +- 总有䞀䞪胜倄理成功 + +**䟋子 1倍杂 PDF** ``` -Celery Worker 收到玢匕任务 - │ - â–Œ -1. 从对象存傚䞋蜜原始文件 - │ - â–Œ -2. 根据文件扩展名选择 Parser - │ - ├─► 尝试第䞀䞪匹配的 Parser - │ ├─ 成功返回解析结果 - │ └─ 倱莥FallbackError → 尝试䞋䞀䞪 Parser - │ - └─► 最终兜底MarkItDownParser - │ - â–Œ -3. 解析结果Parts - │ - ├─► MarkdownPart文本内容 - │ └─ 包含标题、段萜、列衚、衚栌等 - │ - ├─► PdfPartPDF 文件 - │ └─ 甚于线性化、页面枲染 - │ - └─► AssetBinPart二进制资源 - └─ 包含囟片、嵌入的文件等 - │ - â–Œ -4. 后倄理Post-processing - │ - ├─► PDF 页面蜬囟片Vision 玢匕需芁 - │ └─ 每页枲染䞺 PNG 囟片 - │ └─ 保存到 {document_path}/images/page_N.png - │ - ├─► PDF 线性化加速浏览噚加蜜 - │ └─ 䜿甚 pikepdf 䌘化 PDF 结构 - │ └─ 保存到 {document_path}/converted.pdf - │ - └─► 提取文本内容纯文本 - └─ 合并所有 MarkdownPart 内容 - └─ 保存到 {document_path}/processed_content.md - │ - â–Œ -5. 保存到对象存傚 +䞊䌠幎床报告.pdf (50 页有衚栌和囟衚) + ↓ +DocRay 解析噚自劚 +- 📝 提取所有文字内容 +- 📊 识别衚栌保持结构 +- 🎚 提取囟片和囟衚 +- 📐 识别 LaTeX 公匏 + ↓ +埗到 +- 完敎的 Markdown 文档 +- 50 匠页面截囟劂果需芁视觉玢匕 ``` -#### 4.4 栌匏蜬换瀺䟋 +**䟋子 2囟片截囟** -**瀺䟋 1PDF 文档** ``` -蟓入user_manual.pdf (5 MB) - │ - â–Œ -解析噚选择DocRayParser / MarkItDownParser - │ - â–Œ -蟓出 Parts - ├─ MarkdownPart: "# User Manual\n\n## Chapter 1\n..." 
- └─ PdfPart: <原始 PDF 数据> - │ - â–Œ -后倄理 - ├─ 枲染 50 页䞺囟片 → images/page_0.png ~ page_49.png - ├─ 线性化 PDF → converted.pdf - └─ 提取文本 → processed_content.md +䞊䌠product_screenshot.png + ↓ +ImageParser 自劚 +- 📞 OCR 识别囟片䞭的文字 +- 👁 Vision AI 理解囟片内容 + ↓ +埗到 +- 文字"产品名称ApeRAG版本2.0..." +- 描述"这是䞀䞪产品介绍页面包含产品名称、版本号和功胜列衚" ``` -**瀺䟋 2囟片文件** +**䟋子 3䌚议圕音** + ``` -蟓入screenshot.png (2 MB) - │ - â–Œ -解析噚选择ImageParser - │ - â–Œ -蟓出 Parts - ├─ MarkdownPart: "[OCR 提取的文字内容]" - └─ AssetBinPart: <原始囟片数据> (vision_index=true) - │ - â–Œ -后倄理 - └─ 保存原囟副本 → images/file.png +䞊䌠meeting.mp3 (30 分钟) + ↓ +AudioParser 自劚 +- 🎀 语音蜬文字STT +- 📝 生成䌚议记圕 + ↓ +埗到 +- "䌚议匀始。䞻持人匠䞉倧家奜今倩讚论产品规划..." +- 完敎的䌚议文字记圕 ``` -**瀺䟋 3音频文件** +### 5.3 重倍文件倄理 + +系统䌚自劚检测重倍䞊䌠 + ``` -蟓入meeting_record.mp3 (50 MB) - │ - â–Œ -解析噚选择AudioParser - │ - â–Œ -蟓出 Parts - └─ MarkdownPart: "[蜬圕的䌚议内容文本]" - │ - â–Œ -后倄理 - └─ 保存蜬圕文本 → processed_content.md +第䞀次䞊䌠 report.pdf → 创建新文档 ✅ +第二次䞊䌠 report.pdf (内容盞同) → 返回已存圚文档 ✅ +第䞉次䞊䌠 report.pdf (内容䞍同) → 提瀺冲突需重呜名 ⚠ ``` -### 阶段 5: 玢匕构建 +**䌘势** +- 避免重倍文档 +- 眑络重䌠䞍䌚创建倚䞪文档 +- 节省存傚空闎 -#### 5.1 玢匕类型䞎功胜 +## 6. 玢匕构建 -| 玢匕类型 | 是吊必选 | 功胜描述 | 存傚䜍眮 | -|---------|---------|----------|----------| -| **VECTOR** | ✅ 必选 | 向量化检玢支持语义搜玢 | Qdrant / Elasticsearch | -| **FULLTEXT** | ✅ 必选 | 党文检玢支持关键词搜玢 | Elasticsearch | -| **GRAPH** | ❌ 可选 | 知识囟谱提取实䜓和关系 | Neo4j / PostgreSQL | -| **SUMMARY** | ❌ 可选 | 文档摘芁LLM 生成 | PostgreSQL (index_data) | -| **VISION** | ❌ 可选 | 视觉理解囟片内容分析 | Qdrant (向量) + PG (metadata) | +文档解析后系统䌚自劚构建倚种玢匕让䜠可以甚䞍同方匏检玢。 -#### 5.2 玢匕构建流皋 +### 6.1 䞺什么需芁倚种玢匕 + +䞍同的问题需芁䞍同的检玢方匏 ``` -Celery Worker: reconcile_document_indexes 任务 - │ - â–Œ -1. 扫描 DocumentIndex 衚扟到需芁倄理的玢匕 - │ - ├─► PENDING 状态 + observed_version < version - │ └─ 需芁创建或曎新玢匕 - │ - └─► DELETING 状态 - └─ 需芁删陀玢匕 - │ - â–Œ -2. 按文档分组逐䞪倄理 - │ - â–Œ -3. 对每䞪文档 - │ - ├─► parse_document解析文档 - │ ├─ 从对象存傚䞋蜜原始文件 - │ ├─ 调甚 DocParser 解析 - │ └─ 返回 ParsedDocumentData - │ - └─► 对每䞪玢匕类型 - │ - ├─► create_index (创建/曎新玢匕) - │ │ - │ ├─ VECTOR 玢匕 - │ │ ├─ 文档分块Chunking - │ │ ├─ Embedding 暡型生成向量 - │ │ └─ 写入 Qdrant - │ │ - │ ├─ FULLTEXT 玢匕 - │ │ ├─ 提取纯文本内容 - │ │ ├─ 按段萜/章节分块 - │ │ └─ 写入 Elasticsearch - │ │ - │ ├─ GRAPH 玢匕 - │ │ ├─ 䜿甚 LightRAG 提取实䜓 - │ │ ├─ 提取实䜓闎关系 - │ │ └─ 写入 Neo4j/PostgreSQL - │ │ - │ ├─ SUMMARY 玢匕 - │ │ ├─ 调甚 LLM 生成摘芁 - │ │ └─ 保存到 DocumentIndex.index_data - │ │ - │ └─ VISION 玢匕 - │ ├─ 提取囟片 Assets - │ ├─ Vision LLM 理解囟片内容 - │ ├─ 生成囟片描述向量 - │ └─ 写入 Qdrant - │ - └─► 曎新玢匕状态 - ├─ 成功CREATING → ACTIVE - └─ 倱莥CREATING → FAILED - │ - â–Œ -4. 曎新文档总䜓状态 - │ - ├─ 所有玢匕郜 ACTIVE → Document.status = COMPLETE - ├─ 任䞀玢匕 FAILED → Document.status = FAILED - └─ 郚分玢匕仍圚倄理 → Document.status = RUNNING -``` +问"劂䜕䌘化数据库性胜" +→ 需芁向量玢匕语义盞䌌搜玢 -#### 5.3 文档分块Chunking +问"PostgreSQL 配眮文件圚哪" +→ 需芁党文玢匕粟确关键词搜玢 -**分块策略** -- 递園字笊分割RecursiveCharacterTextSplitter -- 按自然段萜、章节䌘先切分 -- 保留䞊䞋文重叠Overlap +问"匠䞉和李四是什么关系" +→ 需芁囟谱玢匕关系查询 -**分块参数** -```json -{ - "chunk_size": 1000, // 每块最倧字笊数 - "chunk_overlap": 200, // 重叠字笊数 - "separators": ["\n\n", "\n", " ", ""] // 分隔笊䌘先级 -} -``` +问"这䞪文档䞻芁讲什么" +→ 需芁摘芁玢匕快速抂览 -**分块结果存傚** -``` -{document_path}/chunks/ - ├─ chunk_0.json: {"text": "...", "metadata": {...}} - ├─ chunk_1.json: {"text": "...", "metadata": {...}} - └─ ... 
+问"这匠囟片里有什么" +→ 需芁视觉玢匕囟片内容搜玢 ``` -## 数据库讟计 - -### 衚 1: document文档元数据 - -**衚结构** - -| 字段名 | 类型 | 诎明 | 玢匕 | -|--------|------|------|------| -| `id` | String(24) | 文档 ID䞻键栌匏`doc{random_id}` | PK | -| `name` | String(1024) | 文件名 | - | -| `user` | String(256) | 甚户 ID支持倚种 IDP | ✅ Index | -| `collection_id` | String(24) | 所属集合 ID | ✅ Index | -| `status` | Enum | 文档状态见䞋衚 | ✅ Index | -| `size` | BigInteger | 文件倧小字节 | - | -| `content_hash` | String(64) | SHA-256 哈垌甚于去重 | ✅ Index | -| `object_path` | Text | 对象存傚路埄已废匃甚 doc_metadata | - | -| `doc_metadata` | Text | 文档元数据JSON 字笊䞲 | - | -| `gmt_created` | DateTime(tz) | 创建时闎UTC | - | -| `gmt_updated` | DateTime(tz) | 曎新时闎UTC | - | -| `gmt_deleted` | DateTime(tz) | 删陀时闎蜯删陀 | ✅ Index | - -**唯䞀纊束** -```sql -UNIQUE INDEX uq_document_collection_name_active - ON document (collection_id, name) - WHERE gmt_deleted IS NULL; -``` -- 同䞀集合内掻跃文档的名称䞍胜重倍 -- 已删陀的文档䞍参䞎唯䞀性检查 - -**文档状态枚䞟**`DocumentStatus` - -| 状态 | 诎明 | 䜕时讟眮 | 可见性 | -|------|------|----------|--------| -| `UPLOADED` | 已䞊䌠到䞎时存傚 | `upload_document` 接口 | 前端文件选择界面 | -| `PENDING` | 等埅玢匕构建 | `confirm_documents` 接口 | 文档列衚倄理䞭 | -| `RUNNING` | 玢匕构建䞭 | Celery 任务匀始倄理 | 文档列衚倄理䞭 | -| `COMPLETE` | 所有玢匕完成 | 所有玢匕变䞺 ACTIVE | 文档列衚可甚 | -| `FAILED` | 玢匕构建倱莥 | 任䞀玢匕倱莥 | 文档列衚倱莥 | -| `DELETED` | 已删陀 | `delete_document` 接口 | 䞍可见蜯删陀 | -| `EXPIRED` | 䞎时文档过期 | 定时枅理任务 | 䞍可见 | - -**文档元数据瀺䟋**`doc_metadata` JSON 字段 -```json -{ - "object_path": "user-xxx/col_xxx/doc_xxx/original.pdf", - "converted_path": "user-xxx/col_xxx/doc_xxx/converted.pdf", - "processed_content_path": "user-xxx/col_xxx/doc_xxx/processed_content.md", - "images": [ - "user-xxx/col_xxx/doc_xxx/images/page_0.png", - "user-xxx/col_xxx/doc_xxx/images/page_1.png" - ], - "parser_used": "DocRayParser", - "parse_duration_ms": 5420, - "page_count": 50, - "custom_field": "value" -} -``` +### 6.2 五种玢匕 -### 衚 2: document_index玢匕状态管理 - -**衚结构** - -| 字段名 | 类型 | 诎明 | 玢匕 | -|--------|------|------|------| -| `id` | Integer | 自增 ID䞻键 | PK | -| `document_id` | String(24) | 关联的文档 ID | ✅ Index | -| `index_type` | Enum | 玢匕类型见䞋衚 | ✅ Index | -| `status` | Enum | 玢匕状态见䞋衚 | ✅ Index | -| `version` | Integer | 玢匕版本号 | - | -| `observed_version` | Integer | 已倄理的版本号 | - | -| `index_data` | Text | 玢匕数据JSON劂摘芁内容 | - | -| `error_message` | Text | 错误信息倱莥时 | - | -| `gmt_created` | DateTime(tz) | 创建时闎 | - | -| `gmt_updated` | DateTime(tz) | 曎新时闎 | - | -| `gmt_last_reconciled` | DateTime(tz) | 最后协调时闎 | - | - -**唯䞀纊束** -```sql -UNIQUE CONSTRAINT uq_document_index - ON document_index (document_id, index_type); -``` -- 每䞪文档的每种玢匕类型只有䞀条记圕 - -**玢匕类型枚䞟**`DocumentIndexType` - -| 类型 | 倌 | 诎明 | 倖郚存傚 | -|------|-----|------|----------| -| `VECTOR` | "VECTOR" | 向量玢匕 | Qdrant / Elasticsearch | -| `FULLTEXT` | "FULLTEXT" | 党文玢匕 | Elasticsearch | -| `GRAPH` | "GRAPH" | 知识囟谱 | Neo4j / PostgreSQL | -| `SUMMARY` | "SUMMARY" | 文档摘芁 | PostgreSQL (index_data) | -| `VISION` | "VISION" | 视觉玢匕 | Qdrant + PostgreSQL | - -**玢匕状态枚䞟**`DocumentIndexStatus` - -| 状态 | 诎明 | 䜕时讟眮 | -|------|------|----------| -| `PENDING` | 等埅倄理 | `confirm_documents` 创建玢匕记圕 | -| `CREATING` | 创建䞭 | Celery Worker 匀始倄理 | -| `ACTIVE` | 就绪可甚 | 玢匕构建成功 | -| `DELETING` | 标记删陀 | `delete_document` 接口 | -| `DELETION_IN_PROGRESS` | 删陀䞭 | Celery Worker 正圚删陀 | -| `FAILED` | 倱莥 | 玢匕构建倱莥 | - -**版本控制机制** -- `version`期望的玢匕版本每次文档曎新时 +1 -- `observed_version`已倄理的版本号 -- `version > observed_version` 时觊发玢匕曎新 - -**协调噚Reconciler** -```python -# 查询需芁倄理的玢匕 -SELECT * FROM document_index -WHERE status = 
'PENDING' - AND observed_version < version; - -# 倄理后曎新 -UPDATE document_index -SET status = 'ACTIVE', - observed_version = version, - gmt_last_reconciled = NOW() -WHERE id = ?; +```mermaid +flowchart TB + Doc[䜠的文档] --> Auto[系统自劚构建] + + Auto --> V[向量玢匕
扟盞䌌内容] + Auto --> F[党文玢匕
扟关键词] + Auto --> G[囟谱玢匕
扟关系] + Auto --> S[摘芁玢匕
快速了解] + Auto --> I[视觉玢匕
扟囟片] + + V --> Q1[问劂䜕䌘化性胜] + F --> Q2[问配眮文件路埄] + G --> Q3[问A 和 B 的关系] + S --> Q4[问文档讲什么] + I --> Q5[问囟片里有什么] + + style Doc fill:#e1f5ff + style Auto fill:#fff59d + style V fill:#bbdefb + style F fill:#c5e1a5 + style G fill:#ffccbc + style S fill:#e1bee7 + style I fill:#fff9c4 ``` -### 衚关系囟 +**玢匕对比** -``` -┌─────────────────────────────────┐ -│ collection │ -│ ───────────────────────────── │ -│ id (PK) │ -│ name │ -│ config (JSON) │ -│ status │ -│ ... │ -└────────────┬────────────────────┘ - │ 1:N - â–Œ -┌─────────────────────────────────┐ -│ document │ -│ ───────────────────────────── │ -│ id (PK) │ -│ collection_id (FK) │◄──── 唯䞀纊束: (collection_id, name) -│ name │ -│ user │ -│ status (Enum) │ -│ size │ -│ content_hash (SHA-256) │ -│ doc_metadata (JSON) │ -│ gmt_created │ -│ gmt_deleted │ -│ ... │ -└────────────┬────────────────────┘ - │ 1:N - â–Œ -┌─────────────────────────────────┐ -│ document_index │ -│ ───────────────────────────── │ -│ id (PK) │ -│ document_id (FK) │◄──── 唯䞀纊束: (document_id, index_type) -│ index_type (Enum) │ -│ status (Enum) │ -│ version │ -│ observed_version │ -│ index_data (JSON) │ -│ error_message │ -│ gmt_last_reconciled │ -│ ... │ -└─────────────────────────────────┘ -``` +| 玢匕 | 必须 | 适合问题 | 速床 | +|------|------|---------|------| +| 向量 | ✅ | 语义盞䌌 | å¿« | +| å…šæ–‡ | ✅ | 粟确关键词 | å¿« | +| 囟谱 | ❌ | 关系查询 | 慢 | +| 摘芁 | ❌ | 快速了解 | äž­ | +| 视觉 | ❌ | 囟片内容 | äž­ | -## 状态机䞎生呜呚期 +**掚荐配眮** -### 文档状态蜬换 +- 💰 节省成本只启甚向量 + å…šæ–‡ +- ⚡ 远求速床犁甚囟谱最慢 +- 🎯 功胜完敎党郚启甚 + +### 6.3 并行构建 + +倚种玢匕可以同时构建节省时闎 ``` - ┌─────────────────────────────────────────────┐ - │ │ - │ â–Œ - [䞊䌠文件] ──► UPLOADED ──► [确讀] ──► PENDING ──► RUNNING ──► COMPLETE - │ │ - │ â–Œ - │ FAILED - │ │ - │ â–Œ - └──────► [删陀] ──────────────► DELETED - │ - ┌───────────────────────────────────┘ - │ - â–Œ - EXPIRED (定时枅理未确讀的文档) +文档解析完成 + ↓ +5 种玢匕同时匀始构建 +- 向量玢匕1 分钟 +- 党文玢匕30 秒 +- 囟谱玢匕10 分钟 ⏱ (最慢) +- 摘芁玢匕3 分钟 +- 视觉玢匕2 分钟 + ↓ +总时闎10 分钟最慢的那䞪 +劂果䞲行16.5 分钟 + +节省40% 时闎 ``` -**关键蜬换** -1. **UPLOADED → PENDING**甚户点击"保存到集合" -2. **PENDING → RUNNING**Celery Worker 匀始倄理 -3. **RUNNING → COMPLETE**所有玢匕郜成功 -4. **RUNNING → FAILED**任䞀玢匕倱莥 -5. **任䜕状态 → DELETED**甚户删陀文档 +### 6.4 自劚重试 -### 玢匕状态蜬换 +劂果某䞪玢匕构建倱莥系统䌚自劚重试 ``` - [创建玢匕记圕] ──► PENDING ──► CREATING ──► ACTIVE - │ - â–Œ - FAILED - │ - â–Œ - ┌──────────► PENDING (重试) - │ - [删陀请求] ──────┌──────────► DELETING ──► DELETION_IN_PROGRESS ──► (记圕删陀) - │ - └──────────► (盎接删陀记圕劂果 PENDING/FAILED) +第 1 次1 分钟后重试 +第 2 次5 分钟后重试 +第 3 次15 分钟后重试 +仍倱莥 → 标记䞺倱莥通知甚户 ``` -## 匂步任务调床Celery - -### 任务定义 +倧郚分䞎时错误眑络问题、服务重启郜胜自劚恢倍 -**䞻任务**`reconcile_document_indexes` -- 觊发时机 - - `confirm_documents` 接口调甚后 - - 定时任务每 30 秒 - - 手劚觊发管理界面 -- 功胜扫描 `document_index` 衚倄理需芁协调的玢匕 +## 7. 技术实现 -**子任务** -- `parse_document_task`解析文档内容 -- `create_vector_index_task`创建向量玢匕 -- `create_fulltext_index_task`创建党文玢匕 -- `create_graph_index_task`创建知识囟谱玢匕 -- `create_summary_index_task`创建摘芁玢匕 -- `create_vision_index_task`创建视觉玢匕 +> 💡 **阅读建议**这䞀章是技术细节䞻芁面向匀发者和运绎人员。普通甚户可以跳过。 -### 任务调床策略 +### 7.1 存傚架构 -**并发控制** -- 每䞪 Worker 最倚同时倄理 N 䞪文档默讀 4 -- 每䞪文档的倚䞪玢匕可以并行构建 -- 䜿甚 Celery 的 `task_acks_late=True` 确保任务䞍䞢倱 +**文件存傚䜍眮** -**倱莥重试** -- 最倚重试 3 次 -- 指数退避1分钟 → 5分钟 → 15分钟 -- 3 次倱莥后标记䞺 `FAILED` - -**幂等性** -- 所有任务支持重倍执行 -- 䜿甚 `observed_version` 机制避免重倍倄理 -- 盞同蟓入产生盞同蟓出 +``` +本地存傚匀发 +.objects/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png -## 讟计特点䞎䌘势 +云存傚生产 +s3://bucket/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png +``` -### 1. 
䞀阶段提亀讟计 +**配眮** -**䌘势** -- ✅ **甚户䜓验曎奜**快速䞊䌠响应䞍阻塞甚户操䜜 -- ✅ **选择性添加**批量䞊䌠后可选择性确讀郚分文件 -- ✅ **资源控制合理**未确讀的文档䞍构建玢匕䞍消耗配额 -- ✅ **故障恢倍友奜**䞎时文档可以定期枅理䞍圱响䞚务 +```bash +# 本地存傚 +export OBJECT_STORE_TYPE=local -**状态隔犻** -``` -䞎时状态UPLOADED - - 䞍计入配额 - - 䞍觊发玢匕 - - 可以被自劚枅理 - -正匏状态PENDING/RUNNING/COMPLETE - - 计入配额 - - 觊发玢匕构建 - - 䞍䌚被自劚枅理 +# 云存傚S3/MinIO +export OBJECT_STORE_TYPE=s3 +export OBJECT_STORE_S3_BUCKET=aperag ``` -### 2. 幂等性讟计 +### 7.2 解析噚配眮 -**文件级别幂等** -- SHA-256 哈垌去重 -- 盞同文件倚次䞊䌠返回同䞀 `document_id` -- 避免存傚空闎浪莹 +**启甚䞍同解析噚** -**接口级别幂等** -- `upload_document`重倍䞊䌠返回已存圚文档 -- `confirm_documents`重倍确讀䞍䌚创建重倍玢匕 -- `delete_document`重倍删陀返回成功蜯删陀 +```bash +# DocRay掚荐免莹效果奜 +export USE_DOC_RAY=true +export DOCRAY_HOST=http://docray:8639 -### 3. 倚租户隔犻 +# MinerU可选付莹粟床最高 +export USE_MINERU_API=false +export MINERU_API_TOKEN=your_token -**存傚隔犻** -``` -user-{user_A}/... # 甚户 A 的文件 -user-{user_B}/... # 甚户 B 的文件 +# MarkItDown默讀启甚兜底 +export USE_MARKITDOWN=true ``` -**数据库隔犻** -- 所有查询郜垊 `user` 字段过滀 -- 集合级别的权限控制`collection.user` -- 蜯删陀支持`gmt_deleted` +**选择建议** +- 💰 免莹方案DocRay + MarkItDown +- 🎯 高粟床MinerU + DocRay + MarkItDown -### 4. 灵掻的存傚后端 +### 7.3 玢匕配眮 -**统䞀接口** -```python -AsyncObjectStore: - - put(path, data) - - get(path) - - delete_objects_by_prefix(prefix) +圚 Collection 配眮䞭控制启甚哪些玢匕 + +```json +{ + "enable_vector": true, // 向量玢匕必选 + "enable_fulltext": true, // 党文玢匕必选 + "enable_knowledge_graph": true, // 囟谱玢匕可选 + "enable_summary": false, // 摘芁玢匕可选 + "enable_vision": false // 视觉玢匕可选 +} ``` -**运行时切换** -- 通过环境变量切换 Local/S3 -- 无需修改䞚务代码 -- 支持自定义存傚后端实现接口即可 +### 7.4 性胜调䌘 -### 5. 事务䞀臎性 +**文件倧小限制** -**数据库 + 对象存傚的䞀阶段提亀** -```python -async with transaction: - # 1. 创建数据库记圕 - document = create_document_record() - - # 2. 䞊䌠到对象存傚 - await object_store.put(path, data) - - # 3. 曎新元数据 - document.doc_metadata = json.dumps(metadata) - - # 所有操䜜成功才提亀任䞀倱莥则回滚 +```bash +export MAX_DOCUMENT_SIZE=104857600 # 100 MB +export MAX_EXTRACTED_SIZE=5368709120 # 5 GB ``` -**倱莥倄理** -- 数据库记圕创建倱莥䞍䞊䌠文件 -- 文件䞊䌠倱莥回滚数据库记圕 -- 元数据曎新倱莥回滚前面的操䜜 +**并发讟眮** + +```bash +export CELERY_WORKER_CONCURRENCY=16 # 并发倄理 16 䞪文档 +export CELERY_TASK_TIME_LIMIT=3600 # 单䞪任务超时 1 小时 +``` -### 6. 可观测性 +**配额讟眮** -**审计日志** -- `@audit` 装饰噚记圕所有文档操䜜 -- 包含甚户、时闎、操䜜类型、资源 ID +```bash +export MAX_DOCUMENT_COUNT=1000 # 甚户最倚 1000 䞪文档 +export MAX_DOCUMENT_COUNT_PER_COLLECTION=100 # 单集合最倚 100 䞪 +``` -**任务远螪** -- `gmt_last_reconciled`最后倄理时闎 -- `error_message`倱莥原因 -- Celery 任务 ID关联日志远螪 +## 8. 垞见问题 -**监控指标** -- 文档䞊䌠速率 -- 玢匕构建耗时 -- 倱莥率统计 +### 8.1 文件䞊䌠倱莥 -## 性胜䌘化 +**可胜原因和解决方法** -### 1. 匂步倄理 +| 问题 | 原因 | 解决方法 | +|------|------|---------| +| 文件倪倧 | 超过 100 MB | 压猩或分割文件 | +| 栌匏䞍支持 | 特殊栌匏 | 蜬换成 PDF 或其他垞见栌匏 | +| 同名冲突 | 已存圚同名䞍同内容文件 | 重呜名文件 | +| 配额已满 | 蟟到文档数量䞊限 | 删陀旧文档或升级配额 | -**䞊䌠䞍阻塞** -- 文件䞊䌠到对象存傚后立即返回 -- 玢匕构建圚 Celery 䞭匂步执行 -- 前端通过蜮询或 WebSocket 获取进床 +### 8.2 文档倄理倱莥 -### 2. 批量操䜜 +系统䌚自劚重试 3 次劂果仍倱莥 -**批量确讀** -```python -confirm_documents(document_ids=[id1, id2, ..., idN]) ``` -- 䞀次事务倄理倚䞪文档 -- 批量创建玢匕记圕 -- 减少数据库埀返 - -### 3. 猓存策略 - -**解析结果猓存** -- 解析后的内容保存到 `processed_content.md` -- 后续玢匕重建可盎接读取无需重新解析 - -**分块结果猓存** -- 分块结果保存到 `chunks/` 目圕 -- 向量玢匕重建可倍甚分块结果 - -### 4. 
并行玢匕构建 - -**倚玢匕并行** -```python -# VECTOR、FULLTEXT、GRAPH 可以并行构建 -await asyncio.gather( - create_vector_index(), - create_fulltext_index(), - create_graph_index() -) +查看错误信息 → 根据提瀺修倍 → 重新䞊䌠 → 系统自劚重试 ``` -## 错误倄理 - -### 垞见匂垞 +垞见错误 +- 文件损坏 → 重新制䜜文件 +- 内容无法识别 → 尝试蜬换栌匏 +- 䞎时眑络问题 → 系统䌚自劚重试 -| 匂垞类型 | HTTP 状态码 | 觊发场景 | 倄理建议 | -|---------|------------|----------|----------| -| `ResourceNotFoundException` | 404 | 集合/文档䞍存圚 | 检查 ID 是吊正确 | -| `CollectionInactiveException` | 400 | 集合未激掻 | 等埅集合初始化完成 | -| `DocumentNameConflictException` | 409 | 同名䞍同内容 | 重呜名文件或删陀旧文档 | -| `QuotaExceededException` | 429 | 配额超限 | 升级套逐或删陀旧文档 | -| `InvalidFileTypeException` | 400 | 䞍支持的文件类型 | 查看支持的文件类型列衚 | -| `FileSizeTooLargeException` | 413 | 文件过倧 | 分割文件或压猩 | +### 8.3 劂䜕加快倄理速床 -### 匂垞䌠播 +**方法 1**犁甚䞍需芁的玢匕 -``` -Service Layer 抛出匂垞 - │ - â–Œ -View Layer 捕获并蜬换 - │ - â–Œ -Exception Handler 统䞀倄理 - │ - â–Œ -返回标准 JSON 响应 +```json { - "error_code": "QUOTA_EXCEEDED", - "message": "Document count limit exceeded", - "details": { - "limit": 1000, - "current": 1000 - } + "enable_knowledge_graph": false // 囟谱最慢可选犁甚 } ``` -## 盞关文件玢匕 - -### 栞心实现 +**方法 2**䜿甚曎快的 LLM 暡型 -- **View 层**`aperag/views/collections.py` - HTTP 接口定义 -- **Service 层**`aperag/service/document_service.py` - 䞚务逻蟑 -- **数据库暡型**`aperag/db/models.py` - Document, DocumentIndex 衚定义 -- **数据库操䜜**`aperag/db/ops.py` - CRUD 操䜜封装 +圚 Collection 配眮䞭选择响应曎快的暡型。 -### 对象存傚 +### 8.4 暂存区文件䌚䞢倱吗 -- **接口定义**`aperag/objectstore/base.py` - AsyncObjectStore 抜象类 -- **Local 实现**`aperag/objectstore/local.py` - 本地文件系统存傚 -- **S3 实现**`aperag/objectstore/s3.py` - S3 兌容存傚 +- ✅ 7 倩内䞍䌚䞢倱可以随时确讀 +- ⚠ 7 倩后自劚枅理节省存傚 +- 💡 建议䞊䌠后及时确讀 -### 文档解析 +## 9. 总结 -- **䞻控制噚**`aperag/docparser/doc_parser.py` - DocParser -- **Parser 实现** - - `aperag/docparser/mineru_parser.py` - MinerU PDF 解析 - - `aperag/docparser/docray_parser.py` - DocRay 文档解析 - - `aperag/docparser/markitdown_parser.py` - MarkItDown 通甚解析 - - `aperag/docparser/image_parser.py` - 囟片 OCR - - `aperag/docparser/audio_parser.py` - 音频蜬圕 -- **文档倄理**`aperag/index/document_parser.py` - 解析流皋猖排 +ApeRAG 的文档䞊䌠让䜠可以蜻束地把各种栌匏的文档添加到知识库。 -### 玢匕构建 +### 栞心䌘势 -- **玢匕管理**`aperag/index/manager.py` - DocumentIndexManager -- **向量玢匕**`aperag/index/vector_index.py` - VectorIndexer -- **党文玢匕**`aperag/index/fulltext_index.py` - FulltextIndexer -- **知识囟谱**`aperag/index/graph_index.py` - GraphIndexer -- **文档摘芁**`aperag/index/summary_index.py` - SummaryIndexer -- **视觉玢匕**`aperag/index/vision_index.py` - VisionIndexer +1. ✅ **支持 20+ 种栌匏**PDF、Word、Excel、囟片、音频等 +2. ✅ **秒级䞊䌠响应**䞍甚等埅立即返回 +3. ✅ **暂存区讟计**先䌠后选避免误操䜜 +4. ✅ **智胜解析**自劚识别栌匏选择最䜳解析噚 +5. ✅ **倚玢匕构建**同时构建 5 种玢匕满足䞍同检玢需求 +6. ✅ **后台倄理**匂步执行䞍阻塞甚户 +7. ✅ **自劚重试**倱莥自劚重试提高成功率 +8. ✅ **配额管理**确讀时才消耗合理控制资源 -### 任务调床 +### 性胜衚现 -- **任务定义**`config/celery_tasks.py` - Celery 任务泚册 -- **协调噚**`aperag/tasks/reconciler.py` - DocumentIndexReconciler -- **文档任务**`aperag/tasks/document.py` - DocumentIndexTask +| 操䜜 | æ—¶é—Ž | +|------|------| +| 䞊䌠 100 䞪文件 | < 1 分钟 | +| 确讀添加 | < 1 秒 | +| 小文档倄理< 10 页 | 1-3 分钟 | +| 䞭型文档10-50 页 | 3-10 分钟 | +| 倧型文档100+ 页 | 10-30 分钟 | -### 前端实现 +### 适甚场景 -- **文档列衚**`web/src/app/workspace/collections/[collectionId]/documents/page.tsx` -- **文档䞊䌠**`web/src/app/workspace/collections/[collectionId]/documents/upload/document-upload.tsx` +- 📚 䌁䞚知识库建讟 +- 🔬 研究资料敎理 +- 📖 䞪人笔记管理 +- 🎓 孊习资料園档 -## 总结 +敎䞪系统既**简单易甚**又**功胜区倧**适合各种规暡的知识管理需求。 -ApeRAG 的文档䞊䌠暡块采甚**䞀阶段提亀 + 倚 Parser 铟匏调甚 + 倚玢匕并行构建**的架构讟计 +--- -**栞心特性** -1. ✅ **䞀阶段提亀**䞊䌠䞎时存傚→ 确讀正匏添加提䟛曎奜的甚户䜓验 -2. ✅ **SHA-256 去重**避免重倍文档支持幂等䞊䌠 -3. 
✅ **灵掻存傚后端**Local/S3 可配眮切换统䞀接口抜象 -4. ✅ **倚 Parser 架构**支持 MinerU、DocRay、MarkItDown 等倚种解析噚 -5. ✅ **栌匏自劚蜬换**PDF→囟片、音频→文本、囟片→OCR 文本 -6. ✅ **倚玢匕协调**向量、党文、囟谱、摘芁、视觉五种玢匕类型 -7. ✅ **配额管理**确讀阶段才扣陀配额合理控制资源 -8. ✅ **匂步倄理**Celery 任务队列䞍阻塞甚户操䜜 -9. ✅ **事务䞀臎性**数据库 + 对象存傚的䞀阶段提亀 -10. ✅ **可观测性**审计日志、任务远螪、错误信息完敎记圕 +## 盞关文档 -这种讟计既保证了高性胜和可扩展性又支持倍杂的文档倄理场景倚栌匏、倚语蚀、倚暡态同时具有良奜的容错胜力和甚户䜓验。 +- 📋 [系统架构](./architecture.md) - ApeRAG 敎䜓架构讟计 +- 📖 [囟玢匕构建流皋](./graph_index_creation.md) - 囟谱玢匕诊解 +- 🔗 [玢匕铟路架构](./indexing_architecture.md) - 完敎玢匕流皋 diff --git a/scripts/sync-docs.py b/scripts/sync-docs.py index 1ec151b9..b1ab100e 100755 --- a/scripts/sync-docs.py +++ b/scripts/sync-docs.py @@ -77,7 +77,7 @@ SYNC_WHITELIST = [ # English docs - Design "en-US/design/architecture.md", - # "en-US/design/document_upload_design.md", + "en-US/design/document_upload_design.md", "en-US/design/graph_index_creation.md", # "en-US/design/chat_history_design.md", @@ -93,7 +93,7 @@ # Chinese docs - Design "zh-CN/design/architecture.md", - # "zh-CN/design/document_upload_design.md", + "zh-CN/design/document_upload_design.md", "zh-CN/design/graph_index_creation.md", # "zh-CN/design/chat_history_design.md", diff --git a/web/docs/en-US/design/document_upload_design.md b/web/docs/en-US/design/document_upload_design.md index fa5c2754..5de9cbaf 100644 --- a/web/docs/en-US/design/document_upload_design.md +++ b/web/docs/en-US/design/document_upload_design.md @@ -1,227 +1,710 @@ --- -title: Document Upload Architecture Design -description: Detailed explanation of ApeRAG document upload module's complete architecture design, including upload process, temporary storage configuration, document parsing, format conversion, database design, etc. -keywords: [document upload, architecture, object store, parser, index building, two-phase commit] +title: Document Upload Design +description: Complete process and core design of ApeRAG document upload +keywords: Document Upload, Multi-format Support, Document Parsing, Smart Indexing --- -# ApeRAG Document Upload Architecture Design - -## Overview - -This document details the complete architecture design of the document upload module in the ApeRAG project, covering the full pipeline from file upload, temporary storage, document parsing, format conversion to final index construction. - -**Core Design Philosophy**: Adopts a **two-phase commit** pattern, separating file upload (temporary storage) from document confirmation (formal addition), providing better user experience and resource management capabilities. 
- -## System Architecture - -### Overall Architecture - -``` -┌─────────────────────────────────────────────────────────────┐ -│ Frontend │ -│ (Next.js) │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1: Upload │ Step 2: Confirm - │ POST /documents/upload │ POST /documents/confirm - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ View Layer: aperag/views/collections.py │ -│ - HTTP request handling │ -│ - JWT authentication │ -│ - Parameter validation │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ document_service.upload_document() │ document_service.confirm_documents() - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Service Layer: aperag/service/document_service.py │ -│ - Business logic orchestration │ -│ - File validation (type, size) │ -│ - SHA-256 hash deduplication │ -│ - Quota checking │ -│ - Transaction management │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1 │ Step 2 - â–Œ â–Œ -┌────────────────────────┐ ┌────────────────────────────┐ -│ 1. Create Document │ │ 1. Update Document status │ -│ status=UPLOADED │ │ UPLOADED → PENDING │ -│ 2. Save to ObjectStore│ │ 2. Create DocumentIndex │ -│ 3. Calculate hash │ │ 3. Trigger indexing tasks │ -└────────┬───────────────┘ └────────┬───────────────────┘ - │ │ - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Storage Layer │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ ┌─────────────┐ │ -│ │ PostgreSQL │ │ Object Store │ │ Vector DB │ │ -│ │ │ │ │ │ │ │ -│ │ - document │ │ - Local/S3 │ │ - Qdrant │ │ -│ │ - document_ │ │ - Original files │ │ - Vectors │ │ -│ │ index │ │ - Converted files│ │ │ │ -│ └───────────────┘ └──────────────────┘ └─────────────┘ │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ │ -│ │ Elasticsearch │ │ Neo4j/PG │ │ -│ │ │ │ │ │ -│ │ - Full-text │ │ - Knowledge Graph│ │ -│ └───────────────┘ └──────────────────┘ │ -└─────────────────────────────────────────────────────────────┘ - │ - â–Œ - ┌───────────────────┐ - │ Celery Workers │ - │ │ - │ - Doc parsing │ - │ - Format convert │ - │ - Content extract│ - │ - Doc chunking │ - │ - Index building │ - └───────────────────┘ -``` - -### Layered Architecture - -``` -┌─────────────────────────────────────────────┐ -│ View Layer (views/collections.py) │ HTTP handling, auth, validation -└─────────────────┬───────────────────────────┘ - │ calls -┌─────────────────▌───────────────────────────┐ -│ Service Layer (service/document_service.py)│ Business logic, transaction, permission -└─────────────────┬───────────────────────────┘ - │ calls -┌─────────────────▌───────────────────────────┐ -│ Repository Layer (db/ops.py, objectstore/) │ Data access abstraction -└─────────────────┬───────────────────────────┘ - │ accesses -┌─────────────────▌───────────────────────────┐ -│ Storage Layer (PG, S3, Qdrant, ES, Neo4j) │ Data persistence -└─────────────────────────────────────────────┘ -``` - -## Core Process Details - -For the complete documentation including: -- API Interface definitions -- File upload and temporary storage -- Document confirmation and index building -- Parser architecture and format conversion -- Index building flow -- Database design (document and document_index tables) -- State machine and lifecycle -- Async task scheduling (Celery) -- Design features and advantages -- Performance optimization -- Error handling - -Please refer to the main design document at 
`/docs/en-US/design/document_upload_design.md`. - -## Quick Reference - -### API Endpoints - -1. **Upload File**: `POST /api/v1/collections/{collection_id}/documents/upload` -2. **Confirm Documents**: `POST /api/v1/collections/{collection_id}/documents/confirm` -3. **One-step Upload**: `POST /api/v1/collections/{collection_id}/documents` - -### Document Status Flow - -``` -[Upload] → UPLOADED → [Confirm] → PENDING → RUNNING → COMPLETE - ↓ ↓ - [Delete] FAILED - ↓ ↓ - DELETED ←──────────────┘ -``` - -### Object Storage Configuration - -**Local Storage**: +# Document Upload Design + +## 1. What is Document Upload + +Document upload is the entry point of ApeRAG, allowing you to add various formats of documents to your knowledge base. The system automatically processes, indexes, and makes this knowledge searchable and conversational. + +### 1.1 What Can You Upload + +ApeRAG supports 20+ document formats, covering virtually all file types used in daily work: + +```mermaid +flowchart LR + subgraph Input[📁 Your Documents] + A1[PDF Reports] + A2[Word Docs] + A3[Excel Sheets] + A4[Screenshots] + A5[Meeting Recordings] + A6[Markdown Notes] + end + + subgraph Process[🔄 ApeRAG Auto Processing] + B[Recognize Format
Extract Content
Build Indexes]
+  end
+
+  subgraph Output[✚ Searchable Knowledge]
+    C[Answer Questions
Find Information
Analyze Relationships] + end + + A1 --> B + A2 --> B + A3 --> B + A4 --> B + A5 --> B + A6 --> B + + B --> C + + style Input fill:#e3f2fd + style Process fill:#fff59d + style Output fill:#c8e6c9 +``` + +**Document Types**: + +| Category | Formats | Typical Use | +|----------|---------|-------------| +| **Office Docs** | PDF, Word, PPT, Excel | Annual reports, meeting minutes, data sheets | +| **Text Files** | TXT, MD, HTML, JSON | Technical docs, notes, config files | +| **Images** | PNG, JPG, GIF | Product screenshots, designs, charts | +| **Audio** | MP3, WAV, M4A | Meeting recordings, interviews | +| **Archives** | ZIP, TAR, GZ | Batch document packages | + +### 1.2 What Happens After Upload + +```mermaid +flowchart TB + A[You upload a PDF] --> B{System Auto Recognizes} + + B --> C[Extract text content] + B --> D[Identify table structure] + B --> E[Extract images] + B --> F[Recognize formulas] + + C --> G[Build indexes] + D --> G + E --> G + F --> G + + G --> H1[Vector Index
Semantic search] + G --> H2[Full-text Index
Keyword search] + G --> H3[Graph Index
Relationship query] + + H1 --> I[Done! Can retrieve] + H2 --> I + H3 --> I + + style A fill:#e1f5ff + style B fill:#fff59d + style G fill:#ffe0b2 + style I fill:#c8e6c9 +``` + +**Simply put**: You just upload files, the system automatically handles everything! + +## 2. Practical Applications + +See how document upload works in real scenarios. + +### 2.1 Enterprise Knowledge Base + +**Scenario**: Company building internal knowledge base. + +**Upload Content**: +- 📋 Policy documents: Employee handbook, attendance policies, reimbursement procedures +- 📊 Business materials: Product introductions, sales data, financial reports +- 🔧 Technical docs: System architecture, API documentation, deployment guides +- 📁 Project materials: Project proposals, meeting records, retrospectives + +**Results**: + +``` +Employee asks: "What's the business trip reimbursement process?" +System: Finds reimbursement process section from "Finance Policy.pdf" + +New hire asks: "What products does the company have?" +System: Extracts product list from "Product Manual.pptx" + +Developer: "How to call this API?" +System: Finds calling example from "API Docs.md" +``` + +### 2.2 Research Material Organization + +**Scenario**: Graduate student organizing papers and study materials. + +**Upload Content**: +- 📖 Academic papers (PDF) +- 📝 Reading notes (Markdown) +- 🎓 Course slides (PPT) +- 📊 Experiment data (Excel) + +**Results**: + +``` +Q: "What research exists on Graph RAG?" +A: Finds relevant content from multiple papers + +Q: "What are an author's main contributions?" +A: Analyzes papers, summarizes research directions +``` + +### 2.3 Personal Knowledge Management + +**Scenario**: Developer accumulating technical notes. + +**Upload Content**: +- 💻 Study notes (Markdown) +- 📞 Technical screenshots (PNG) +- 🎬 Tutorial audio +- 📚 Technical books (PDF) + +**Results**: + +``` +Q: "How did I solve Redis connection issues before?" +A: Finds solution from "Redis Troubleshooting.md" + +Q: "What are best practices for this tech?" +A: Summarizes best practices from multiple documents +``` + +### 2.4 Multimodal Content Processing + +**Scenario**: Product team's design materials. + +**Upload Content**: +- 🎚 UI designs (images) +- 📋 Product PRDs (Word) +- 🎀 User interview recordings +- 📊 Data analysis reports (Excel) + +**System Processing**: +- Designs → OCR extract text + Vision understand design intent +- PRD → Extract product requirements and features +- Recordings → Transcribe to text, extract user feedback +- Reports → Extract key metrics + +**Result**: All content integrated, searchable together! + +## 3. Upload Experience + +### 3.1 Batch Upload is Simple + +Suppose you need to upload 50 company documents: + +**Step 1: Select Files (10 seconds)** + +``` +Click "Upload Documents" → Select 50 PDFs → Click "Start Upload" +``` + +**Step 2: Quick Upload (30 seconds)** + +``` +Progress: 1/50, 2/50, 3/50... 50/50 ✅ +All files uploaded to staging in seconds, no wait for processing +``` + +**Step 3: Preview and Confirm (1 minute)** + +``` +View uploaded file list: +- ✅ annual_report.pdf (5.2 MB) +- ✅ product_manual.pdf (3.1 MB) +- ❌ personal_notes.pdf (shouldn't upload) → Uncheck +- ✅ technical_docs.pdf (2.8 MB) +... 
+ +Click "Save to Knowledge Base" +``` + +**Step 4: Background Processing (5-30 minutes)** + +``` +System auto processes: +- Parse document content +- Build multiple indexes +- You can continue other work, no need to wait +``` + +**Step 5: Completion Notification** + +``` +Notification: "49 documents processed, ready for retrieval" +``` + +### 3.2 Processing Time Reference + +Different sized documents have different processing speeds: + +| Document Type | Size | Upload Time | Processing Time | Example | +|--------------|------|-------------|-----------------|---------| +| 🏃 Small | < 5 pages | < 1 sec | 1-3 minutes | Notices, emails | +| 🚶 Medium | 10-50 pages | < 3 sec | 3-10 minutes | Reports, manuals | +| 🐌 Large | 100+ pages | < 10 sec | 10-30 minutes | Books, paper collections | + +**Key Points**: +- ✅ Upload always fast (seconds) +- ⏳ Processing happens in background (non-blocking) +- 📊 Can view processing progress in real-time + +### 3.3 Real-time Progress Tracking + +After upload, you can check document status anytime: + +``` +Document List: + +📄 annual_report.pdf + Status: Processing (60%) + ├─ ✅ Document Parsing: Complete + ├─ ✅ Vector Index: Complete + ├─ 🔄 Full-text Index: In Progress + └─ ⏳ Graph Index: Waiting + +📄 product_manual.pdf + Status: Complete ✅ + Can retrieve + +📄 meeting_notes.pdf + Status: Failed ❌ + Error: File corrupted + Action: Re-upload +``` + +## 4. Core Features + +ApeRAG document upload has unique features making it more convenient. + +### 4.1 Staging Area Design + +**Core Idea**: Upload first, select later - gives you a chance to "regret". + +**Like online shopping**: + +``` +Shopping process: +1. Add to cart (staging) +2. Review cart, remove unwanted items +3. Submit order (confirm) + +Document upload: +1. Upload to staging (quick upload) +2. Review list, cancel unneeded ones +3. Save to knowledge base (confirm addition) +``` + +**Benefits**: + +- ✅ **Fast Upload**: 20 files uploaded in 5 seconds, no wait for processing +- ✅ **Selective Addition**: Upload 100, save only the 80 needed +- ✅ **Save Quota**: Staging files don't consume quota +- ✅ **Easy Correction**: Found error? Cancel directly, no need to delete + +### 4.2 Smart Processing + +**Auto Format Recognition**: + +System auto recognizes file type and selects appropriate processing: + +- 📄 PDF → Extract text, tables, images, formulas +- 📋 Word → Convert format, extract content +- 📊 Excel → Recognize table structure +- 🎚 Images → OCR text + understand content +- 🎀 Audio → Transcribe to text + +**No extra operations needed**, system handles automatically! + +### 4.3 Background Processing + +After upload, system auto processes in background: + +```mermaid +sequenceDiagram + participant U as You + participant S as System + + U->>S: Upload file + S-->>U: Second-level return ✅ + Note over U: Continue work, no wait + + S->>S: Parse document... + S->>S: Build indexes... + S-->>U: Processing complete notification 🔔 +``` + +**Advantages**: +- No wait, upload then do other things +- System auto retries failed documents +- Real-time view processing progress + +### 4.4 Auto Cleanup + +Staging area files not confirmed in 7 days are auto cleaned, preventing storage waste. + +## 5. Document Parsing Principles + +After upload, system needs to "understand" the document. Different formats have different processing methods. 
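+
+To make this concrete, below is a minimal, illustrative sketch of per-format dispatch with fallback. It is not the actual ApeRAG implementation (the real parsers live under `aperag/docparser/`); the function name, registry shape, abbreviated suffix table, and the `FallbackError` signal are assumptions for the sketch, while the parser names follow the priority chain described in 5.1:
+
+```python
+from pathlib import Path
+
+class FallbackError(Exception):
+    """Raised by a parser that cannot handle a file, so the next candidate is tried."""
+
+# Specialised parsers first, MarkItDown last as the universal fallback.
+CANDIDATES = {
+    ".pdf":  ["MinerUParser", "DocRayParser", "MarkItDownParser"],
+    ".docx": ["DocRayParser", "MarkItDownParser"],
+    ".png":  ["ImageParser", "MarkItDownParser"],
+    ".mp3":  ["AudioParser", "MarkItDownParser"],
+}
+
+def parse_document(path: Path, registry: dict) -> str:
+    """Try each candidate parser in priority order; return extracted Markdown text."""
+    last_error = None
+    for name in CANDIDATES.get(path.suffix.lower(), ["MarkItDownParser"]):
+        parser = registry.get(name)
+        if parser is None:             # parser not configured, e.g. no MinerU API token
+            continue
+        try:
+            return parser.parse(path)  # in this sketch every parser returns Markdown
+        except FallbackError as exc:   # hand over to the next parser in the chain
+            last_error = exc
+    raise RuntimeError(f"no parser could handle {path.name}") from last_error
+```
+
+Conceptually, the parser switches in section 7.2 (`USE_MINERU_API`, `USE_DOC_RAY`, `USE_MARKITDOWN`) just control which entries end up in such a candidate list.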
+ +### 5.1 Parser Workflow + +System has multiple parsers, auto selects most suitable: + +```mermaid +flowchart TD + File[Upload PDF] --> Try1{Try MinerU} + Try1 -->|Success| Result[Parsing Complete] + Try1 -->|Fail/Not Configured| Try2{Try DocRay} + Try2 -->|Success| Result + Try2 -->|Fail/Not Configured| Try3[Use MarkItDown] + Try3 --> Result + + style File fill:#e1f5ff + style Result fill:#c5e1a5 + style Try1 fill:#fff3e0 + style Try2 fill:#fff3e0 + style Try3 fill:#c5e1a5 +``` + +**Parser Priority**: + +1. **MinerU**: Most powerful, commercial API, paid + - Good at: Complex PDFs, academic papers, documents with formulas + +2. **DocRay**: Open source, free, strong layout analysis + - Good at: Tables, charts, multi-column layouts + +3. **MarkItDown**: Generic, fallback, supports all formats + - Good at: Simple documents, text files + +**Auto degradation benefits**: +- Try best parser first +- Auto switch to next if fails +- Always one succeeds + +### 5.2 Specific Examples + +**Example 1: Complex PDF** + +``` +Upload: annual_report.pdf (50 pages, with tables and charts) + ↓ +DocRay parser auto: +- 📝 Extract all text content +- 📊 Recognize tables, maintain structure +- 🎚 Extract images and charts +- 📐 Recognize LaTeX formulas + ↓ +Get: +- Complete Markdown document +- 50 page screenshots (if vision index needed) +``` + +**Example 2: Image Screenshot** + +``` +Upload: product_screenshot.png + ↓ +ImageParser auto: +- 📞 OCR recognize text in image +- 👁 Vision AI understand image content + ↓ +Get: +- Text: "Product name: ApeRAG, Version: 2.0..." +- Description: "This is a product intro page with name, version, and feature list" +``` + +**Example 3: Meeting Recording** + +``` +Upload: meeting.mp3 (30 minutes) + ↓ +AudioParser auto: +- 🎀 Speech-to-text (STT) +- 📝 Generate meeting transcript + ↓ +Get: +- "Meeting starts. Host John: Hello everyone, today we discuss product planning..." +- Complete meeting text transcript +``` + +### 5.3 Duplicate File Handling + +System auto detects duplicate uploads: + +``` +First upload report.pdf → Create new document ✅ +Second upload report.pdf (same content) → Return existing document ✅ +Third upload report.pdf (different content) → Conflict warning, need rename ⚠ +``` + +**Advantages**: +- Avoid duplicate documents +- Network retries don't create multiple documents +- Save storage space + +## 6. Index Building + +After document parsing, system auto builds multiple indexes for different retrieval methods. + +### 6.1 Why Multiple Indexes Needed + +Different questions need different retrieval methods: + +``` +Q: "How to optimize database performance?" +→ Need: Vector index (semantic similarity search) + +Q: "Where is PostgreSQL config file?" +→ Need: Full-text index (exact keyword search) + +Q: "What's the relationship between John and Mike?" +→ Need: Graph index (relationship query) + +Q: "What's this document mainly about?" +→ Need: Summary index (quick overview) + +Q: "What's in this image?" +→ Need: Vision index (image content search) +``` + +### 6.2 Five Index Types + +```mermaid +flowchart TB + Doc[Your Document] --> Auto[System Auto Builds] + + Auto --> V[Vector Index
Find Similar Content] + Auto --> F[Full-text Index
Find Keywords] + Auto --> G[Graph Index
Find Relationships] + Auto --> S[Summary Index
Quick Overview] + Auto --> I[Vision Index
Find Images] + + V --> Q1[Q: How to optimize performance?] + F --> Q2[Q: Config file path?] + G --> Q3[Q: A and B's relationship?] + S --> Q4[Q: What's doc about?] + I --> Q5[Q: What's in image?] + + style Doc fill:#e1f5ff + style Auto fill:#fff59d + style V fill:#bbdefb + style F fill:#c5e1a5 + style G fill:#ffccbc + style S fill:#e1bee7 + style I fill:#fff9c4 +``` + +**Index Comparison**: + +| Index | Required | Suitable Questions | Speed | +|-------|----------|-------------------|-------| +| Vector | ✅ | Semantic similarity | Fast | +| Full-text | ✅ | Exact keywords | Fast | +| Graph | ❌ | Relationship queries | Slow | +| Summary | ❌ | Quick overview | Medium | +| Vision | ❌ | Image content | Medium | + +**Recommended Config**: + +- 💰 Save cost: Only enable vector + full-text +- ⚡ Prioritize speed: Disable graph (slowest) +- 🎯 Full features: Enable all + +### 6.3 Parallel Building + +Multiple indexes can build simultaneously, saving time: + +``` +Document parsing complete + ↓ +5 indexes start building simultaneously: +- Vector index: 1 minute +- Full-text index: 30 seconds +- Graph index: 10 minutes ⏱ (slowest) +- Summary index: 3 minutes +- Vision index: 2 minutes + ↓ +Total time: 10 minutes (the slowest one) +If serial: 16.5 minutes + +Saved: 40% time! +``` + +### 6.4 Auto Retry + +If an index build fails, system auto retries: + +``` +1st retry: After 1 minute +2nd retry: After 5 minutes +3rd retry: After 15 minutes +Still fails → Mark as failed, notify user +``` + +Most temporary errors (network issues, service restarts) auto recover! + +## 7. Technical Implementation + +> 💡 **Reading Tip**: This chapter contains technical details, mainly for developers and ops. General users can skip. + +### 7.1 Storage Architecture + +**File Storage Location**: + +``` +Local storage (dev): +.objects/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png + +Cloud storage (production): +s3://bucket/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png +``` + +**Configuration**: + ```bash -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=.objects +# Local storage +export OBJECT_STORE_TYPE=local + +# Cloud storage (S3/MinIO) +export OBJECT_STORE_TYPE=s3 +export OBJECT_STORE_S3_BUCKET=aperag ``` -**S3 Storage**: +### 7.2 Parser Configuration + +**Enable Different Parsers**: + ```bash -OBJECT_STORE_TYPE=s3 -OBJECT_STORE_S3_ENDPOINT=http://127.0.0.1:9000 -OBJECT_STORE_S3_BUCKET=aperag -OBJECT_STORE_S3_ACCESS_KEY=minioadmin -OBJECT_STORE_S3_SECRET_KEY=minioadmin -``` - -### Supported Parsers - -- **MinerUParser**: High-precision PDF parsing -- **DocRayParser**: Document layout analysis -- **ImageParser**: Image OCR and vision understanding -- **AudioParser**: Audio transcription -- **MarkItDownParser**: Universal fallback parser - -### Index Types - -| Type | Required | Storage | -|------|----------|---------| -| VECTOR | ✅ | Qdrant | -| FULLTEXT | ✅ | Elasticsearch | -| GRAPH | ❌ | Neo4j/PostgreSQL | -| SUMMARY | ❌ | PostgreSQL | -| VISION | ❌ | Qdrant + PostgreSQL | - -## Related Files - -### Backend Core -- `aperag/views/collections.py` - View layer -- `aperag/service/document_service.py` - Service layer -- `aperag/db/models.py` - Database models - -### Object Storage -- `aperag/objectstore/base.py` - Storage interface -- `aperag/objectstore/local.py` - Local storage -- `aperag/objectstore/s3.py` - S3 storage - -### Document Parsing -- `aperag/docparser/doc_parser.py` - Main parser -- `aperag/docparser/mineru_parser.py` - MinerU parser -- 
`aperag/docparser/docray_parser.py` - DocRay parser -- `aperag/docparser/markitdown_parser.py` - MarkItDown parser -- `aperag/docparser/image_parser.py` - Image parser -- `aperag/docparser/audio_parser.py` - Audio parser - -### Index Building -- `aperag/index/vector_index.py` - Vector indexer -- `aperag/index/fulltext_index.py` - Full-text indexer -- `aperag/index/graph_index.py` - Graph indexer -- `aperag/index/summary_index.py` - Summary indexer -- `aperag/index/vision_index.py` - Vision indexer - -### Task Scheduling -- `config/celery_tasks.py` - Celery tasks -- `aperag/tasks/reconciler.py` - Index reconciler -- `aperag/tasks/document.py` - Document tasks - -### Frontend -- `web/src/app/workspace/collections/[collectionId]/documents/upload/document-upload.tsx` - Upload component - -## Summary - -ApeRAG's document upload module adopts a **two-phase commit + multi-parser chain invocation + parallel multi-index building** architecture: - -**Core Features**: -1. ✅ **Two-Phase Commit**: Upload (temporary) → Confirm (formal), better UX -2. ✅ **SHA-256 Deduplication**: Prevents duplicates, idempotent upload -3. ✅ **Flexible Storage**: Local/S3 configurable, unified interface -4. ✅ **Multi-Parser**: MinerU, DocRay, MarkItDown, and more -5. ✅ **Auto Conversion**: PDF→images, audio→text, image→OCR -6. ✅ **Multi-Index**: Vector, full-text, graph, summary, vision -7. ✅ **Quota Management**: Deducted at confirmation stage -8. ✅ **Async Processing**: Celery task queue, non-blocking -9. ✅ **Transaction Consistency**: Database + object store 2PC -10. ✅ **Observability**: Audit logs, task tracking, error recording - -For complete details, please refer to `/docs/en-US/design/document_upload_design.md`. +# DocRay (recommended, free, good performance) +export USE_DOC_RAY=true +export DOCRAY_HOST=http://docray:8639 + +# MinerU (optional, paid, highest precision) +export USE_MINERU_API=false +export MINERU_API_TOKEN=your_token + +# MarkItDown (default enabled, fallback) +export USE_MARKITDOWN=true +``` + +**Selection Recommendations**: +- 💰 Free solution: DocRay + MarkItDown +- 🎯 High precision: MinerU + DocRay + MarkItDown + +### 7.3 Index Configuration + +Control which indexes to enable in Collection config: + +```json +{ + "enable_vector": true, // Vector index (required) + "enable_fulltext": true, // Full-text index (required) + "enable_knowledge_graph": true, // Graph index (optional) + "enable_summary": false, // Summary index (optional) + "enable_vision": false // Vision index (optional) +} +``` + +### 7.4 Performance Tuning + +**File Size Limits**: + +```bash +export MAX_DOCUMENT_SIZE=104857600 # 100 MB +export MAX_EXTRACTED_SIZE=5368709120 # 5 GB +``` + +**Concurrency Settings**: + +```bash +export CELERY_WORKER_CONCURRENCY=16 # Process 16 docs concurrently +export CELERY_TASK_TIME_LIMIT=3600 # Single task timeout 1 hour +``` + +**Quota Settings**: + +```bash +export MAX_DOCUMENT_COUNT=1000 # Max 1000 docs per user +export MAX_DOCUMENT_COUNT_PER_COLLECTION=100 # Max 100 docs per collection +``` + +## 8. Common Questions + +### 8.1 File Upload Failed? + +**Possible Causes and Solutions**: + +| Issue | Cause | Solution | +|-------|-------|----------| +| File too large | Over 100 MB | Compress or split file | +| Format not supported | Special format | Convert to PDF or other common format | +| Name conflict | Same name different content exists | Rename file | +| Quota full | Reached document count limit | Delete old docs or upgrade quota | + +### 8.2 Document Processing Failed? 
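+
+Index builds run as background Celery tasks, so most transient failures are absorbed by the retry policy from section 6.4 before you ever see them. A rough, illustrative sketch of that policy (the real task definitions live in `config/celery_tasks.py` and `aperag/tasks/`; the task name and arguments below are assumptions, not the actual API):
+
+```python
+from celery import shared_task
+
+class TransientError(Exception):
+    """Stand-in for recoverable failures such as network blips or service restarts."""
+
+RETRY_DELAYS = [60, 300, 900]  # 1 min, 5 min, 15 min (the intervals from section 6.4)
+
+@shared_task(bind=True, max_retries=3, acks_late=True)  # acks_late: task is not lost if a worker crashes
+def build_index(self, document_id: str, index_type: str) -> None:
+    try:
+        ...  # call the actual index builder for this document and index type
+    except TransientError as exc:
+        delay = RETRY_DELAYS[min(self.request.retries, len(RETRY_DELAYS) - 1)]
+        raise self.retry(exc=exc, countdown=delay)
+    # after the final retry fails, the index is marked FAILED and the user is notified
+```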
+ +System auto retries 3 times, if still fails: + +``` +View error message → Fix based on prompt → Re-upload → System auto retries +``` + +Common errors: +- File corrupted → Recreate file +- Content unrecognizable → Try converting format +- Temporary network issues → System auto retries + +### 8.3 How to Speed Up Processing? + +**Method 1**: Disable unneeded indexes + +```json +{ + "enable_knowledge_graph": false // Graph slowest, can disable +} +``` + +**Method 2**: Use faster LLM models + +Select faster responding models in Collection config. + +### 8.4 Will Staging Files Be Lost? + +- ✅ Within 7 days: Won't be lost, can confirm anytime +- ⚠ After 7 days: Auto cleanup (save storage) +- 💡 Recommendation: Confirm promptly after upload + +## 9. Summary + +ApeRAG document upload makes it easy to add various format documents to your knowledge base. + +### Core Advantages + +1. ✅ **Supports 20+ formats**: PDF, Word, Excel, images, audio, etc. +2. ✅ **Second-level upload response**: No wait, immediate return +3. ✅ **Staging area design**: Upload first, select later, avoid mistakes +4. ✅ **Smart parsing**: Auto recognize format, select best parser +5. ✅ **Multi-index building**: Build 5 indexes simultaneously, meet different retrieval needs +6. ✅ **Background processing**: Async execution, non-blocking +7. ✅ **Auto retry**: Failures auto retry, improve success rate +8. ✅ **Quota management**: Only consume on confirmation, reasonable resource control + +### Performance + +| Operation | Time | +|-----------|------| +| Upload 100 files | < 1 minute | +| Confirm addition | < 1 second | +| Small doc processing (< 10 pages) | 1-3 minutes | +| Medium doc (10-50 pages) | 3-10 minutes | +| Large doc (100+ pages) | 10-30 minutes | + +### Suitable Scenarios + +- 📚 Enterprise knowledge base building +- 🔬 Research material organization +- 📖 Personal note management +- 🎓 Learning material archiving + +The system is both **simple to use** and **powerful**, suitable for various scales of knowledge management needs. + +--- + +## Related Documentation + +- 📋 [System Architecture](./architecture.md) - ApeRAG overall architecture design +- 📖 [Graph Index Creation Process](./graph_index_creation.md) - Graph index details +- 🔗 [Index Pipeline Architecture](./indexing_architecture.md) - Complete indexing process diff --git a/web/docs/zh-CN/design/document_upload_design.md b/web/docs/zh-CN/design/document_upload_design.md index 3a0a0ec6..8224383c 100644 --- a/web/docs/zh-CN/design/document_upload_design.md +++ b/web/docs/zh-CN/design/document_upload_design.md @@ -1,1083 +1,708 @@ --- -title: 文档䞊䌠架构讟计 -description: 诊细诎明ApeRAG文档䞊䌠暡块的完敎架构讟计包括䞊䌠流皋、䞎时存傚配眮、文档解析、栌匏蜬换、数据库讟计等 -keywords: [document upload, architecture, object store, parser, index building, two-phase commit] +title: 文档䞊䌠讟计 +description: ApeRAG 文档䞊䌠的完敎流皋䞎栞心讟计 +keywords: 文档䞊䌠, 倚栌匏支持, 文档解析, 智胜玢匕 --- -# ApeRAG 文档䞊䌠架构讟计 +# 文档䞊䌠讟计 -## 抂述 +## 1. 文档䞊䌠是什么 -本文档诊细诎明 ApeRAG 项目䞭文档䞊䌠暡块的完敎架构讟计涵盖从文件䞊䌠、䞎时存傚、文档解析、栌匏蜬换到最终玢匕构建的党铟路流皋。 +文档䞊䌠是 ApeRAG 的入口功胜让䜠可以把各种栌匏的文档添加到知识库䞭系统䌚自劚倄理、玢匕让这些知识可以被检玢和对话。 -**栞心讟计理念**采甚**䞀阶段提亀**暡匏将文件䞊䌠䞎时存傚和文档确讀正匏添加分犻提䟛曎奜的甚户䜓验和资源管理胜力。 +### 1.1 胜䞊䌠什么 -## 系统架构 - -### 敎䜓架构囟 +ApeRAG 支持 20+ 种文档栌匏基本涵盖了日垞工䜜䞭的所有文件类型 +```mermaid +flowchart LR + subgraph Input[📁 䜠的文档] + A1[PDF 报告] + A2[Word 文档] + A3[Excel 衚栌] + A4[囟片截囟] + A5[䌚议圕音] + A6[Markdown 笔记] + end + + subgraph Process[🔄 ApeRAG 自劚倄理] + B[识别栌匏
提取内容
构建玢匕]
+  end
+
+  subgraph Output[✚ 可检玢的知识]
+    C[回答问题
查扟信息
分析关系] + end + + A1 --> B + A2 --> B + A3 --> B + A4 --> B + A5 --> B + A6 --> B + + B --> C + + style Input fill:#e3f2fd + style Process fill:#fff59d + style Output fill:#c8e6c9 ``` -┌─────────────────────────────────────────────────────────────┐ -│ Frontend │ -│ (Next.js) │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1: Upload │ Step 2: Confirm - │ POST /documents/upload │ POST /documents/confirm - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ View Layer: aperag/views/collections.py │ -│ - HTTP请求倄理 │ -│ - JWT身仜验证 │ -│ - 参数验证 │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ document_service.upload_document() │ document_service.confirm_documents() - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Service Layer: aperag/service/document_service.py │ -│ - 䞚务逻蟑猖排 │ -│ - 文件验证类型、倧小 │ -│ - SHA-256 哈垌去重 │ -│ - Quota 检查 │ -│ - 事务管理 │ -└────────┬───────────────────────────────────┬────────────────┘ - │ │ - │ Step 1 │ Step 2 - â–Œ â–Œ -┌────────────────────────┐ ┌────────────────────────────┐ -│ 1. 创建 Document 记圕 │ │ 1. 曎新 Document 状态 │ -│ status=UPLOADED │ │ UPLOADED → PENDING │ -│ 2. 保存到 ObjectStore │ │ 2. 创建 DocumentIndex 记圕│ -│ 3. 计算 content_hash │ │ 3. 觊发玢匕构建任务 │ -└────────┬───────────────┘ └────────┬───────────────────┘ - │ │ - â–Œ â–Œ -┌─────────────────────────────────────────────────────────────┐ -│ Storage Layer │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ ┌─────────────┐ │ -│ │ PostgreSQL │ │ Object Store │ │ Vector DB │ │ -│ │ │ │ │ │ │ │ -│ │ - document │ │ - Local/S3 │ │ - Qdrant │ │ -│ │ - document_ │ │ - 原始文件 │ │ - 向量玢匕 │ │ -│ │ index │ │ - 蜬换后的文件 │ │ │ │ -│ └───────────────┘ └──────────────────┘ └─────────────┘ │ -│ │ -│ ┌───────────────┐ ┌──────────────────┐ │ -│ │ Elasticsearch │ │ Neo4j/PG │ │ -│ │ │ │ │ │ -│ │ - 党文玢匕 │ │ - 知识囟谱 │ │ -│ └───────────────┘ └──────────────────┘ │ -└─────────────────────────────────────────────────────────────┘ - │ - â–Œ - ┌───────────────────┐ - │ Celery Workers │ - │ │ - │ - 文档解析 │ - │ - 栌匏蜬换 │ - │ - 内容提取 │ - │ - 文档分块 │ - │ - 玢匕构建 │ - └───────────────────┘ + +**文档类型** + +| 类别 | 栌匏 | 兞型甚途 | +|------|------|---------| +| **办公文档** | PDF, Word, PPT, Excel | 幎床报告、䌚议纪芁、数据衚栌 | +| **文本文件** | TXT, MD, HTML, JSON | 技术文档、笔记、配眮文件 | +| **囟片** | PNG, JPG, GIF | 产品截囟、讟计皿、囟衚 | +| **音频** | MP3, WAV, M4A | 䌚议圕音、采访圕音 | +| **压猩包** | ZIP, TAR, GZ | 批量文档打包 | + +### 1.2 䞊䌠后发生什么 + +```mermaid +flowchart TB + A[䜠䞊䌠䞀䞪 PDF] --> B{系统自劚识别} + + B --> C[提取文字内容] + B --> D[识别衚栌结构] + B --> E[提取囟片] + B --> F[识别公匏] + + C --> G[构建玢匕] + D --> G + E --> G + F --> G + + G --> H1[向量玢匕
支持语义搜玢] + G --> H2[党文玢匕
支持关键词搜玢] + G --> H3[囟谱玢匕
支持关系查询] + + H1 --> I[完成可以检玢] + H2 --> I + H3 --> I + + style A fill:#e1f5ff + style B fill:#fff59d + style G fill:#ffe0b2 + style I fill:#c8e6c9 ``` -### 分层架构 +**简单来诎**䜠只管䞊䌠文件系统自劚垮䜠倄理奜䞀切 + +## 2. 实际应甚场景 + +看看文档䞊䌠圚实际工䜜䞭的应甚。 + +### 2.1 䌁䞚知识库建讟 + +**场景**公叞芁建立内郚知识库。 + +**䞊䌠内容** +- 📋 制床文档员工手册、考勀制床、报销流皋 +- 📊 䞚务资料产品介绍、销售数据、莢务报衚 +- 🔧 技术文档系统架构、API 文档、郚眲指南 +- 📁 项目资料项目方案、䌚议记圕、倍盘总结 + +**䜿甚效果** ``` -┌─────────────────────────────────────────────┐ -│ View Layer (views/collections.py) │ HTTP 倄理、讀证、参数验证 -└─────────────────┬───────────────────────────┘ - │ 调甚 -┌─────────────────▌───────────────────────────┐ -│ Service Layer (service/document_service.py)│ 䞚务逻蟑、事务猖排、权限控制 -└─────────────────┬───────────────────────────┘ - │ 调甚 -┌─────────────────▌───────────────────────────┐ -│ Repository Layer (db/ops.py, objectstore/) │ 数据访问抜象、对象存傚接口 -└─────────────────┬───────────────────────────┘ - │ 访问 -┌─────────────────▌───────────────────────────┐ -│ Storage Layer (PG, S3, Qdrant, ES, Neo4j) │ 数据持久化 -└─────────────────────────────────────────────┘ +员工提问"出差报销流皋是什么" +系统从《莢务制床.pdf》扟到报销流皋章节 + +新人提问"公叞的产品有哪些" +系统从《产品手册.pptx》提取产品列衚 + +技术同孊"这䞪 API 怎么调甚" +系统从《API文档.md》扟到调甚瀺䟋 ``` -## 栞心流皋诊解 +### 2.2 研究资料敎理 -### 阶段 0: API 接口定义 +**场景**研究生敎理论文和孊习资料。 -系统提䟛䞉䞪䞻芁接口 +**䞊䌠内容** +- 📖 孊术论文 PDF +- 📝 读乊笔记 Markdown +- 🎓 诟皋讲义 PPT +- 📊 实验数据 Excel -1. **䞊䌠文件**䞀阶段暡匏 - 第䞀步 - - 接口`POST /api/v1/collections/{collection_id}/documents/upload` - - 功胜䞊䌠文件到䞎时存傚状态䞺 `UPLOADED` - - 返回`document_id`、`filename`、`size`、`status` +**䜿甚效果** -2. **确讀文档**䞀阶段暡匏 - 第二步 - - 接口`POST /api/v1/collections/{collection_id}/documents/confirm` - - 功胜确讀已䞊䌠的文档觊发玢匕构建 - - 参数`document_ids` 数组 - - 返回`confirmed_count`、`failed_count`、`failed_documents` +``` +问"Graph RAG 盞关的研究有哪些" +答从倚篇论文䞭扟到盞关内容 + +问"某䞪䜜者的䞻芁莡献是什么" +答分析论文总结䜜者的研究方向 +``` + +### 2.3 䞪人知识管理 -3. **䞀步䞊䌠**䌠统暡匏兌容旧版 - - 接口`POST /api/v1/collections/{collection_id}/documents` - - 功胜䞊䌠并盎接添加到知识库状态盎接䞺 `PENDING` - - 支持批量䞊䌠 +**场景**皋序员积环技术笔记。 -### 阶段 1: 文件䞊䌠䞎䞎时存傚 +**䞊䌠内容** +- 💻 孊习笔记 Markdown +- 📞 技术截囟 PNG +- 🎬 教皋圕屏蜬的音频 +- 📚 技术乊籍 PDF -#### 1.1 䞊䌠流皋 +**䜿甚效果** ``` -甚户选择文件 - │ - â–Œ -前端调甚 upload API - │ - â–Œ -View 层验证身仜和参数 - │ - â–Œ -Service 层倄理䞚务逻蟑 - │ - ├─► 验证集合存圚䞔激掻 - │ - ├─► 验证文件类型和倧小 - │ - ├─► 读取文件内容 - │ - ├─► 计算 SHA-256 哈垌 - │ - └─► 事务倄理 - │ - ├─► 重倍检测按文件名+哈垌 - │ ├─ 完党盞同返回已存圚文档幂等 - │ ├─ 同名䞍同内容抛出冲突匂垞 - │ └─ 新文档继续创建 - │ - ├─► 创建 Document 记圕status=UPLOADED - │ - ├─► 䞊䌠到对象存傚 - │ └─ 路埄user-{user_id}/{collection_id}/{document_id}/original{suffix} - │ - └─► 曎新文档元数据object_path +问"之前怎么解决过 Redis 连接问题" +答从笔记《Redis问题排查.md》扟到解决方案 + +问"某䞪技术的最䜳实践是什么" +答从倚䞪文档䞭总结最䜳实践 ``` -#### 1.2 文件验证 +### 2.4 倚暡态内容倄理 -**支持的文件类型** -- 文档`.pdf`, `.doc`, `.docx`, `.ppt`, `.pptx`, `.xls`, `.xlsx` -- 文本`.txt`, `.md`, `.html`, `.json`, `.xml`, `.yaml`, `.yml`, `.csv` -- 囟片`.png`, `.jpg`, `.jpeg`, `.gif`, `.bmp`, `.tiff`, `.tif` -- 音频`.mp3`, `.wav`, `.m4a` -- 压猩包`.zip`, `.tar`, `.gz`, `.tgz` +**场景**产品团队的讟计资料。 -**倧小限制** -- 默讀100 MB可通过 `MAX_DOCUMENT_SIZE` 环境变量配眮 -- 解压后总倧小5 GB`MAX_EXTRACTED_SIZE` +**䞊䌠内容** +- 🎚 UI 讟计皿囟片 +- 📋 产品 PRDWord +- 🎀 甚户访谈圕音 +- 📊 数据分析报告Excel -#### 1.3 重倍检测机制 +**系统倄理** +- 讟计皿 → OCR 提取文字 + Vision 理解讟计意囟 +- PRD → 提取产品需求和功胜点 +- 圕音 → 蜬文字提取甚户反銈 +- 数据报告 → 提取关键指标 -采甹**文件名 + SHA-256 哈垌**双重检测 +**结果**所有内容融合圚䞀起可以绌合检玢 -| 场景 | 文件名 | 哈垌倌 | 系统行䞺 | -|------|--------|--------|----------| -| 完党盞同 | 盞同 | 盞同 | 返回已存圚文档幂等操䜜 | -| 文件名冲突 | 盞同 | 䞍同 | 抛出 `DocumentNameConflictException` | -| 新文档 | 䞍同 | - | 创建新文档记圕 | +## 3. 
䞊䌠䜓验 -**䌘势** -- ✅ 支持幂等䞊䌠眑络重䌠䞍䌚创建重倍文档 -- ✅ 避免内容冲突同名䞍同内容䌚提瀺甚户 -- ✅ 节省存傚空闎盞同内容只存傚䞀次 +### 3.1 批量䞊䌠埈简单 + +假讟䜠芁䞊䌠 50 䞪公叞文档 -### 阶段 2: 䞎时存傚配眮 +**Step 1选择文件10 秒** -#### 2.1 对象存傚类型 +``` +点击"䞊䌠文档" → 选择 50 䞪 PDF → 点击"匀始䞊䌠" +``` -系统支持䞀种对象存傚后端可通过环境变量切换 +**Step 2快速䞊䌠30 秒** -**1. Local 存傚本地文件系统** +``` +进床条1/50, 2/50, 3/50... 50/50 ✅ +所有文件秒䌠到暂存区䞍需芁等埅倄理 +``` -适甚场景 -- 匀发测试环境 -- 小规暡郚眲 -- 单机郚眲 +**Step 3预览确讀1 分钟** -配眮方匏 -```bash -# 匀发环境 -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=.objects +``` +查看䞊䌠的文件列衚 +- ✅ 幎床报告.pdf (5.2 MB) +- ✅ 产品手册.pdf (3.1 MB) +- ❌ 䞪人笔记.pdf (䞍该䞊䌠的) → 取消募选 +- ✅ 技术文档.pdf (2.8 MB) +... -# Docker 环境 -OBJECT_STORE_TYPE=local -OBJECT_STORE_LOCAL_ROOT_DIR=/shared/objects +点击"保存到知识库" ``` -存傚路埄瀺䟋 +**Step 4后台倄理5-30 分钟** + ``` -.objects/ -└── user-google-oauth2-123456/ - └── col_abc123/ - └── doc_xyz789/ - ├── original.pdf # 原始文件 - ├── converted.pdf # 蜬换后的 PDF - ├── processed_content.md # 解析后的 Markdown - ├── chunks/ # 分块数据 - │ ├── chunk_0.json - │ └── chunk_1.json - └── images/ # 提取的囟片 - ├── page_0.png - └── page_1.png +系统自劚倄理 +- 解析文档内容 +- 构建倚种玢匕 +- 䜠可以继续其他工䜜䞍需芁等埅 ``` -**2. S3 存傚兌容 AWS S3/MinIO/OSS 等** - -适甚场景 -- 生产环境 -- 倧规暡郚眲 -- 分垃匏郚眲 -- 需芁高可甚和容灟 +**Step 5完成通知** -配眮方匏 -```bash -OBJECT_STORE_TYPE=s3 -OBJECT_STORE_S3_ENDPOINT=http://127.0.0.1:9000 # MinIO/S3 地址 -OBJECT_STORE_S3_REGION=us-east-1 # AWS Region -OBJECT_STORE_S3_ACCESS_KEY=minioadmin # Access Key -OBJECT_STORE_S3_SECRET_KEY=minioadmin # Secret Key -OBJECT_STORE_S3_BUCKET=aperag # Bucket 名称 -OBJECT_STORE_S3_PREFIX_PATH=dev/ # 可选的路埄前猀 -OBJECT_STORE_S3_USE_PATH_STYLE=true # MinIO 需芁讟眮䞺 true ``` +通知"49 䞪文档倄理完成现圚可以检玢了" +``` + +### 3.2 倄理时闎参考 + +䞍同倧小的文档倄理速床䞍同 + +| 文档类型 | 倧小 | 䞊䌠时闎 | 倄理时闎 | 瀺䟋 | +|---------|------|---------|---------|------| +| 🏃 小文档 | < 5 页 | < 1 秒 | 1-3 分钟 | 通知、邮件 | +| 🚶 䞭型文档 | 10-50 页 | < 3 秒 | 3-10 分钟 | 报告、手册 | +| 🐌 倧型文档 | 100+ 页 | < 10 秒 | 10-30 分钟 | 乊籍、论文集 | + +**关键点** +- ✅ 䞊䌠总是埈快秒级 +- ⏳ 倄理圚后台进行䞍阻塞 +- 📊 可以实时查看倄理进床 + +### 3.3 实时进床查看 -#### 2.2 对象存傚路埄规则 +䞊䌠后可以随时查看文档状态 -**路埄栌匏** ``` -{prefix}/user-{user_id}/{collection_id}/{document_id}/{filename} +文档列衚 + +📄 annual_report.pdf + 状态倄理䞭 (60%) + ├─ ✅ 文档解析完成 + ├─ ✅ 向量玢匕完成 + ├─ 🔄 党文玢匕进行䞭 + └─ ⏳ 囟谱玢匕等埅䞭 + +📄 product_manual.pdf + 状态已完成 ✅ + 可以检玢 + +📄 meeting_notes.pdf + 状态倱莥 ❌ + 错误文件损坏 + 操䜜重新䞊䌠 ``` -**组成郚分** -- `prefix`可选的党局前猀仅 S3 -- `user_id`甚户 ID`|` 替换䞺 `-` -- `collection_id`集合 ID -- `document_id`文档 ID -- `filename`文件名劂 `original.pdf`、`page_0.png` +## 4. 栞心特性 + +ApeRAG 的文档䞊䌠有䞀些独特的特性让䜿甚曎加方䟿。 -**倚租户隔犻** -- 每䞪甚户有独立的呜名空闎 -- 每䞪集合有独立的存傚目圕 -- 每䞪文档有独立的文件倹 +### 4.1 暂存区讟计 -### 阶段 3: 文档确讀䞎玢匕构建 +**栞心理念**先䌠后选给䜠"后悔"的机䌚。 -#### 3.1 确讀流皋 +**就像眑莭** ``` -甚户点击"保存到集合" - │ - â–Œ -前端调甚 confirm API - │ - â–Œ -Service 层倄理 - │ - ├─► 验证集合配眮 - │ - ├─► 检查 Quota确讀阶段才扣陀配额 - │ - └─► 对每䞪 document_id - │ - ├─► 验证文档状态䞺 UPLOADED - │ - ├─► 曎新文档状态UPLOADED → PENDING - │ - ├─► 根据集合配眮创建玢匕记圕 - │ ├─ VECTOR向量玢匕必选 - │ ├─ FULLTEXT党文玢匕必选 - │ ├─ GRAPH知识囟谱可选 - │ ├─ SUMMARY文档摘芁可选 - │ └─ VISION视觉玢匕可选 - │ - └─► 返回确讀结果 - │ - â–Œ -觊发 Celery 任务reconcile_document_indexes - │ - â–Œ -后台匂步倄理玢匕构建 +眑莭流皋 +1. 加入莭物蜊暂存 +2. 查看莭物蜊删陀䞍想芁的 +3. 提亀订单确讀 + +文档䞊䌠 +1. 䞊䌠到暂存区快速䞊䌠 +2. 查看列衚取消䞍需芁的 +3. 保存到知识库确讀添加 ``` -#### 3.2 Quota配额管理 +**奜倄** -**检查时机** -- ❌ 䞍圚䞊䌠阶段检查䞎时存傚䞍占甚配额 -- ✅ 圚确讀阶段检查正匏添加才消耗配额 +- ✅ **快速䞊䌠**20 䞪文件 5 秒䌠完䞍甚等倄理 +- ✅ **选择性添加**䞊䌠 100 䞪只保存需芁的 80 䞪 +- ✅ **节省配额**暂存区的文件䞍占配额 +- ✅ **纠错方䟿**发现错误盎接取消䞍甚删陀 -**配额类型** +### 4.2 智胜倄理 -1. 
**甚户党局配额** - - `max_document_count`甚户总文档数量限制 - - 默讀1000可通过 `MAX_DOCUMENT_COUNT` 配眮 +**自劚识别栌匏** -2. **单集合配额** - - `max_document_count_per_collection`单䞪集合文档数量限制 - - 䞍计入 `UPLOADED` 和 `DELETED` 状态的文档 +系统䌚自劚识别文件类型选择最合适的倄理方匏 -**配额超限倄理** -- 抛出 `QuotaExceededException` -- 返回 HTTP 400 错误 -- 包含圓前甚量和配额䞊限信息 +- 📄 PDF → 提取文字、衚栌、囟片、公匏 +- 📋 Word → 蜬换栌匏、提取内容 +- 📊 Excel → 识别衚栌结构 +- 🎚 囟片 → OCR 文字 + 理解内容 +- 🎀 音频 → 蜬圕成文字 -### 阶段 4: 文档解析䞎栌匏蜬换 +**䜠䞍需芁做任䜕额倖操䜜**系统自劚倄理 -#### 4.1 Parser 架构 +### 4.3 后台倄理 -系统采甚**倚 Parser 铟匏调甚**架构每䞪 Parser 莟莣特定类型的文件解析 +䞊䌠完成后系统圚后台自劚倄理 -``` -DocParser䞻控制噚 - │ - ├─► MinerUParser - │ └─ 功胜高粟床 PDF 解析商䞚 API - │ └─ 支持.pdf - │ - ├─► DocRayParser - │ └─ 功胜文档垃局分析和内容提取 - │ └─ 支持.pdf, .docx, .pptx, .xlsx - │ - ├─► ImageParser - │ └─ 功胜囟片内容识别OCR + 视觉理解 - │ └─ 支持.jpg, .png, .gif, .bmp, .tiff - │ - ├─► AudioParser - │ └─ 功胜音频蜬圕Speech-to-Text - │ └─ 支持.mp3, .wav, .m4a - │ - └─► MarkItDownParser兜底 - └─ 功胜通甚文档蜬 Markdown - └─ 支持几乎所有垞见栌匏 +```mermaid +sequenceDiagram + participant U as 䜠 + participant S as 系统 + + U->>S: 䞊䌠文件 + S-->>U: 秒级返回 ✅ + Note over U: 继续工䜜䞍甚等 + + S->>S: 解析文档... + S->>S: 构建玢匕... + S-->>U: 倄理完成通知 🔔 ``` -#### 4.2 Parser 配眮 +**䌘势** +- 䞍甚等埅䞊䌠完就胜干别的 +- 系统自劚重试倱莥的文档 +- 实时查看倄理进床 -**配眮方匏**通过集合配眮Collection Config劚态控制 +### 4.4 自劚枅理 -```json -{ - "parser_config": { - "use_mineru": false, // 是吊启甚 MinerU需芁 API Token - "use_doc_ray": false, // 是吊启甚 DocRay - "use_markitdown": true, // 是吊启甚 MarkItDown默讀 - "mineru_api_token": "xxx" // MinerU API Token可选 - } -} -``` +暂存区的文件 7 倩没确讀䌚自劚枅理防止占甚存傚空闎。 -**环境变量配眮** -```bash -USE_MINERU_API=false # 党局启甚 MinerU -MINERU_API_TOKEN=your_token # MinerU API Token +## 5. 文档解析原理 + +䞊䌠后系统需芁把文档"读懂"。䞍同栌匏有䞍同的倄理方匏。 + +### 5.1 解析噚工䜜流皋 + +系统有倚䞪解析噚䌚自劚选择最合适的 + +```mermaid +flowchart TD + File[䞊䌠 PDF] --> Try1{尝试 MinerU} + Try1 -->|成功| Result[解析完成] + Try1 -->|倱莥/未配眮| Try2{尝试 DocRay} + Try2 -->|成功| Result + Try2 -->|倱莥/未配眮| Try3[䜿甚 MarkItDown] + Try3 --> Result + + style File fill:#e1f5ff + style Result fill:#c5e1a5 + style Try1 fill:#fff3e0 + style Try2 fill:#fff3e0 + style Try3 fill:#c5e1a5 ``` -#### 4.3 解析流皋 +**解析噚䌘先级** + +1. **MinerU**最区倧商䞚 API需芁付莹 + - 擅长倍杂 PDF、孊术论文、垊公匏的文档 + +2. **DocRay**匀源免莹垃局分析区 + - 擅长衚栌、囟衚、倚列排版 + +3. **MarkItDown**通甚兜底支持所有栌匏 + - 擅长简单文档、文本文件 + +**自劚降级**的奜倄 +- 䌘先甚最奜的解析噚 +- 䞍行就自劚换䞋䞀䞪 +- 总有䞀䞪胜倄理成功 + +**䟋子 1倍杂 PDF** ``` -Celery Worker 收到玢匕任务 - │ - â–Œ -1. 从对象存傚䞋蜜原始文件 - │ - â–Œ -2. 根据文件扩展名选择 Parser - │ - ├─► 尝试第䞀䞪匹配的 Parser - │ ├─ 成功返回解析结果 - │ └─ 倱莥FallbackError → 尝试䞋䞀䞪 Parser - │ - └─► 最终兜底MarkItDownParser - │ - â–Œ -3. 解析结果Parts - │ - ├─► MarkdownPart文本内容 - │ └─ 包含标题、段萜、列衚、衚栌等 - │ - ├─► PdfPartPDF 文件 - │ └─ 甚于线性化、页面枲染 - │ - └─► AssetBinPart二进制资源 - └─ 包含囟片、嵌入的文件等 - │ - â–Œ -4. 后倄理Post-processing - │ - ├─► PDF 页面蜬囟片Vision 玢匕需芁 - │ └─ 每页枲染䞺 PNG 囟片 - │ └─ 保存到 {document_path}/images/page_N.png - │ - ├─► PDF 线性化加速浏览噚加蜜 - │ └─ 䜿甚 pikepdf 䌘化 PDF 结构 - │ └─ 保存到 {document_path}/converted.pdf - │ - └─► 提取文本内容纯文本 - └─ 合并所有 MarkdownPart 内容 - └─ 保存到 {document_path}/processed_content.md - │ - â–Œ -5. 保存到对象存傚 +䞊䌠幎床报告.pdf (50 页有衚栌和囟衚) + ↓ +DocRay 解析噚自劚 +- 📝 提取所有文字内容 +- 📊 识别衚栌保持结构 +- 🎚 提取囟片和囟衚 +- 📐 识别 LaTeX 公匏 + ↓ +埗到 +- 完敎的 Markdown 文档 +- 50 匠页面截囟劂果需芁视觉玢匕 ``` -#### 4.4 栌匏蜬换瀺䟋 +**䟋子 2囟片截囟** -**瀺䟋 1PDF 文档** ``` -蟓入user_manual.pdf (5 MB) - │ - â–Œ -解析噚选择DocRayParser / MarkItDownParser - │ - â–Œ -蟓出 Parts - ├─ MarkdownPart: "# User Manual\n\n## Chapter 1\n..." 
- └─ PdfPart: <原始 PDF 数据> - │ - â–Œ -后倄理 - ├─ 枲染 50 页䞺囟片 → images/page_0.png ~ page_49.png - ├─ 线性化 PDF → converted.pdf - └─ 提取文本 → processed_content.md +䞊䌠product_screenshot.png + ↓ +ImageParser 自劚 +- 📞 OCR 识别囟片䞭的文字 +- 👁 Vision AI 理解囟片内容 + ↓ +埗到 +- 文字"产品名称ApeRAG版本2.0..." +- 描述"这是䞀䞪产品介绍页面包含产品名称、版本号和功胜列衚" ``` -**瀺䟋 2囟片文件** +**䟋子 3䌚议圕音** + ``` -蟓入screenshot.png (2 MB) - │ - â–Œ -解析噚选择ImageParser - │ - â–Œ -蟓出 Parts - ├─ MarkdownPart: "[OCR 提取的文字内容]" - └─ AssetBinPart: <原始囟片数据> (vision_index=true) - │ - â–Œ -后倄理 - └─ 保存原囟副本 → images/file.png +䞊䌠meeting.mp3 (30 分钟) + ↓ +AudioParser 自劚 +- 🎀 语音蜬文字STT +- 📝 生成䌚议记圕 + ↓ +埗到 +- "䌚议匀始。䞻持人匠䞉倧家奜今倩讚论产品规划..." +- 完敎的䌚议文字记圕 ``` -**瀺䟋 3音频文件** +### 5.3 重倍文件倄理 + +系统䌚自劚检测重倍䞊䌠 + ``` -蟓入meeting_record.mp3 (50 MB) - │ - â–Œ -解析噚选择AudioParser - │ - â–Œ -蟓出 Parts - └─ MarkdownPart: "[蜬圕的䌚议内容文本]" - │ - â–Œ -后倄理 - └─ 保存蜬圕文本 → processed_content.md +第䞀次䞊䌠 report.pdf → 创建新文档 ✅ +第二次䞊䌠 report.pdf (内容盞同) → 返回已存圚文档 ✅ +第䞉次䞊䌠 report.pdf (内容䞍同) → 提瀺冲突需重呜名 ⚠ ``` -### 阶段 5: 玢匕构建 +**䌘势** +- 避免重倍文档 +- 眑络重䌠䞍䌚创建倚䞪文档 +- 节省存傚空闎 -#### 5.1 玢匕类型䞎功胜 +## 6. 玢匕构建 -| 玢匕类型 | 是吊必选 | 功胜描述 | 存傚䜍眮 | -|---------|---------|----------|----------| -| **VECTOR** | ✅ 必选 | 向量化检玢支持语义搜玢 | Qdrant / Elasticsearch | -| **FULLTEXT** | ✅ 必选 | 党文检玢支持关键词搜玢 | Elasticsearch | -| **GRAPH** | ❌ 可选 | 知识囟谱提取实䜓和关系 | Neo4j / PostgreSQL | -| **SUMMARY** | ❌ 可选 | 文档摘芁LLM 生成 | PostgreSQL (index_data) | -| **VISION** | ❌ 可选 | 视觉理解囟片内容分析 | Qdrant (向量) + PG (metadata) | +文档解析后系统䌚自劚构建倚种玢匕让䜠可以甚䞍同方匏检玢。 -#### 5.2 玢匕构建流皋 +### 6.1 䞺什么需芁倚种玢匕 + +䞍同的问题需芁䞍同的检玢方匏 ``` -Celery Worker: reconcile_document_indexes 任务 - │ - â–Œ -1. 扫描 DocumentIndex 衚扟到需芁倄理的玢匕 - │ - ├─► PENDING 状态 + observed_version < version - │ └─ 需芁创建或曎新玢匕 - │ - └─► DELETING 状态 - └─ 需芁删陀玢匕 - │ - â–Œ -2. 按文档分组逐䞪倄理 - │ - â–Œ -3. 对每䞪文档 - │ - ├─► parse_document解析文档 - │ ├─ 从对象存傚䞋蜜原始文件 - │ ├─ 调甚 DocParser 解析 - │ └─ 返回 ParsedDocumentData - │ - └─► 对每䞪玢匕类型 - │ - ├─► create_index (创建/曎新玢匕) - │ │ - │ ├─ VECTOR 玢匕 - │ │ ├─ 文档分块Chunking - │ │ ├─ Embedding 暡型生成向量 - │ │ └─ 写入 Qdrant - │ │ - │ ├─ FULLTEXT 玢匕 - │ │ ├─ 提取纯文本内容 - │ │ ├─ 按段萜/章节分块 - │ │ └─ 写入 Elasticsearch - │ │ - │ ├─ GRAPH 玢匕 - │ │ ├─ 䜿甚 LightRAG 提取实䜓 - │ │ ├─ 提取实䜓闎关系 - │ │ └─ 写入 Neo4j/PostgreSQL - │ │ - │ ├─ SUMMARY 玢匕 - │ │ ├─ 调甚 LLM 生成摘芁 - │ │ └─ 保存到 DocumentIndex.index_data - │ │ - │ └─ VISION 玢匕 - │ ├─ 提取囟片 Assets - │ ├─ Vision LLM 理解囟片内容 - │ ├─ 生成囟片描述向量 - │ └─ 写入 Qdrant - │ - └─► 曎新玢匕状态 - ├─ 成功CREATING → ACTIVE - └─ 倱莥CREATING → FAILED - │ - â–Œ -4. 曎新文档总䜓状态 - │ - ├─ 所有玢匕郜 ACTIVE → Document.status = COMPLETE - ├─ 任䞀玢匕 FAILED → Document.status = FAILED - └─ 郚分玢匕仍圚倄理 → Document.status = RUNNING -``` +问"劂䜕䌘化数据库性胜" +→ 需芁向量玢匕语义盞䌌搜玢 -#### 5.3 文档分块Chunking +问"PostgreSQL 配眮文件圚哪" +→ 需芁党文玢匕粟确关键词搜玢 -**分块策略** -- 递園字笊分割RecursiveCharacterTextSplitter -- 按自然段萜、章节䌘先切分 -- 保留䞊䞋文重叠Overlap +问"匠䞉和李四是什么关系" +→ 需芁囟谱玢匕关系查询 -**分块参数** -```json -{ - "chunk_size": 1000, // 每块最倧字笊数 - "chunk_overlap": 200, // 重叠字笊数 - "separators": ["\n\n", "\n", " ", ""] // 分隔笊䌘先级 -} -``` +问"这䞪文档䞻芁讲什么" +→ 需芁摘芁玢匕快速抂览 -**分块结果存傚** -``` -{document_path}/chunks/ - ├─ chunk_0.json: {"text": "...", "metadata": {...}} - ├─ chunk_1.json: {"text": "...", "metadata": {...}} - └─ ... 
+问"这匠囟片里有什么" +→ 需芁视觉玢匕囟片内容搜玢 ``` -## 数据库讟计 - -### 衚 1: document文档元数据 - -**衚结构** - -| 字段名 | 类型 | 诎明 | 玢匕 | -|--------|------|------|------| -| `id` | String(24) | 文档 ID䞻键栌匏`doc{random_id}` | PK | -| `name` | String(1024) | 文件名 | - | -| `user` | String(256) | 甚户 ID支持倚种 IDP | ✅ Index | -| `collection_id` | String(24) | 所属集合 ID | ✅ Index | -| `status` | Enum | 文档状态见䞋衚 | ✅ Index | -| `size` | BigInteger | 文件倧小字节 | - | -| `content_hash` | String(64) | SHA-256 哈垌甚于去重 | ✅ Index | -| `object_path` | Text | 对象存傚路埄已废匃甚 doc_metadata | - | -| `doc_metadata` | Text | 文档元数据JSON 字笊䞲 | - | -| `gmt_created` | DateTime(tz) | 创建时闎UTC | - | -| `gmt_updated` | DateTime(tz) | 曎新时闎UTC | - | -| `gmt_deleted` | DateTime(tz) | 删陀时闎蜯删陀 | ✅ Index | - -**唯䞀纊束** -```sql -UNIQUE INDEX uq_document_collection_name_active - ON document (collection_id, name) - WHERE gmt_deleted IS NULL; -``` -- 同䞀集合内掻跃文档的名称䞍胜重倍 -- 已删陀的文档䞍参䞎唯䞀性检查 - -**文档状态枚䞟**`DocumentStatus` - -| 状态 | 诎明 | 䜕时讟眮 | 可见性 | -|------|------|----------|--------| -| `UPLOADED` | 已䞊䌠到䞎时存傚 | `upload_document` 接口 | 前端文件选择界面 | -| `PENDING` | 等埅玢匕构建 | `confirm_documents` 接口 | 文档列衚倄理䞭 | -| `RUNNING` | 玢匕构建䞭 | Celery 任务匀始倄理 | 文档列衚倄理䞭 | -| `COMPLETE` | 所有玢匕完成 | 所有玢匕变䞺 ACTIVE | 文档列衚可甚 | -| `FAILED` | 玢匕构建倱莥 | 任䞀玢匕倱莥 | 文档列衚倱莥 | -| `DELETED` | 已删陀 | `delete_document` 接口 | 䞍可见蜯删陀 | -| `EXPIRED` | 䞎时文档过期 | 定时枅理任务 | 䞍可见 | - -**文档元数据瀺䟋**`doc_metadata` JSON 字段 -```json -{ - "object_path": "user-xxx/col_xxx/doc_xxx/original.pdf", - "converted_path": "user-xxx/col_xxx/doc_xxx/converted.pdf", - "processed_content_path": "user-xxx/col_xxx/doc_xxx/processed_content.md", - "images": [ - "user-xxx/col_xxx/doc_xxx/images/page_0.png", - "user-xxx/col_xxx/doc_xxx/images/page_1.png" - ], - "parser_used": "DocRayParser", - "parse_duration_ms": 5420, - "page_count": 50, - "custom_field": "value" -} -``` +### 6.2 五种玢匕 -### 衚 2: document_index玢匕状态管理 - -**衚结构** - -| 字段名 | 类型 | 诎明 | 玢匕 | -|--------|------|------|------| -| `id` | Integer | 自增 ID䞻键 | PK | -| `document_id` | String(24) | 关联的文档 ID | ✅ Index | -| `index_type` | Enum | 玢匕类型见䞋衚 | ✅ Index | -| `status` | Enum | 玢匕状态见䞋衚 | ✅ Index | -| `version` | Integer | 玢匕版本号 | - | -| `observed_version` | Integer | 已倄理的版本号 | - | -| `index_data` | Text | 玢匕数据JSON劂摘芁内容 | - | -| `error_message` | Text | 错误信息倱莥时 | - | -| `gmt_created` | DateTime(tz) | 创建时闎 | - | -| `gmt_updated` | DateTime(tz) | 曎新时闎 | - | -| `gmt_last_reconciled` | DateTime(tz) | 最后协调时闎 | - | - -**唯䞀纊束** -```sql -UNIQUE CONSTRAINT uq_document_index - ON document_index (document_id, index_type); -``` -- 每䞪文档的每种玢匕类型只有䞀条记圕 - -**玢匕类型枚䞟**`DocumentIndexType` - -| 类型 | 倌 | 诎明 | 倖郚存傚 | -|------|-----|------|----------| -| `VECTOR` | "VECTOR" | 向量玢匕 | Qdrant / Elasticsearch | -| `FULLTEXT` | "FULLTEXT" | 党文玢匕 | Elasticsearch | -| `GRAPH` | "GRAPH" | 知识囟谱 | Neo4j / PostgreSQL | -| `SUMMARY` | "SUMMARY" | 文档摘芁 | PostgreSQL (index_data) | -| `VISION` | "VISION" | 视觉玢匕 | Qdrant + PostgreSQL | - -**玢匕状态枚䞟**`DocumentIndexStatus` - -| 状态 | 诎明 | 䜕时讟眮 | -|------|------|----------| -| `PENDING` | 等埅倄理 | `confirm_documents` 创建玢匕记圕 | -| `CREATING` | 创建䞭 | Celery Worker 匀始倄理 | -| `ACTIVE` | 就绪可甚 | 玢匕构建成功 | -| `DELETING` | 标记删陀 | `delete_document` 接口 | -| `DELETION_IN_PROGRESS` | 删陀䞭 | Celery Worker 正圚删陀 | -| `FAILED` | 倱莥 | 玢匕构建倱莥 | - -**版本控制机制** -- `version`期望的玢匕版本每次文档曎新时 +1 -- `observed_version`已倄理的版本号 -- `version > observed_version` 时觊发玢匕曎新 - -**协调噚Reconciler** -```python -# 查询需芁倄理的玢匕 -SELECT * FROM document_index -WHERE status = 
'PENDING' - AND observed_version < version; - -# 倄理后曎新 -UPDATE document_index -SET status = 'ACTIVE', - observed_version = version, - gmt_last_reconciled = NOW() -WHERE id = ?; +```mermaid +flowchart TB + Doc[䜠的文档] --> Auto[系统自劚构建] + + Auto --> V[向量玢匕
扟盞䌌内容] + Auto --> F[党文玢匕
扟关键词] + Auto --> G[囟谱玢匕
扟关系] + Auto --> S[摘芁玢匕
快速了解] + Auto --> I[视觉玢匕
扟囟片] + + V --> Q1[问劂䜕䌘化性胜] + F --> Q2[问配眮文件路埄] + G --> Q3[问A 和 B 的关系] + S --> Q4[问文档讲什么] + I --> Q5[问囟片里有什么] + + style Doc fill:#e1f5ff + style Auto fill:#fff59d + style V fill:#bbdefb + style F fill:#c5e1a5 + style G fill:#ffccbc + style S fill:#e1bee7 + style I fill:#fff9c4 ``` -### 衚关系囟 +**玢匕对比** -``` -┌─────────────────────────────────┐ -│ collection │ -│ ───────────────────────────── │ -│ id (PK) │ -│ name │ -│ config (JSON) │ -│ status │ -│ ... │ -└────────────┬────────────────────┘ - │ 1:N - â–Œ -┌─────────────────────────────────┐ -│ document │ -│ ───────────────────────────── │ -│ id (PK) │ -│ collection_id (FK) │◄──── 唯䞀纊束: (collection_id, name) -│ name │ -│ user │ -│ status (Enum) │ -│ size │ -│ content_hash (SHA-256) │ -│ doc_metadata (JSON) │ -│ gmt_created │ -│ gmt_deleted │ -│ ... │ -└────────────┬────────────────────┘ - │ 1:N - â–Œ -┌─────────────────────────────────┐ -│ document_index │ -│ ───────────────────────────── │ -│ id (PK) │ -│ document_id (FK) │◄──── 唯䞀纊束: (document_id, index_type) -│ index_type (Enum) │ -│ status (Enum) │ -│ version │ -│ observed_version │ -│ index_data (JSON) │ -│ error_message │ -│ gmt_last_reconciled │ -│ ... │ -└─────────────────────────────────┘ -``` +| 玢匕 | 必须 | 适合问题 | 速床 | +|------|------|---------|------| +| 向量 | ✅ | 语义盞䌌 | å¿« | +| å…šæ–‡ | ✅ | 粟确关键词 | å¿« | +| 囟谱 | ❌ | 关系查询 | 慢 | +| 摘芁 | ❌ | 快速了解 | äž­ | +| 视觉 | ❌ | 囟片内容 | äž­ | + +**掚荐配眮** -## 状态机䞎生呜呚期 +- 💰 节省成本只启甚向量 + å…šæ–‡ +- ⚡ 远求速床犁甚囟谱最慢 +- 🎯 功胜完敎党郚启甚 -### 文档状态蜬换 +### 6.3 并行构建 + +倚种玢匕可以同时构建节省时闎 ``` - ┌─────────────────────────────────────────────┐ - │ │ - │ â–Œ - [䞊䌠文件] ──► UPLOADED ──► [确讀] ──► PENDING ──► RUNNING ──► COMPLETE - │ │ - │ â–Œ - │ FAILED - │ │ - │ â–Œ - └──────► [删陀] ──────────────► DELETED - │ - ┌───────────────────────────────────┘ - │ - â–Œ - EXPIRED (定时枅理未确讀的文档) +文档解析完成 + ↓ +5 种玢匕同时匀始构建 +- 向量玢匕1 分钟 +- 党文玢匕30 秒 +- 囟谱玢匕10 分钟 ⏱ (最慢) +- 摘芁玢匕3 分钟 +- 视觉玢匕2 分钟 + ↓ +总时闎10 分钟最慢的那䞪 +劂果䞲行16.5 分钟 + +节省40% 时闎 ``` -**关键蜬换** -1. **UPLOADED → PENDING**甚户点击"保存到集合" -2. **PENDING → RUNNING**Celery Worker 匀始倄理 -3. **RUNNING → COMPLETE**所有玢匕郜成功 -4. **RUNNING → FAILED**任䞀玢匕倱莥 -5. **任䜕状态 → DELETED**甚户删陀文档 +### 6.4 自劚重试 -### 玢匕状态蜬换 +劂果某䞪玢匕构建倱莥系统䌚自劚重试 ``` - [创建玢匕记圕] ──► PENDING ──► CREATING ──► ACTIVE - │ - â–Œ - FAILED - │ - â–Œ - ┌──────────► PENDING (重试) - │ - [删陀请求] ──────┌──────────► DELETING ──► DELETION_IN_PROGRESS ──► (记圕删陀) - │ - └──────────► (盎接删陀记圕劂果 PENDING/FAILED) +第 1 次1 分钟后重试 +第 2 次5 分钟后重试 +第 3 次15 分钟后重试 +仍倱莥 → 标记䞺倱莥通知甚户 ``` -## 匂步任务调床Celery - -### 任务定义 - -**䞻任务**`reconcile_document_indexes` -- 觊发时机 - - `confirm_documents` 接口调甚后 - - 定时任务每 30 秒 - - 手劚觊发管理界面 -- 功胜扫描 `document_index` 衚倄理需芁协调的玢匕 +倧郚分䞎时错误眑络问题、服务重启郜胜自劚恢倍 -**子任务** -- `parse_document_task`解析文档内容 -- `create_vector_index_task`创建向量玢匕 -- `create_fulltext_index_task`创建党文玢匕 -- `create_graph_index_task`创建知识囟谱玢匕 -- `create_summary_index_task`创建摘芁玢匕 -- `create_vision_index_task`创建视觉玢匕 +## 7. 技术实现 -### 任务调床策略 +> 💡 **阅读建议**这䞀章是技术细节䞻芁面向匀发者和运绎人员。普通甚户可以跳过。 -**并发控制** -- 每䞪 Worker 最倚同时倄理 N 䞪文档默讀 4 -- 每䞪文档的倚䞪玢匕可以并行构建 -- 䜿甚 Celery 的 `task_acks_late=True` 确保任务䞍䞢倱 +### 7.1 存傚架构 -**倱莥重试** -- 最倚重试 3 次 -- 指数退避1分钟 → 5分钟 → 15分钟 -- 3 次倱莥后标记䞺 `FAILED` +**文件存傚䜍眮** -**幂等性** -- 所有任务支持重倍执行 -- 䜿甚 `observed_version` 机制避免重倍倄理 -- 盞同蟓入产生盞同蟓出 +``` +本地存傚匀发 +.objects/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png -## 讟计特点䞎䌘势 +云存傚生产 +s3://bucket/user-xxx/collection-xxx/doc-xxx/ + ├── original.pdf + └── images/page_0.png +``` -### 1. 
䞀阶段提亀讟计 +**配眮** -**䌘势** -- ✅ **甚户䜓验曎奜**快速䞊䌠响应䞍阻塞甚户操䜜 -- ✅ **选择性添加**批量䞊䌠后可选择性确讀郚分文件 -- ✅ **资源控制合理**未确讀的文档䞍构建玢匕䞍消耗配额 -- ✅ **故障恢倍友奜**䞎时文档可以定期枅理䞍圱响䞚务 +```bash +# 本地存傚 +export OBJECT_STORE_TYPE=local -**状态隔犻** -``` -䞎时状态UPLOADED - - 䞍计入配额 - - 䞍觊发玢匕 - - 可以被自劚枅理 - -正匏状态PENDING/RUNNING/COMPLETE - - 计入配额 - - 觊发玢匕构建 - - 䞍䌚被自劚枅理 +# 云存傚S3/MinIO +export OBJECT_STORE_TYPE=s3 +export OBJECT_STORE_S3_BUCKET=aperag ``` -### 2. 幂等性讟计 +### 7.2 解析噚配眮 -**文件级别幂等** -- SHA-256 哈垌去重 -- 盞同文件倚次䞊䌠返回同䞀 `document_id` -- 避免存傚空闎浪莹 +**启甚䞍同解析噚** -**接口级别幂等** -- `upload_document`重倍䞊䌠返回已存圚文档 -- `confirm_documents`重倍确讀䞍䌚创建重倍玢匕 -- `delete_document`重倍删陀返回成功蜯删陀 +```bash +# DocRay掚荐免莹效果奜 +export USE_DOC_RAY=true +export DOCRAY_HOST=http://docray:8639 -### 3. 倚租户隔犻 +# MinerU可选付莹粟床最高 +export USE_MINERU_API=false +export MINERU_API_TOKEN=your_token -**存傚隔犻** -``` -user-{user_A}/... # 甚户 A 的文件 -user-{user_B}/... # 甚户 B 的文件 +# MarkItDown默讀启甚兜底 +export USE_MARKITDOWN=true ``` -**数据库隔犻** -- 所有查询郜垊 `user` 字段过滀 -- 集合级别的权限控制`collection.user` -- 蜯删陀支持`gmt_deleted` +**选择建议** +- 💰 免莹方案DocRay + MarkItDown +- 🎯 高粟床MinerU + DocRay + MarkItDown + +### 7.3 玢匕配眮 -### 4. 灵掻的存傚后端 +圚 Collection 配眮䞭控制启甚哪些玢匕 -**统䞀接口** -```python -AsyncObjectStore: - - put(path, data) - - get(path) - - delete_objects_by_prefix(prefix) +```json +{ + "enable_vector": true, // 向量玢匕必选 + "enable_fulltext": true, // 党文玢匕必选 + "enable_knowledge_graph": true, // 囟谱玢匕可选 + "enable_summary": false, // 摘芁玢匕可选 + "enable_vision": false // 视觉玢匕可选 +} ``` -**运行时切换** -- 通过环境变量切换 Local/S3 -- 无需修改䞚务代码 -- 支持自定义存傚后端实现接口即可 +### 7.4 性胜调䌘 -### 5. 事务䞀臎性 +**文件倧小限制** -**数据库 + 对象存傚的䞀阶段提亀** -```python -async with transaction: - # 1. 创建数据库记圕 - document = create_document_record() - - # 2. 䞊䌠到对象存傚 - await object_store.put(path, data) - - # 3. 曎新元数据 - document.doc_metadata = json.dumps(metadata) - - # 所有操䜜成功才提亀任䞀倱莥则回滚 +```bash +export MAX_DOCUMENT_SIZE=104857600 # 100 MB +export MAX_EXTRACTED_SIZE=5368709120 # 5 GB ``` -**倱莥倄理** -- 数据库记圕创建倱莥䞍䞊䌠文件 -- 文件䞊䌠倱莥回滚数据库记圕 -- 元数据曎新倱莥回滚前面的操䜜 +**并发讟眮** + +```bash +export CELERY_WORKER_CONCURRENCY=16 # 并发倄理 16 䞪文档 +export CELERY_TASK_TIME_LIMIT=3600 # 单䞪任务超时 1 小时 +``` -### 6. 可观测性 +**配额讟眮** -**审计日志** -- `@audit` 装饰噚记圕所有文档操䜜 -- 包含甚户、时闎、操䜜类型、资源 ID +```bash +export MAX_DOCUMENT_COUNT=1000 # 甚户最倚 1000 䞪文档 +export MAX_DOCUMENT_COUNT_PER_COLLECTION=100 # 单集合最倚 100 䞪 +``` -**任务远螪** -- `gmt_last_reconciled`最后倄理时闎 -- `error_message`倱莥原因 -- Celery 任务 ID关联日志远螪 +## 8. 垞见问题 -**监控指标** -- 文档䞊䌠速率 -- 玢匕构建耗时 -- 倱莥率统计 +### 8.1 文件䞊䌠倱莥 -## 性胜䌘化 +**可胜原因和解决方法** -### 1. 匂步倄理 +| 问题 | 原因 | 解决方法 | +|------|------|---------| +| 文件倪倧 | 超过 100 MB | 压猩或分割文件 | +| 栌匏䞍支持 | 特殊栌匏 | 蜬换成 PDF 或其他垞见栌匏 | +| 同名冲突 | 已存圚同名䞍同内容文件 | 重呜名文件 | +| 配额已满 | 蟟到文档数量䞊限 | 删陀旧文档或升级配额 | -**䞊䌠䞍阻塞** -- 文件䞊䌠到对象存傚后立即返回 -- 玢匕构建圚 Celery 䞭匂步执行 -- 前端通过蜮询或 WebSocket 获取进床 +### 8.2 文档倄理倱莥 -### 2. 批量操䜜 +系统䌚自劚重试 3 次劂果仍倱莥 -**批量确讀** -```python -confirm_documents(document_ids=[id1, id2, ..., idN]) ``` -- 䞀次事务倄理倚䞪文档 -- 批量创建玢匕记圕 -- 减少数据库埀返 - -### 3. 猓存策略 - -**解析结果猓存** -- 解析后的内容保存到 `processed_content.md` -- 后续玢匕重建可盎接读取无需重新解析 - -**分块结果猓存** -- 分块结果保存到 `chunks/` 目圕 -- 向量玢匕重建可倍甚分块结果 - -### 4. 
并行玢匕构建 - -**倚玢匕并行** -```python -# VECTOR、FULLTEXT、GRAPH 可以并行构建 -await asyncio.gather( - create_vector_index(), - create_fulltext_index(), - create_graph_index() -) +查看错误信息 → 根据提瀺修倍 → 重新䞊䌠 → 系统自劚重试 ``` -## 错误倄理 +垞见错误 +- 文件损坏 → 重新制䜜文件 +- 内容无法识别 → 尝试蜬换栌匏 +- 䞎时眑络问题 → 系统䌚自劚重试 -### 垞见匂垞 +### 8.3 劂䜕加快倄理速床 -| 匂垞类型 | HTTP 状态码 | 觊发场景 | 倄理建议 | -|---------|------------|----------|----------| -| `ResourceNotFoundException` | 404 | 集合/文档䞍存圚 | 检查 ID 是吊正确 | -| `CollectionInactiveException` | 400 | 集合未激掻 | 等埅集合初始化完成 | -| `DocumentNameConflictException` | 409 | 同名䞍同内容 | 重呜名文件或删陀旧文档 | -| `QuotaExceededException` | 429 | 配额超限 | 升级套逐或删陀旧文档 | -| `InvalidFileTypeException` | 400 | 䞍支持的文件类型 | 查看支持的文件类型列衚 | -| `FileSizeTooLargeException` | 413 | 文件过倧 | 分割文件或压猩 | +**方法 1**犁甚䞍需芁的玢匕 -### 匂垞䌠播 - -``` -Service Layer 抛出匂垞 - │ - â–Œ -View Layer 捕获并蜬换 - │ - â–Œ -Exception Handler 统䞀倄理 - │ - â–Œ -返回标准 JSON 响应 +```json { - "error_code": "QUOTA_EXCEEDED", - "message": "Document count limit exceeded", - "details": { - "limit": 1000, - "current": 1000 - } + "enable_knowledge_graph": false // 囟谱最慢可选犁甚 } ``` -## 盞关文件玢匕 - -### 栞心实现 +**方法 2**䜿甚曎快的 LLM 暡型 -- **View 层**`aperag/views/collections.py` - HTTP 接口定义 -- **Service 层**`aperag/service/document_service.py` - 䞚务逻蟑 -- **数据库暡型**`aperag/db/models.py` - Document, DocumentIndex 衚定义 -- **数据库操䜜**`aperag/db/ops.py` - CRUD 操䜜封装 +圚 Collection 配眮䞭选择响应曎快的暡型。 -### 对象存傚 +### 8.4 暂存区文件䌚䞢倱吗 -- **接口定义**`aperag/objectstore/base.py` - AsyncObjectStore 抜象类 -- **Local 实现**`aperag/objectstore/local.py` - 本地文件系统存傚 -- **S3 实现**`aperag/objectstore/s3.py` - S3 兌容存傚 +- ✅ 7 倩内䞍䌚䞢倱可以随时确讀 +- ⚠ 7 倩后自劚枅理节省存傚 +- 💡 建议䞊䌠后及时确讀 -### 文档解析 +## 9. 总结 -- **䞻控制噚**`aperag/docparser/doc_parser.py` - DocParser -- **Parser 实现** - - `aperag/docparser/mineru_parser.py` - MinerU PDF 解析 - - `aperag/docparser/docray_parser.py` - DocRay 文档解析 - - `aperag/docparser/markitdown_parser.py` - MarkItDown 通甚解析 - - `aperag/docparser/image_parser.py` - 囟片 OCR - - `aperag/docparser/audio_parser.py` - 音频蜬圕 -- **文档倄理**`aperag/index/document_parser.py` - 解析流皋猖排 +ApeRAG 的文档䞊䌠让䜠可以蜻束地把各种栌匏的文档添加到知识库。 -### 玢匕构建 +### 栞心䌘势 -- **玢匕管理**`aperag/index/manager.py` - DocumentIndexManager -- **向量玢匕**`aperag/index/vector_index.py` - VectorIndexer -- **党文玢匕**`aperag/index/fulltext_index.py` - FulltextIndexer -- **知识囟谱**`aperag/index/graph_index.py` - GraphIndexer -- **文档摘芁**`aperag/index/summary_index.py` - SummaryIndexer -- **视觉玢匕**`aperag/index/vision_index.py` - VisionIndexer +1. ✅ **支持 20+ 种栌匏**PDF、Word、Excel、囟片、音频等 +2. ✅ **秒级䞊䌠响应**䞍甚等埅立即返回 +3. ✅ **暂存区讟计**先䌠后选避免误操䜜 +4. ✅ **智胜解析**自劚识别栌匏选择最䜳解析噚 +5. ✅ **倚玢匕构建**同时构建 5 种玢匕满足䞍同检玢需求 +6. ✅ **后台倄理**匂步执行䞍阻塞甚户 +7. ✅ **自劚重试**倱莥自劚重试提高成功率 +8. ✅ **配额管理**确讀时才消耗合理控制资源 -### 任务调床 +### 性胜衚现 -- **任务定义**`config/celery_tasks.py` - Celery 任务泚册 -- **协调噚**`aperag/tasks/reconciler.py` - DocumentIndexReconciler -- **文档任务**`aperag/tasks/document.py` - DocumentIndexTask +| 操䜜 | æ—¶é—Ž | +|------|------| +| 䞊䌠 100 䞪文件 | < 1 分钟 | +| 确讀添加 | < 1 秒 | +| 小文档倄理< 10 页 | 1-3 分钟 | +| 䞭型文档10-50 页 | 3-10 分钟 | +| 倧型文档100+ 页 | 10-30 分钟 | -### 前端实现 +### 适甚场景 -- **文档列衚**`web/src/app/workspace/collections/[collectionId]/documents/page.tsx` -- **文档䞊䌠**`web/src/app/workspace/collections/[collectionId]/documents/upload/document-upload.tsx` +- 📚 䌁䞚知识库建讟 +- 🔬 研究资料敎理 +- 📖 䞪人笔记管理 +- 🎓 孊习资料園档 -## 总结 +敎䞪系统既**简单易甚**又**功胜区倧**适合各种规暡的知识管理需求。 -ApeRAG 的文档䞊䌠暡块采甚**䞀阶段提亀 + 倚 Parser 铟匏调甚 + 倚玢匕并行构建**的架构讟计 +--- -**栞心特性** -1. ✅ **䞀阶段提亀**䞊䌠䞎时存傚→ 确讀正匏添加提䟛曎奜的甚户䜓验 -2. ✅ **SHA-256 去重**避免重倍文档支持幂等䞊䌠 -3. 
✅ **灵掻存傚后端**Local/S3 可配眮切换统䞀接口抜象 -4. ✅ **倚 Parser 架构**支持 MinerU、DocRay、MarkItDown 等倚种解析噚 -5. ✅ **栌匏自劚蜬换**PDF→囟片、音频→文本、囟片→OCR 文本 -6. ✅ **倚玢匕协调**向量、党文、囟谱、摘芁、视觉五种玢匕类型 -7. ✅ **配额管理**确讀阶段才扣陀配额合理控制资源 -8. ✅ **匂步倄理**Celery 任务队列䞍阻塞甚户操䜜 -9. ✅ **事务䞀臎性**数据库 + 对象存傚的䞀阶段提亀 -10. ✅ **可观测性**审计日志、任务远螪、错误信息完敎记圕 +## 盞关文档 -这种讟计既保证了高性胜和可扩展性又支持倍杂的文档倄理场景倚栌匏、倚语蚀、倚暡态同时具有良奜的容错胜力和甚户䜓验。 +- 📋 [系统架构](./architecture.md) - ApeRAG 敎䜓架构讟计 +- 📖 [囟玢匕构建流皋](./graph_index_creation.md) - 囟谱玢匕诊解 +- 🔗 [玢匕铟路架构](./indexing_architecture.md) - 完敎玢匕流皋
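+
+---
+
+**附圕两阶段䞊䌠调甚瀺䟋**
+
+䞋面是䞀段最小化的 Python 瀺意代码挔瀺"先䞊䌠到暂存区、再确讀入库"的䞀步流皋。接口路埄`/documents/upload`、`/documents/confirm`、`document_ids` 参数和状态流蜬UPLOADED → PENDING取自早期讟计文档其䞭服务地址、集合 ID、讀证方匏和 multipart 字段名均䞺假讟仅䜜参考实际字段请以 OpenAPI 定义䞺准。
+
+```python
+import requests
+
+BASE = "http://localhost:8000/api/v1"          # 假讟的服务地址仅䜜瀺意
+COLLECTION_ID = "col_abc123"                   # 假讟的集合 ID
+HEADERS = {"Authorization": "Bearer <token>"}  # 按实际讀证方匏替换
+
+# 第䞀步䞊䌠到暂存区文档状态䞺 UPLOADED䞍占配额、䞍觊发玢匕
+with open("annual_report.pdf", "rb") as f:
+    resp = requests.post(
+        f"{BASE}/collections/{COLLECTION_ID}/documents/upload",
+        headers=HEADERS,
+        files={"file": ("annual_report.pdf", f, "application/pdf")},  # 字段名䞺假讟
+    )
+resp.raise_for_status()
+document_id = resp.json()["document_id"]
+
+# 第二步确讀入库状态 UPLOADED → PENDING后台匀始构建玢匕
+resp = requests.post(
+    f"{BASE}/collections/{COLLECTION_ID}/documents/confirm",
+    headers=HEADERS,
+    json={"document_ids": [document_id]},
+)
+resp.raise_for_status()
+print(resp.json())  # 诸劂 confirmed_count / failed_count 等统计信息
+```
+
+确讀成功后文档䌚进入后台倄理流皋可圚文档列衚䞭查看各类玢匕的构建进床。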