OCRFlux Pipeline

### Pipeline Name

OCRFlux

### URL

https://github.com/chatdoc-com/OCRFlux

### GitHub URL

https://github.com/chatdoc-com/OCRFlux

### License

Apache-2.0

### Custom License

_No response_

### Pipeline Description

OCRFlux is a multimodal large language model based toolkit designed to convert PDFs and images into clean, readable, plain Markdown text. It excels in complex layout handling, including multi-column layouts, figures, insets, complicated tables, and equations. The system also provides automated removal of headers and footers, alongside native support for cross-page table and paragraph merging, a pioneering feature among open-source OCR tools. Built on a 3 billion parameter vision-language model, it can run efficiently on GPUs such as the GTX 3090. OCRFlux provides batch inference support for whole documents and detailed parsing quality with benchmarks demonstrating significant improvements over several leading OCR models.​

### Primary Language

_No response_

### Demo (if available)

https://ocrflux.pdfparser.io/

### Has the pipeline been benchmarked? If yes, provide benchmark results or a link to evaluation metrics.

_No response_

### Does it have an API?

No

### API URL (if applicable)

_No response_

### API Pricing Page (if applicable)

_No response_

### API Average Price per 1000 Page (if applicable)

_No response_

### Additional Notes

- Recommended GPU: 24GB or more VRAM for best performance, but supports tensor parallelism to divide workload across multiple smaller GPUs
- Includes Docker container support for easy deployment
- Supports various command-line options for customizing inference, GPU memory utilization, page merging behavior, and data type selection
- Outputs results as JSONL files convertible into Markdown documents
- Developed and maintained by ChatDOC team
- Has 2.3k stars on GitHub


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCRFlux Pipeline #31

Pipeline Name

URL

GitHub URL

License

Custom License

Pipeline Description

Primary Language

Demo (if available)

Has the pipeline been benchmarked? If yes, provide benchmark results or a link to evaluation metrics.

Does it have an API?

API URL (if applicable)

API Pricing Page (if applicable)

API Average Price per 1000 Page (if applicable)

Additional Notes

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

OCRFlux Pipeline #31

Description

Pipeline Name

URL

GitHub URL

License

Custom License

Pipeline Description

Primary Language

Demo (if available)

Has the pipeline been benchmarked? If yes, provide benchmark results or a link to evaluation metrics.

Does it have an API?

API URL (if applicable)

API Pricing Page (if applicable)

API Average Price per 1000 Page (if applicable)

Additional Notes

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions