Skip to content

v0.3.4: OCR support, batch indexing in Meilisearch

Choose a tag to compare

@wayneshn wayneshn released this 26 Sep 15:04
· 10 commits to main since this release
b71dd55

What's Changed

  • Feat: Tika Integration and Batch Indexing by @wayneshn and @axeldunkel in #132

    • Enhanced Text Extraction: We've integrated Apache Tika to provide text and metadata extraction from a wide range of file types, including PDFs, Office documents, and image-based files. This improves the search capabilities by making the content of attachments fully searchable.
    • Improved Indexing Performance: The indexing process now supports batching, which will significantly speeds up the ingestion and indexing of large volumes of emails.
    • Configurable Batch Size: The batch size can be configured using the MEILI_INDEXING_BATCH environment variable.
  • docs: Add OCR docs by @wayneshn in #144

Full Changelog: v0.3.3...v0.3.4