    Repositories list

    • parsekit

      Public
      Ruby document parsing toolkit with zero runtime dependencies. Parse PDFs, DOCX, XLSX, and images (with OCR) using a single, lightweight gem. Statically links Mu…
      Ruby
      14306 Updated Feb 16, 2026
    • Ruby gem for running state-of-the-art language models locally. Access LLMs, embeddings, rerankers, and NER models directly from Ruby using Rust-powered Candle w…
      Rust
      619042 Updated Feb 3, 2026
    • Ragnar is a pure Ruby command-line RAG (Retrieval-Augmented Generation) tool with zero external dependencies. It provides local document indexing, semantic sear…
      Ruby
      2804 Updated Feb 2, 2026
    • topical

      Public
      Ruby library for fast, flexible topic modeling — built on modern embeddings and clustering techniques to uncover themes in text.
      Ruby
      01113 Updated Jan 29, 2026
    • High-performance UMAP dimensionality reduction for Ruby, powered by the annembed Rust crate. Fast, memory-efficient manifold learning with model persistence.
      Ruby
      2600 Updated Jan 29, 2026
    • Turn any Thor CLI into an interactive REPL with persistent state, auto-completion, and configurable default handlers for unrecognized input.
      Ruby
      0201 Updated Jan 28, 2026
    • lancelot

      Public
      Ruby bindings for the Lance columnar data format. Built on the Lance Rust crate, Lancelot brings high-performance vector search, full-text search, and hybrid re…
      Ruby
      0702 Updated Dec 15, 2025
    • tokenkit

      Public
      Fast, Rust-backed word-level tokenization for Ruby. Unlike subword tokenizers (BPE, WordPiece) designed for LLMs, TokenKit provides linguistic tokenization for …
      Ruby
      0302 Updated Dec 15, 2025
    • spellkit

      Public
      Fast, safe typo correction for Ruby. SymSpell-based spell checker with Rust performance, term protection via regex patterns, and hot-reloadable dictionaries. Su…
      Ruby
      0802 Updated Dec 15, 2025
    • Support for local LLMs in RubyLLM, running inside your Ruby process
      Ruby
      1100 Updated Dec 11, 2025
    • fastsheet

      Public
      FastSheet is the fastest XLSX file parser for Ruby (at the time of release). It leverages a Rust library for high-performance parsing, making it significantly f…
      Ruby
      7100 Updated Oct 9, 2025
    • Scientist-labs portfolio page.
      TypeScript
      2.1k000 Updated Sep 30, 2025
    • phrasekit

      Public
      Weak supervision for NER: mine domain-specific phrases from unlabeled corpora, score by salience, and auto-generate training labels. Ruby gem with high-performa…
      Rust
      0001 Updated Sep 28, 2025
    • indradb

      Public
      A graph database written in Rust
      Rust
      132101 Updated Sep 25, 2025
    • annembed

      Public
      data embedding based on approximate nearest neighbour
      Rust
      8003 Updated Sep 6, 2025
    • Rust implementation of the HNSW algorithm (Malkov-Yashunin)
      Rust
      38001 Updated Aug 19, 2025
    • rrf

      Public
      Reciprocal Rank Fusion for Ruby
      Ruby
      0511 Updated Jul 18, 2025