    Repositories list

    • Bud AI Foundry - A comprehensive inference stack for compound AI deployment, optimization and scaling. Bud Stack provides intelligent infrastructure automation…
      Python
      3103920 Updated Feb 16, 2026
    • BudConnect is a cloud service that provides compatibility checking and update synchronization for Bud inference runtimes on customer infrastructure. It acts as …
      Python
      0001 Updated Feb 13, 2026
    • Official Python SDK for the BudAI Foundry Platform.
      Python
      0000 Updated Feb 13, 2026
    • simulator

      Public
      Simulation of decoder-only, encoder-only, and diffusion models for SLO, memory, and infrastructure calculations, covering both model inference and training.
      Python
      22001 Updated Feb 8, 2026
    • WaaV

      Public
      Bud WaaV is a high-performance, scalable audio AI gateway written in Rust.
      Rust
      0201 Updated Feb 4, 2026
    • LayerZero

      Public
      LayerZero is a GenAI kernel orchestration and dispatch system that allows inference engine developers to easily develop cross-platform inference engines.
      Python
      0000 Updated Feb 3, 2026
    • aibrix

      Public
      Cost-efficient and pluggable infrastructure components for GenAI inference
      Go
      524021 Updated Jan 26, 2026
    • scaler

      Public
      Bud Scaler is a set of Kubernetes components designed for GenAI workloads. It provides intelligent scaling, GPU virtualisation, routing strategies, and more.
      Go
      0001 Updated Jan 26, 2026
    • GenAI components at micro-service level; GenAI service composer to create mega-service
      Python
      218020 Updated Jan 22, 2026
    • Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterpris…
      Shell
      338000 Updated Jan 21, 2026
    • Python
      0001 Updated Jan 16, 2026
    • budtiktok

      Public
      The fastest tokeniser ever, with SIMD & Algorithmic optimisations
      Python
      0000 Updated Jan 16, 2026
    • stove8s

      Public
      Preheats your containers for you
      Go
      0100 Updated Jan 13, 2026
    • A manager that loads vLLM plugins without rebuilding the image for each new plugin.
      Python
      0000 Updated Jan 10, 2026
    • Bud Flow Lang is a domain-specific language embedded in Python that helps you easily write portable, high-performance SIMD programs.
      C++
      0000 Updated Jan 7, 2026
    • Profile SIMD code across multiple ISAs and hardware with ease. SIMD bench also comes with a perf analyser that automatically recommends optimisations.
      C++
      0000 Updated Dec 30, 2025
    • A GPU virtualisation benchmarking tool for LLM-like workloads, covering overhead evaluation and resource isolation metrics.
      Cuda
      0800 Updated Dec 30, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      13k000 Updated Dec 26, 2025
    • vLLM plugins for additional features such as decoding strategies, monitoring, and models.
      Python
      0000 Updated Dec 24, 2025
    • Python
      0132 Updated Dec 15, 2025
    • A curated list of plugins built on top of vLLM
      0600 Updated Dec 12, 2025
    • scid

      Public
      CI/CD/Stateless/Helm/Sops/ExecJobs/WatchPaths/Git/SlackIntegration
      Go
      1000 Updated Dec 4, 2025
    • An agent that can detect legal obligations with an SLM
      Python
      0000 Updated Dec 2, 2025
    • OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets…
      Python
      737000 Updated Nov 26, 2025
    • .github

      Public
      0000 Updated Nov 26, 2025
    • HAMi

      Public
      Heterogeneous AI Computing Virtualization Middleware (Project under CNCF)
      Go
      475000 Updated Nov 24, 2025
    • A tool to download and upload models from MinIO to a local path.
      Python
      0000 Updated Oct 20, 2025
    • Nix
      0000 Updated Sep 30, 2025
    • d-UI

      Public
      A protocol layer for LLMs to create, interact with and respond to UI interactions by the end-user, allowing LLMs/Tools/APIs to also capture, respond and Interac…
      Python
      0000 Updated Sep 29, 2025
    • NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      452000 Updated Sep 13, 2025