prompt-prix

Find your optimal open-weights model.

prompt-prix is a visual tool for running benchmark test suites across multiple LLMs simultaneously, helping you discover which model and quantization best fits your VRAM constraints and task requirements.

The Problem

You have a 24GB GPU. Should you run qwen2.5-72b-instruct-q4_k_m or llama-3.1-70b-instruct-q5_k_s for tool calling? BFCL gives you leaderboard scores for full-precision models, but that doesn't tell you how those models behave once quantized and squeezed into your VRAM budget. prompt-prix measures a different kind of metric: how your candidate models perform on your hardware, on your tasks.

The Solution

Run existing benchmarks against your candidate models, on your hardware, and see results side-by-side.

  • Fan-out dispatch: Same test case → N models in parallel (see the sketch after this list)
  • Work-stealing scheduler: Efficient multi-GPU utilization across heterogeneous workstations
  • Visual comparison: Real-time streaming with Model × Test result grid
  • Benchmark-native: Consumes BFCL and Inspect AI test formats directly
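
Conceptually, the fan-out step is the same test case dispatched concurrently to every candidate model through an OpenAI-compatible endpoint. A minimal sketch, assuming LM Studio's default local endpoint and illustrative model names (not the project's actual code):

    # Hypothetical fan-out dispatch: one test case sent to N models in parallel
    # via an OpenAI-compatible /v1/chat/completions endpoint (e.g. LM Studio).
    # The base URL, model names, and test-case shape are assumptions for illustration.
    import asyncio
    import httpx

    BASE_URL = "http://localhost:1234/v1"  # assumed LM Studio default

    async def run_case(client: httpx.AsyncClient, model: str, messages: list[dict]) -> dict:
        resp = await client.post(
            f"{BASE_URL}/chat/completions",
            json={"model": model, "messages": messages},
            timeout=120.0,
        )
        resp.raise_for_status()
        return {"model": model, "response": resp.json()}

    async def fan_out(models: list[str], messages: list[dict]) -> list[dict]:
        async with httpx.AsyncClient() as client:
            # Same test case dispatched to every candidate model concurrently.
            return await asyncio.gather(*(run_case(client, m, messages) for m in models))

    if __name__ == "__main__":
        case = [{"role": "user", "content": "What is the weather in Paris? Use the weather tool."}]
        results = asyncio.run(fan_out(["qwen2.5-7b-instruct", "llama-3.1-8b-instruct"], case))
        for r in results:
            print(r["model"], "->", r["response"]["choices"][0]["message"])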

Status

🚧 Active Development

The working codebase is on the development/testing branch.

Ecosystem Position

Tool          Purpose
BFCL          Function-calling benchmark with leaderboard
Inspect AI    Evaluation framework (UK AISI)
prompt-prix   Visual fan-out for model selection

prompt-prix complements these tools—it's a visual layer for comparing models during selection, not a replacement for rigorous evaluation.

Architecture Highlights

  • Adapter pattern: OpenAI-compatible API now (LM Studio), extensible to Ollama/vLLM (see the sketch after this list)
  • Fail-fast validation: Invalid benchmark files rejected immediately
  • Pydantic state management: Explicit, typed, observable
  • Work-stealing dispatcher: Asymmetric GPU setups handled automatically
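
A minimal sketch of the adapter idea, assuming a hypothetical ModelAdapter interface and LM Studio's OpenAI-compatible endpoint; class and method names are illustrative, not the project's actual API:

    # Hypothetical adapter pattern: a small base interface that an
    # OpenAI-compatible backend implements today, leaving room for other backends.
    from abc import ABC, abstractmethod
    import httpx

    class ModelAdapter(ABC):
        @abstractmethod
        async def complete(self, model: str, messages: list[dict]) -> dict:
            """Send one chat-completion request and return the raw response."""

    class OpenAICompatibleAdapter(ModelAdapter):
        def __init__(self, base_url: str = "http://localhost:1234/v1"):
            self.base_url = base_url

        async def complete(self, model: str, messages: list[dict]) -> dict:
            async with httpx.AsyncClient() as client:
                resp = await client.post(
                    f"{self.base_url}/chat/completions",
                    json={"model": model, "messages": messages},
                    timeout=120.0,
                )
                resp.raise_for_status()
                return resp.json()

    # Since Ollama and vLLM also expose OpenAI-compatible endpoints, swapping
    # backends can be as small as pointing the same adapter at a different base URL.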

License

MIT

(C) 2025 Reflective Attention