LLMTrace

Zero-code LLM observability and security for production.

LLMTrace is a transparent proxy that captures, analyzes, and secures your LLM interactions in real-time. Drop it between your app and any OpenAI-compatible API to get instant visibility into prompt injection attacks, PII leaks, cost overruns, and performance bottlenecks — without changing a single line of code.

Why LLMTrace?

Production LLM applications face three critical blind spots:

Security vulnerabilities — Prompt injection, data leakage, PII exposure
Cost runaway — Uncontrolled API spend, inefficient token usage
Performance opacity — No visibility into latency, failure rates, or user behavior

LLMTrace solves this by sitting transparently between your application and LLM providers, giving you complete observability and control.

Key Features

Transparent Proxy — Drop-in replacement for any OpenAI-compatible API
ML Ensemble Detection — Multi-detector majority voting (regex, DeBERTa, InjecGuard, PIGuard)
Real-time Security — Prompt injection detection, PII scanning, data leakage prevention
Performance Monitoring — Latency, token usage, streaming metrics (TTFT), error tracking
Cost Control — Per-agent budgets, rate limits, anomaly detection
Multi-tenant Ready — Isolated per API key or custom tenant headers
High Performance — Built in Rust, handles streaming responses, circuit breaker protection

Security Performance

Metric	Value
Accuracy	87.6%
Precision	95.5%
F1 Score	86.9%
Recall	79.7%

Tested on a 153-sample adversarial corpus across 12 attack categories including CyberSecEval2, BIPIA, TensorTrust, and InjecAgent. See benchmarks/ for methodology and full results.

Quick Start

1. Install

curl -sS https://raw.githubusercontent.com/epappas/llmtrace/main/scripts/install.sh | bash

Or use one of the other methods:

cargo install llmtrace                # from crates.io
docker pull ghcr.io/epappas/llmtrace-proxy:latest  # Docker

2. Run

export OPENAI_API_KEY="sk-..."
llmtrace-proxy --config config.yaml

3. Try it with your existing code

import openai

# Before: Point to OpenAI directly
client = openai.OpenAI()

# After: Point to LLMTrace proxy (that's it!)
client = openai.OpenAI(base_url="http://localhost:8080/v1")

# Your code stays exactly the same
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)

4. See your traces

# View recent activity
curl http://localhost:8080/api/v1/traces | jq '.[0]'

# Check security findings
curl http://localhost:8080/api/v1/security/findings | jq

# Monitor costs
curl http://localhost:8080/api/v1/costs/current | jq

That's it! You now have full observability into your LLM interactions.

Architecture

graph LR
    A[Your Application] -->|HTTP| B[LLMTrace Proxy]
    B -->|HTTP| C[OpenAI/LLM Provider]
    B -->|Async| D[Security Engine]
    B -->|Async| E[Storage Engine]

    D --> F[SQLite/PostgreSQL]
    E --> F
    D --> G[Real-time Alerts]

    H[Dashboard] -->|REST API| B
    I[Monitoring] -->|Metrics API| B

    style B fill:#e1f5fe
    style D fill:#fff3e0
    style E fill:#f3e5f5

How it works:

Transparent Proxy — Your app sends requests to LLMTrace instead of OpenAI
Pass-through — LLMTrace forwards requests to the real LLM provider
Background Analysis — Security analysis and trace capture happen asynchronously
Zero Impact — Your application never waits for analysis, even if something fails

Integration Examples

OpenAI Python SDK

import openai

# Just change the base_url
client = openai.OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="your-openai-key"
)

OpenAI Node.js SDK

import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'http://localhost:8080/v1',
  apiKey: 'your-openai-key'
});

LangChain

from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    base_url="http://localhost:8080/v1",
    api_key="your-openai-key"
)

curl

curl http://localhost:8080/v1/chat/completions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4", "messages": [{"role": "user", "content": "Hello!"}]}'

View all integration guides ->

Dashboard & Monitoring

LLMTrace includes a built-in dashboard for visualizing traces, security findings, and costs:

# Access the dashboard
open http://localhost:3000

# Or use the REST API
curl http://localhost:8080/api/v1/traces
curl http://localhost:8080/api/v1/security/findings
curl http://localhost:8080/api/v1/costs/current

Dashboard features:

Real-time trace visualization
Security incident timeline
Cost breakdown by model/agent
Performance metrics and alerts

Configuration

Minimal Configuration

# config.yaml
upstream_url: "https://api.openai.com"
listen_addr: "0.0.0.0:8080"

storage:
  profile: "lite"  # SQLite for simple deployments

security:
  enable_prompt_injection_detection: true
  enable_pii_detection: true

Production Configuration

# config.yaml
upstream_url: "https://api.openai.com"
listen_addr: "0.0.0.0:8080"

storage:
  profile: "production"
  postgres_url: "postgresql://user:pass@localhost/llmtrace"
  clickhouse_url: "http://localhost:8123"
  redis_url: "redis://localhost:6379"

security:
  enable_prompt_injection_detection: true
  enable_pii_detection: true
  enable_streaming_analysis: true

cost_control:
  daily_budget_usd: 1000
  per_agent_daily_budget_usd: 100

alerts:
  slack:
    webhook_url: "https://hooks.slack.com/..."

rate_limiting:
  requests_per_minute: 1000
  burst_capacity: 2000

Full configuration guide ->

API Reference

Endpoint	Description
`GET /api/v1/traces`	List recent traces
`GET /api/v1/traces/{id}`	Get specific trace details
`GET /api/v1/security/findings`	List security incidents
`GET /api/v1/costs/current`	Cost breakdown and usage
`GET /health`	Health check and circuit breaker status
`POST /policies/validate`	Validate custom security policies

Full API documentation ->

Installation

Cargo (Rust Proxy)

cargo install llmtrace
llmtrace-proxy --config config.yaml

Pip (Python SDK)

pip install llmtracing

import llmtrace

tracer = llmtrace.configure({"enable_security": True})
span = tracer.start_span("chat_completion", "openai", "gpt-4")
span.set_prompt("Hello!")
span.set_response("Hi there!")
print(span.to_dict())

Docker

docker pull ghcr.io/epappas/llmtrace-proxy:latest
docker run -p 8080:8080 ghcr.io/epappas/llmtrace-proxy:latest

Docker Compose with Dependencies

curl -o compose.yaml https://raw.githubusercontent.com/epappas/llmtrace/main/compose.yaml
docker compose up -d

From Source

git clone https://github.com/epappas/llmtrace
cd llmtrace
cargo build --release --features ml
./target/release/llmtrace-proxy --config config.yaml

Kubernetes

helm install llmtrace ./deployments/helm/llmtrace

Installation guide with all methods ->

Production Deployment

High-Availability Setup

Load Balancer -> Multiple LLMTrace instances
PostgreSQL for persistent trace storage
ClickHouse for high-volume analytics
Redis for caching and rate limiting

Security Best Practices

API key validation and tenant isolation
TLS termination at load balancer
Network segmentation between components
Regular security policy updates

Monitoring & Alerting

Prometheus metrics export
Grafana dashboards
PagerDuty/Slack integration
OWASP LLM Top 10 compliance reporting

Production deployment guide ->

Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Setup

git clone https://github.com/epappas/llmtrace
cd llmtrace
cargo build --workspace
cargo test --workspace

Project Structure

Crate	Package	Purpose
`llmtrace-core`	-	Shared types and traits
`llmtrace`	crates.io	HTTP proxy server (`cargo install llmtrace`)
`llmtrace-security`	-	Security analysis engine (regex + DeBERTa + InjecGuard + PIGuard ensemble)
`llmtrace-storage`	-	Storage backends (SQLite, PostgreSQL, ClickHouse, Redis)
`llmtrace-python`	PyPI	Python SDK (`pip install llmtracing`, imports as `import llmtrace`)

Development guide ->

License

MIT - Free for commercial and personal use.

Star this repo if LLMTrace helps secure your LLM applications!

Found a bug? Open an issue

Questions? Start a discussion

Name		Name	Last commit message	Last commit date
Latest commit History 332 Commits
.cargo		.cargo
.github		.github
benchmarks		benchmarks
config		config
crates		crates
dashboard		dashboard
deployments/helm/llmtrace		deployments/helm/llmtrace
docs		docs
examples		examples
perf_test		perf_test
scripts		scripts
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
DASHBOARD_FEATURES.md		DASHBOARD_FEATURES.md
Dockerfile		Dockerfile
Justfile		Justfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
compose.yaml		compose.yaml
config.example.yaml		config.example.yaml
deny.toml		deny.toml
llmtrace.db-shm		llmtrace.db-shm
llmtrace.db-wal		llmtrace.db-wal
rustfmt.toml		rustfmt.toml

License

epappas/llmtrace

Folders and files

Latest commit

History

Repository files navigation

LLMTrace

Why LLMTrace?

Key Features

Security Performance

Quick Start

1. Install

2. Run

3. Try it with your existing code

4. See your traces

Architecture

Integration Examples

OpenAI Python SDK

OpenAI Node.js SDK

LangChain

curl

Dashboard & Monitoring

Configuration

Minimal Configuration

Production Configuration

API Reference

Installation

Cargo (Rust Proxy)

Pip (Python SDK)

Docker

Docker Compose with Dependencies

From Source

Kubernetes

Production Deployment

High-Availability Setup

Security Best Practices

Monitoring & Alerting

Contributing

Development Setup

Project Structure

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Uh oh!

Contributors 4

Uh oh!

Languages

Packages