Rhesis AI

Rhesis: Collaborative Testing for LLM & Agentic Applications

Website · Docs · Discord · Changelog

More than just evals.
Collaborative agent testing for teams.

Generate tests from requirements, simulate conversation flows, detect adversarial behaviors, evaluate with 60+ metrics, and trace failures with OpenTelemetry. Engineers and domain experts, working together.

About Rhesis AI

Built by developers who needed better LLM testing tools

We built Rhesis because existing LLM testing tools didn't meet our needs for testing agentic applications. If you face the same challenges, contributions are welcome.

Collaborative testing for cross-functional teams

Testing shouldn't be limited to engineers. Legal teams understand compliance requirements. Marketing knows brand guidelines. Domain experts identify edge cases. Rhesis enables everyone to contribute their expertise without writing code.

From requirements to automated test execution

Define requirements in plain language. Rhesis generates test scenarios based on your team's collective knowledge. Execute tests automatically via UI, SDK, or CI/CD. Get detailed results showing exactly how your LLM & agentic applications perform.

Open source with a clear license model

MIT licensed. Enterprise version lives in ee/ folders and remain separate.

Get started

Check out our main repository and documentation to get started.

Quick start options:

Cloud - app.rhesis.ai - Managed service, just connect your app
Self-hosted - Run locally with Docker in 5 minutes
Python SDK - Integrate directly into your codebase

Made with in Potsdam, Germany 🇩🇪

Learn more at rhesis.ai

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rhesis AI

Rhesis: Collaborative Testing for LLM & Agentic Applications

More than just evals.
Collaborative agent testing for teams.

About Rhesis AI

Built by developers who needed better LLM testing tools

Collaborative testing for cross-functional teams

From requirements to automated test execution

Open source with a clear license model

Get started

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!

Rhesis: Collaborative Testing for LLM & Agentic Applications

More than just evals.Collaborative agent testing for teams.

About Rhesis AI

Built by developers who needed better LLM testing tools

Collaborative testing for cross-functional teams

From requirements to automated test execution

Open source with a clear license model

Get started

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!

More than just evals.
Collaborative agent testing for teams.