Quickstart

Run these from the root of your Python agent repo.

Paste this into your coding agent

Run `uvx kensa init --cli --agent all`, then use the audit-evals skill and
follow the eval lifecycle.

The agent installs the CLI, scaffolds .kensa/, drops in the five skills, and runs your first eval. Works with:

Or install yourself, then ask your agent

If you want to control the install step but still let your coding agent drive the eval workflow:

uvx kensa init --cli --agent all

Then in Claude Code, Codex, Cursor, OpenCode, or Gemini CLI:

> /audit-evals

The skill captures a real run, generates scenarios, runs evals, and reports back.

Or CLI-only

If you want to skip the coding-agent loop entirely and drive kensa as a regular CLI:

uvx kensa init                                       # dev dep + bare .kensa/
kensa capture -i "<example input>" -- <your agent>   # record one real run as a trace
kensa generate                                       # synthesize scenarios from the capture
kensa eval                                           # run + judge + report

Add the SDK extra that matches your stack so traces include LLM calls and tool use:

uv add "kensa[anthropic]"   # or kensa[openai] / kensa[langchain] / kensa[all]

What “evaluate this agent” actually does

audit-evals checks setup, then routes to the right next step: capture a real run, generate scenarios, calibrate a judge, or run evals. The CLI handles execution, tracing, judging, and reporting under the hood.

Scenarios — write or generate test cases
Tracing — kensa capture and OpenTelemetry compatibility
CLI Reference — full command surface
Examples — runnable agents to point kensa at

Getting started

Reference

Workflows

Releases

Paste this into your coding agent

Or install yourself, then ask your agent

Or CLI-only

What “evaluate this agent” actually does

Next

Getting started

Reference

Workflows

Releases

Documentation Index

​Paste this into your coding agent

​Or install yourself, then ask your agent

​Or CLI-only

​What “evaluate this agent” actually does

​Next

Paste this into your coding agent

Or install yourself, then ask your agent

Or CLI-only

What “evaluate this agent” actually does

Next