Docs / v0.1 reference
Spec, schemas, adapters.
Everything needed to author scenarios, run trials against any provider, and produce signed scorecards a buyer can verify independently.
Quickstart
Clone the repo, validate a scenario, and run one provider against it. Five minutes, zero infrastructure.
Then execute a single run:
When you are ready, sign the scorecard:
Installation
`v0.1` ships schema validation and signing. Reference runner hardening is in progress.
From source:
Core concepts
Four primitives power the system:
- Scenario: declarative test case with checks and environment
- Adapter: provider-specific runtime bridge
- Trial: one run of one scenario against one adapter
- Scorecard: signed aggregate output
Scenario schema
A scenario is a YAML document with six top-level fields:
Example:
Scorecard schema
Scorecards are JSON outputs for one `scenario@version` against one provider, with deterministic grading.
Adapter contract
Adapters are the only provider-specific layer and implement five methods.
Running evals
Single run:
Campaign matrix:
Attestation
Attestation signs canonicalized scorecard JSON with domain separation:
Verify end-to-end:
CLI reference
- `voiceattest validate <path>` validates schema
- `voiceattest run` executes one scenario/provider trial set
- `voiceattest campaign <cfg>` runs matrix campaigns
- `voiceattest conform --adapter <name>` checks adapter contract
- `voiceattest attest` signs a scorecard
- `voiceattest verify` verifies signed output + log proof
Available adapters
- Orion (reference)
- Mistral Echo (reference)
- Vapi, Retell, Bland (community beta)
- Deepgram Voice, ElevenLabs Convo, Pipecat custom (planned)
Versioning & compatibility
Spec, runner, and scenarios version independently. Strict scorecard comparison requires:
- Matching `scenario_sha256`
- Same spec major version
- Same evaluator trust context
Contributing
Most valuable contributions right now:
- New scenarios with passing example scorecards
- New adapters that pass conformance suite
- Transparency log witnesses
Continue with platform overview or return to early access.