promptfoo example — lead-qualification eval

A worked promptfoo eval: a HOT/WARM/COLD lead-classification prompt checked by three assertion cases. It demonstrates the A11 prompt-testing workflow.
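
For orientation, a config of this shape could look roughly like the sketch below. It is not a copy of the promptfooconfig.yaml in this directory: the prompt wording, the model id, and the three sample leads are all assumptions made for illustration.

```yaml
# Hypothetical sketch; the real promptfooconfig.yaml may differ.
description: Lead qualification (HOT/WARM/COLD)

prompts:
  # {{lead}} is filled in from each test's vars.
  - |-
    Classify the sales lead below as HOT, WARM, or COLD.
    Reply with exactly one word.

    Lead: {{lead}}

providers:
  # Assumed model id; use whichever Anthropic model the project has chosen.
  - anthropic:messages:claude-3-5-sonnet-20241022

tests:
  # Three assertion cases, one per label. The lead texts are invented examples.
  - vars:
      lead: "We have budget approved and need 200 seats live next month."
    assert:
      - type: icontains
        value: HOT
  - vars:
      lead: "Interesting product. We may revisit this next year."
    assert:
      - type: icontains
        value: WARM
  - vars:
      lead: "Please take me off your mailing list."
    assert:
      - type: icontains
        value: COLD
```

The icontains check is case-insensitive, so a reply of "Hot" or "HOT." would still pass; a stricter eval might use equals instead.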

Run

# from the repo root; ANTHROPIC_API_KEY must be set (PowerShell User scope)
npm run eval:llm

This makes paid Anthropic API calls. Run it manually or in CI only, never from a git hook such as pre-commit (A11 rule ML1). See docs/ml/README.md.

Adapt

Copy promptfooconfig.yaml next to a real prompt when an AI feature is built. Then swap the model, add tests, and use richer assertions (contains, llm-rubric, cost/latency thresholds). Full reference: https://promptfoo.dev/docs/.
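
As a rough sketch of what richer assertions might look like, the fragment below combines the assertion types named above on a single test. The variable name lead, the grader model id, the rubric wording, and both thresholds are assumptions; tune them to the real prompt.

```yaml
# Hypothetical fragment for an adapted promptfooconfig.yaml.
defaultTest:
  options:
    # Assumed grader for llm-rubric; grading makes additional paid API calls.
    provider: anthropic:messages:claude-3-5-sonnet-20241022

tests:
  - vars:
      lead: "CFO asked for a contract draft by Friday."
    assert:
      # Substring match on the raw model output (case-sensitive).
      - type: contains
        value: HOT
      # Model-graded check against a plain-language rubric.
      - type: llm-rubric
        value: Classifies the lead as HOT and adds no extra explanation
      # Fail if a single call costs more than this many US dollars.
      - type: cost
        threshold: 0.002
      # Fail if the response takes longer than this many milliseconds.
      - type: latency
        threshold: 5000
```

Latency checks are only meaningful on uncached responses, so it may be necessary to run the eval with caching disabled when relying on them.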