d7aa5efe30
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
promptfoo example — lead-qualification eval
A worked promptfoo eval: a HOT/WARM/COLD lead-classification prompt with three assertion cases. Demonstrates the A11 prompt-testing workflow.
Run
# from the repo root; ANTHROPIC_API_KEY must be set (PowerShell User scope)
npm run eval:llm
This makes paid Anthropic API calls. Run it manually or in CI only — never
in a git hook or pre-commit (A11 rule ML1). See docs/ml/README.md.
Adapt
Copy promptfooconfig.yaml next to a real prompt when an AI feature is built.
Swap the model, add tests, use richer assertions (contains, llm-rubric,
cost/latency thresholds). Full reference: https://promptfoo.dev/docs/.