d7aa5efe30
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
21 lines
697 B
Markdown
21 lines
697 B
Markdown
# promptfoo example — lead-qualification eval
|
|
|
|
A worked promptfoo eval: a HOT/WARM/COLD lead-classification prompt with three
|
|
assertion cases. Demonstrates the A11 prompt-testing workflow.
|
|
|
|
## Run
|
|
|
|
```bash
|
|
# from the repo root; ANTHROPIC_API_KEY must be set (PowerShell User scope)
|
|
npm run eval:llm
|
|
```
|
|
|
|
This makes **paid** Anthropic API calls. Run it manually or in CI only — never
|
|
in a git hook or pre-commit (A11 rule ML1). See `docs/ml/README.md`.
|
|
|
|
## Adapt
|
|
|
|
Copy `promptfooconfig.yaml` next to a real prompt when an AI feature is built.
|
|
Swap the model, add `tests`, use richer assertions (`contains`, `llm-rubric`,
|
|
cost/latency thresholds). Full reference: <https://promptfoo.dev/docs/>.
|