# promptfoo example — lead-qualification eval A worked promptfoo eval: a HOT/WARM/COLD lead-classification prompt with three assertion cases. Demonstrates the A11 prompt-testing workflow. ## Run ```bash # from the repo root; ANTHROPIC_API_KEY must be set (PowerShell User scope) npm run eval:llm ``` This makes **paid** Anthropic API calls. Run it manually or in CI only — never in a git hook or pre-commit (A11 rule ML1). See `docs/ml/README.md`. ## Adapt Copy `promptfooconfig.yaml` next to a real prompt when an AI feature is built. Swap the model, add `tests`, use richer assertions (`contains`, `llm-rubric`, cost/latency thresholds). Full reference: .