Commit Graph

5 Commits

Author SHA1 Message Date
Дмитрий 5682926626 fix(enforce): hole 4 — triggers_matched fallback when classifier silent
Brain-retro #5 candidate C, hole 4: enforce-classifier-match.mjs main()
read only state.classification.recommended_node, which is null for
prefilter/regex classifier sources. When triggers_matched[0] contained a
recommendation, the rule was bypassed.

Added fallback: if recommended_node is null, use triggers_matched[0]. decide()
already accepts null confidence on this path (only numeric < 0.7 blocks).
2026-05-26 11:12:59 +03:00
Дмитрий a846eed9dc fix(enforce): hole 5 — tighten nodeMatches to exact/segment match
Brain-retro #5 candidate C, hole 5: nodeMatches() used free-form substring
matching (s.includes(rec) || rec.includes(s)), which matched 'meta-planning'
to a 'planning' recommendation. Tightened to exact match OR matching last
segment after ':' / '#' (skill ns / registry id).

Regression tests preserve: superpowers:writing-plans matches writing-plans,
exact-name matches keep working.
2026-05-26 11:11:29 +03:00
Дмитрий 7e5c297394 fix(enforce): hole 2 — Task/Agent count as mutating actions
Brain-retro #5 candidate C, hole 2: enforce-classifier-match.mjs's
MUTATING_TOOLS set missed Task/Agent, so delegating mutations via Task()
bypassed the rule. Added Task and Agent to the set; nodeMatches already
handles Task.subagent_type matching.

Regression test asserts Task with matching subagent_type does NOT block
(keeps the existing nodeMatches Task path intact).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-26 11:09:11 +03:00
Дмитрий ce02d1adad fix(enforce): hole 1 — remove self-override via assistant text
Brain-retro #5 candidate C, hole 1: enforce-classifier-match.mjs allowed
the agent to bypass the rule by writing 'override: <reason>' in its own
response (self-override = no enforcement). The user-vocabulary override
phrases in enforce-override-vocab.json remain the only legitimate path.

Added regression test asserting block on assistantText override when user
prompt has no override phrase.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-26 11:07:03 +03:00
Дмитрий fe338e09f9 feat(enforce): T8 — Rule #8 classifier-mismatch enforce (Stop) 2026-05-25 18:23:05 +03:00