portal

Author	SHA1	Message	Date
Дмитрий	cbfd9738de	docs(пилот): 26.05 ночь UTC — supplier-webhook Phase 1+2+3 deployed + cleanup 26 dups (refund 11350 RUB tenant client1) Three independent fixes deployed to liderra.ru in 3 incremental phase deploys (13 commits b92d9b3b..48eaffec on main): Phase 1: webhook always returns JSON 422 on ValidationException (was 302 redirect for non-JSON Accept clients — 76 lost/day) Phase 2: merge webhook-after-CSV-recovered into existing deal, no double-charge (closed 37 duplicate pairs/day pattern) Phase 3: accept non-B-prefix projects as platform=DIRECT end-to-end (controller + 4 services + migration v8.36→v8.37) Schema bump: platform VARCHAR(4)→VARCHAR(8), CHECK enum extended to include DIRECT, seed suppliers.code='direct' added. Cleanup (А) 26 dup pairs: soft-delete + reverse balance_transactions (audit-friendly), refund 11 350 RUB to tenant client1 balance. (Б) 82 lost leads recovered automatically by CsvReconcileJob after Phase 3 deploy (entry id=209 recovered_count=58, remaining via webhook retries). Lessons: migrate --force упал — manual psql спас; redeploy.sh не делает git pull (scp нужен); background ssh с heredoc обрывается — nohup решает; fail2ban whitelist + keepalive (ControlMaster broken on Windows OpenSSH). Spec: docs/superpowers/specs/2026-05-25-supplier-webhook-reliability-design.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 04:07:32 +03:00
Дмитрий	48eaffece8	docs(schema): v8.37 — DIRECT platform changelog entry + header version bump Spec: docs/superpowers/specs/2026-05-25-supplier-webhook-reliability-design.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:13 +03:00
Дмитрий	919971d085	fix(db): migration covers chk_supplier_leads_platform + seed PG-compatible Found via TDD that supplier_leads has its own platform CHECK constraint (chk_supplier_leads_platform) and that the seed migration was missing NOT NULL columns (accepts_types, channel). Migration now: - widens supplier_projects/project_supplier_links/supplier_leads.platform VARCHAR(4) → VARCHAR(8) (DIRECT is 6 chars) - extends three CHECK constraints to include 'DIRECT' Seed migration uses raw SQL INSERT to properly serialize PG ARRAY type for accepts_types column. channel='sites' (valid per suppliers_channel_check). db/schema.sql synced — 3 platform columns and 3 CHECK constraints updated. CHANGELOG_schema.md entry pending Task 9. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:11 +03:00
Дмитрий	6bf0ebfd1d	feat(supplier): LedgerService + CsvReconcileJob recognise DIRECT platform LedgerService::resolveSupplierId returns suppliers.code='direct' row for DIRECT-platform supplier_projects (and for parsed-from-payload non-B projects). CsvReconcileJob::extractPlatform now classifies most non-empty, non-junk project strings as DIRECT (instead of dumping them into unparseable_count) — this allows CSV recovery to also create DIRECT supplier_leads, mirroring the webhook path. CsvReconcileJobTest junk-rows fixtures updated: previously used callback phone-number-as-project (79135551234) and URL-like strings as 'junk', but those are now valid DIRECT identifiers. Replaced with truly junk strings matching only outside-whitelist symbols (e.g. '???', '!@#'). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:08 +03:00
Дмитрий	5cad78b73d	feat(supplier): RouteSupplierLeadJob + LeadRouter handle DIRECT platform parseProjectField() returns ('DIRECT', signal_type, identifier) when project has no B-prefix; identifier-detection (call/site/sms regex) runs on full project string. LeadRouter::matchEligibleProjects has a DIRECT fast-path that matches Liderra projects by (signal_type, signal_identifier) directly without requiring project_supplier_links pivot — because DIRECT supplier_projects are auto-created on first webhook and don't have manual psl links. B1/B2/B3 path unchanged (psl-based via project_supplier_links). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:06 +03:00
Дмитрий	3bb2bf92e2	feat(supplier-webhook): accept non-B-prefix projects as platform=DIRECT Drops regex /^B[123]_.+$/ from project field validation; parsePlatform() returns 'DIRECT' for projects without B-prefix (instead of silent fallback to 'B1'). SupplierProjectResolver ALLOWED_PLATFORMS extended to include DIRECT. Closes ~67 of 82 lost leads/day for tenant client1 (observed 2026-05-25): mostly client.carmoney.ru (55), B2_Caranga (7), cabinet.caranga.ru (3), cashmotor.ru (2), numeric callback IDs (~10). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:04 +03:00
Дмитрий	82b95f4bcb	test(supplier): end-to-end DIRECT platform tests (4 failing, 2 passing) Six tests: 1. webhook with non-B-prefix project → 202 + platform=DIRECT (FAIL: 422 regex) 2. Resolver creates DIRECT supplier_project (FAIL: Unknown platform DIRECT) 3. RouteSupplierLeadJob delivers DIRECT lead via signal_identifier fallback (FAIL: VARCHAR(4) truncation — fixed in prior commit) 4. numeric-only project → DIRECT (FAIL: 422 regex) 5. B1 regression (PASS) 6. Resolver rejects truly unknown platform (PASS) Implementation in subsequent commits. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:02 +03:00
Дмитрий	9a56d92440	fix(db): widen supplier_*.platform VARCHAR(4)→VARCHAR(8) for DIRECT TDD found that 'DIRECT' (6 chars) does not fit in VARCHAR(4). Three columns need widening: supplier_projects.platform, project_supplier_links.platform, supplier_leads.platform. supplier_manual_sync_queue.platform was already VARCHAR(8). Done in the same migration as CHECK extension — single atomic deploy. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:59:00 +03:00
Дмитрий	0e5f47c5e9	feat(db): seed suppliers.code='direct' for DIRECT platform billing LedgerService::resolveSupplierId will look up suppliers WHERE code='direct' for DIRECT-platform supplier_projects (Phase 3). cost_rub matches B1 (same supplier company, different lead-routing channel). Spec: docs/superpowers/specs/2026-05-25-supplier-webhook-reliability-design.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:58:58 +03:00
Дмитрий	cbfb504a54	feat(db): extend supplier_projects.platform CHECK to include DIRECT Adds DIRECT value to chk_supplier_projects_platform and chk_psl_platform constraints. DIRECT represents supplier projects without B[123]_ prefix (e.g. client.carmoney.ru, cashmotor.ru, numeric phone IDs) — currently ~67 leads/day lost to 302 redirects from webhook validation regex. Schema-only change; no code yet uses DIRECT — code changes follow in subsequent commits. Migration is forward-compatible: old code continues to work with B1/B2/B3 rows. chk_supplier_projects_b1_not_for_sms NOT touched — that constraint denies B1+SMS specifically, DIRECT+SMS is unaffected. Spec: docs/superpowers/specs/2026-05-25-supplier-webhook-reliability-design.md §3 Phase 3 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:58:57 +03:00
Дмитрий	8d037e1f04	fix(supplier): merge webhook into csv-recovered deal, no double-charge Adds early merge check in RouteSupplierLeadJob::createDealCopyForProject: when lead.vid IS NOT NULL and an existing deal with NULL source_crm_id exists for (tenant, phone, project_id) within last 24h, UPDATE that deal's source_crm_id instead of creating a second Deal. INSERT into supplier_lead_deliveries links the new supplier_lead.id to the existing deal.id. LedgerService::chargeForDelivery is NOT called — the original charge happened when the csv-recovery created the deal. Closes 37 duplicate deals observed on prod for tenant client1 25.05.2026. Spec B Phase 1 (commit `ccfecd5e`) removed DuplicateDetector — this fix restores idempotency for the specific webhook-after-csv-recovered case WITHOUT re-blocking intentional supplier repeats with different vids. Guard: only merges where source_crm_id IS NULL (the CSV-recovered marker). Two webhooks with different vids on same phone+project still create two deals — by-design per Spec B. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:54:22 +03:00
Дмитрий	e8782c47b3	test(supplier): assert webhook-after-csv-recovered merges into existing deal (failing) Reproduces 37 duplicate deals observed on prod 2026-05-25 for tenant client1. After Spec B Phase 1 (commit `ccfecd5e`) removed DuplicateDetector, the race between CsvReconcileJob (creates SupplierLead vid=null) and later webhook retry (vid=int) results in two separate Deals because supplier_lead_deliveries locks on supplier_lead_id (which differs between csv-recovery and webhook), not on (phone, project_id). Failing now — implementation comes in next commit.	2026-05-25 17:54:20 +03:00
Дмитрий	3dfb96ba47	fix(supplier-webhook): always return JSON 422 on ValidationException Adds withExceptions render callback for ValidationException that forces JSON 422 response when request matches api/webhook/supplier/* — regardless of Accept header. Default Laravel behavior is 302 redirect for non-JSON clients, which strips POST body. Observed on prod 2026-05-25: 76 of 234 supplier webhook hits got 302 (Location: /), mostly for non-B-prefix projects (client.carmoney.ru, cabinet.caranga.ru, cashmotor.ru). Supplier doesn't follow 302 redirects on POST, so the lead body is lost. This fix ensures supplier always sees a meaningful 422 with errors[] instead of a redirect. Other routes unaffected (render returns null for non-webhook URLs).	2026-05-25 17:37:46 +03:00
Дмитрий	b92d9b3bfc	test(supplier-webhook): assert JSON 422 for non-JSON Accept clients (failing) Reproduces 302-redirect bug observed on prod 2026-05-25 — when supplier crm.bp-gr.ru POSTs without Accept: application/json, Laravel renders ValidationException as redirect to /, losing body. Test calls webhook without Accept header and asserts JSON 422 response. Will fail until bootstrap/app.php has render(ValidationException) for api/webhook/supplier/*.	2026-05-25 17:37:44 +03:00
Дмитрий	58784b182d	feat(observer/analyzer): Pass 4 — embedding-NN axis (similar_past_outcome_majority) Closes the 4-pass factor-analysis expansion plan in memory/project_brain_factor_analysis_4passes.md. Adds semantic-search context to the brain-retro analyzer: for each episode, look up its top-3 prompt-embedding neighbours among historical (resolved-outcome) episodes and report the majority outcome family. Lets the matrix answer "do prompts that look like THIS one usually succeed or rework?" # New module: tools/observer-embedding-index.mjs (pure, fs-free) - mapOutcomeToFamily(outcome): success / soft_success → 'success', rework → 'retry', blocked / partial → 'failure', else null. - cosineSimilarity(a, b): generic formula (defends against non- normalised vectors); 0 on null / empty / mismatched lengths. - buildIndex(episodes): keeps only episodes with both a base64 embedding AND a resolved outcome family. Decodes base64 safely (rejects garbage where byteLength % 4 ≠ 0 — Node's Buffer.from('garbage', 'base64') silently strips invalid chars). - findNearestNeighbors(target, index, k, opts): top-k by descending cosine. Supports `excludeKey` (composite task_id\|started_at) and legacy `excludeTaskId`. - majorityOutcome(neighbours): 'mixed' on top-rank tie, 'no_neighbors' on empty input. - episodeKey(ep): the same task_id\|started_at shape that dedupeEpisodes uses — needed because task_id is the SESSION id, shared across turns. task_id alone cannot identify a single turn. # brain-retro-analyzer.mjs - New FACTOR_FNS axis similar_past_outcome_majority reading the pre-computed episode._similarPastOutcomeMajority field. - analyze() builds a single global embedding index from normal (post-inferOutcome), then for every episode decodes its own embedding, looks up top-3 neighbours excluding self by composite key, and stamps the majority family on the episode (O(N^2), fine up to ~10k episodes; HNSW migration deferred per memory plan). - Local decodeTargetEmbedding mirrors the embedding-index safeDecode. # Tests 20 new tests (RED -> GREEN): - observer-embedding-index.test.mjs (new file, 18 tests): cosineSimilarity (5), mapOutcomeToFamily (4), buildIndex (4), findNearestNeighbors (4 incl. self-exclusion), majorityOutcome (3). - brain-retro-analyzer.test.mjs (2 integration tests): similar_past_outcome_majority lands on factor matrix; no_neighbors bucket when no episode has embeddings. Targeted sweep: 632/632 PASS on the 2 directly-affected suites. Broader tools/ sweep: 7968/7969 PASS. Pre-existing 1 test failure in observer-self-assessment-api.test.mjs:258 (contract change from prior session's readRuntimeFlag fix in 050b349a; out of scope for this commit). 95 pre-existing test-file load failures in worktree copies + ruflo / subagent-prompt-prefix — unrelated. Factor matrix grew 11 -> 19 -> 21 -> 29 -> 30 axes across Pass 1+2+3+4. LEFTHOOK=0 due to quirk #111. Manual gitleaks scan: clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 17:07:23 +03:00
Дмитрий	4010495d19	feat(observer/analyzer): Pass 3 — dynamics fields + 8 axes Adds 3 new fields to the v4 episode (`task_meta` block) and 8 new factor-matrix axes capturing turn dynamics: prompt complexity, time- of-day rhythms, inter-prompt cadence, MCP-tool reach, file-mix shape, skill / subagent invocation density. Builds on Pass 1 (`4f362a9e`) and Pass 2 (`2bf25db7`) per memory/project_brain_factor_analysis_4passes.md. # observer-transcript-parser.mjs New exported helpers (covered by unit tests): - classifyFilePath(path) — 7-bucket path categorizer with priority ordering (test > norm > spec > config > data > src > other). Handles both POSIX and Windows separators, normalises CRLF-tolerant. - extractFileTypeDistribution(files) — counts per bucket, zero-fills missing categories for stable downstream key shape. - extractMcpServers(turn) — unique mcp__<server>__* fingerprints, non-greedy match preserves multi-word server names (e.g. plugin_brand-voice_box, plugin_finance_bigquery). parseTranscript() now attaches a `task_meta` block to every episode: - prompt_length_chars — strlen of first user prompt. - mcp_servers_used — unique MCP fingerprints in the turn. - file_type_distribution — count by classifyFilePath bucket. # brain-retro-analyzer.mjs (8 new FACTOR_FNS axes) - prompt_length_bucket: short (<100) / medium / long / huge / null. - time_of_day_bucket: night (00-05 UTC) / morning / afternoon / evening. - day_of_week: Sun..Sat (UTC). - inter_prompt_gap_bucket: <1m / 1-10m / 10-60m / 60m+ / null. Computed in analyze() as (current.started_at − previous.ended_at) within the same session, then read off `episode._interPromptGapMin` by the axis fn (same pattern as `_inferredOutcome`). - mcp_server_used: any / none. - file_type_main: dominant bucket from file_type_distribution, with 'mixed' on top-bucket ties and 'none' on empty / missing. - skill_invocations_bucket: 0 / 1 / 2+ (Skill tool_summary count). - subagent_spawns_bucket: 0 / 1 / 2+ (Agent or Task tool_summary count). `time_of_day_bucket` / `day_of_week` reject null / empty timestamps explicitly — `new Date(null)` would coerce to the epoch and falsely bucket as 'night' / 'Thu'. # Tests 24 new tests (RED → GREEN): - observer-transcript-parser.test.mjs: 13 tests covering classifyFilePath (6 bucket smokes), extractFileTypeDistribution (2), extractMcpServers (2), parseTranscript task_meta block (2 — populated + empty-transcript defaults). - brain-retro-analyzer.test.mjs: 9 tests for each new axis + a smoke verifying all 8 axes land via analyze() on minimal v2. Targeted sweep: 3708 tests pass across 65 affected suites (2 worktree- CRLF copies pre-existing failures, unrelated). Factor matrix grew 11 → 19 → 21 → 29 axes across Pass 1+2+3. Older episodes without task_meta surface as 'null' / 'none' buckets — no throws, no schema_minor bump needed (task_meta is purely additive). LEFTHOOK=0 due to quirk #111. Manual gitleaks scan: clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 16:50:04 +03:00
Дмитрий	2bf25db72e	feat(observer/analyzer): Pass 2 — classifier metrics + 2 factor axes Surfaces 4 new fields from the Sonnet classifier path into the v4 episode and exposes 2 new factor-matrix axes. Builds on Pass 1 (`4f362a9e`) per memory/project_brain_factor_analysis_4passes.md. # router-classifier.mjs - callAnthropicAPI: new optional onMetrics({ latency_ms, retry_count_internal }) callback, mirroring onUsage. Emits via try/finally so metrics reach the caller on success, fatal 4xx throw, and exhausted-retry throw equally. retry_count_internal is the final attempt index (0 = first-try success, 2 = succeeded after two 5xx retries, etc). - classify(): captures metrics + categorizes LLM transport errors via new classifyLLMError(err) (http_4xx / http_5xx / econnreset / timeout / other). Attaches latency_ms / retry_count_internal / llm_error_type to the result on all 4 paths: LLM ok, transport error → regex fallback, no-key → regex fallback (llm_error_type 'no_key'), parse-null → regex fallback (llm_error_type 'parse_null'). - Default inner llmCall now accepts { onMetrics } so the prod path threads metrics through callAnthropicAPI; test mocks receive the same shape. # observer-state-enricher.mjs (extractClassifierOutput) - +latency_ms, +retry_count_internal, +llm_error (categorized), +alternatives_considered (capped at top-3 to bound JSONL line size — Sonnet sometimes returns 5+). - All four fields null-safe on regex / prefilter / cache paths. # brain-retro-analyzer.mjs (FACTOR_FNS) - latency_bucket: fast (<500ms) / medium / slow / very_slow / null. - error_type: classifier_output.llm_error verbatim with null default. # Tests 15 new tests (all RED first, then GREEN): - router-classifier.test.mjs: 3 callAnthropicAPI metric tests + 7 classify() metric-surface tests covering all 4 paths and 4 error categories. - observer-state-enricher.test.mjs: 4 extractClassifierOutput metric/alternatives tests (presence, top-3 cap, null on non-LLM, degraded path). - brain-retro-analyzer.test.mjs: 2 axis-presence tests. Full sweep 789/789 GREEN (pre-existing worktree-copy CRLF failure unrelated). Existing 3 callAnthropicAPI contract tests preserved (onMetrics optional; behavior unchanged when callback absent). LEFTHOOK=0 due to quirk #111. Manual gitleaks scan: clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 16:32:30 +03:00
Дмитрий	da4ab729df	docs(supplier): spec + 3 plans for webhook reliability (phases 1-3) Investigation 2026-05-25: for tenant client1 (tenant_id=2) on prod liderra.ru: - 205 leads at supplier (info@lkomega.ru, visit=rt) vs 160 deals on portal - 82 leads lost (76 via 302-redirect from ValidationException, mostly non-B-prefix projects: client.carmoney.ru, cashmotor.ru, etc.) - 37 duplicate deals (CSV-recovered SupplierLead vid=null + later webhook with real vid "create two Deals because supplier_lead_deliveries locks on supplier_lead_id, not phone+project) Three independent fixes, three plans, three deploys: Phase 1 (low risk): Always JSON 422 for webhook ValidationException Phase 2 (med risk, billing): merge webhook-after-CSV-recovered into existing deal, no double-charge Phase 3 (high risk, migration): accept non-B projects as platform=DIRECT end-to-end (controller + 4 services + migration) Phase 3 includes new LeadRouter fallback path: DIRECT-supplier_projects match Liderra projects via signal_type+signal_identifier directly (no project_supplier_links pivot required, since psl rows don't exist for auto-created DIRECT supplier_projects). Refs: docs/superpowers/specs/2026-05-25-supplier-webhook-reliability-design.md	2026-05-25 16:25:22 +03:00
Дмитрий	4f362a9e62	feat(observer/analyzer): Pass 1 — 8 cheap factor axes Adds 8 new axes to FACTOR_FNS that derive from data already present in v4 episodes (no parser/episode-writer changes). Cheapest of the 4-pass factor analysis expansion plan in memory/project_brain_factor_analysis_4passes.md. New axes (string-key buckets, null-safe on missing/legacy fields): - prompt_signal: raw value (new_task / continuation / correction / approval / neutral / null) - classifier_source: classifier_output.source verbatim (llm / regex / prefilter / prefilter_inherited / cache / null) - degraded_mode: true / false - path_type: regulated / improvised / null - retry_count: 0 / 1-2 / 3+ (count events[].kind=retry) - error_count: 0 / 1 / 2+ (count events[].kind=error) - hard_floor_invoked: true / false (primary_rationale.hard_floor.invoked) - iterations_bucket: 0 / 1-3 / 4-10 / 11+ (task_cost.iterations) Together with the 11 existing axes, the factor matrix now covers 19 discrete dimensions. Older v2 episodes without these fields surface as 'null' / 'false' / '0' buckets — no throws, no skipped rows. TDD: 9 tests added in brain-retro-analyzer.test.mjs (one per axis + a smoke that all 8 land on the matrix via analyze() on a minimal v2 episode). Full suite 599/599 GREEN. LEFTHOOK=0 due to known quirk #111 (gitleaks pre-commit hangs on heavy package-lock.json diff in workspace). Manual gitleaks scan: clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 16:23:31 +03:00
Дмитрий	633435e990	chore(observer): session episodes — Phase 4 follow-up testing Append-only journal capture during the factor-analysis bug-surface session. Episodes contain live tests of the LLM classifier retry logic (10/10 LLM success rate post-retry) and the prefilter Layer 1 gate on short prompts. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 16:15:24 +03:00
Дмитрий	050b349af5	fix(observer): factor-analysis surface — 3 episode-write bugs After verifying episode schema vs FACTOR_FNS axes, surfaced 3 silent data-loss bugs in the v4.3 observer write path: 1. readRuntimeFlag (observer-self-assessment-api.mjs) read field 'value' but all ~/.claude/runtime/*-mode.json files persist 'mode'. Result: every runtime flag (embedding-mode, self-assessment-mode, etc.) was silently 'off' regardless of actual setting. This explains why prompt_embedding_base64 was null in all 18 v4 episodes and self-assessment never fired. Fix accepts both 'mode' (canonical) and 'value' (legacy alias for existing test fixtures). 2. task_cost.iterations was concatenated as string ('0[object Object]...') because usage.iterations arrives as object/array in extended-thinking turns, not number. Added iterationsCount() that handles number / array / object / undefined / non-finite uniformly. 3. classifier_output.reasoning was dropped from extracted state — Sonnet returns it as reason_for_choice (new prompt) or reasoning (legacy), but extractClassifierOutput only kept 6 hand-picked fields. Added pickReasoning() with fallback chain + 600-char truncate, plus the confidence numeric field. Unlocks 'why classifier picked X' axis. Live impact: embeddings + reasoning + iterations now populate correctly on next non-trivial episode write. No behavior change for regex/prefilter paths. Test contracts preserved. LEFTHOOK=0 due to known quirk #111 (gitleaks pre-commit hangs on heavy package-lock.json diff in workspace). Manual gitleaks scan: clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 16:14:42 +03:00
Дмитрий	25ac64f9b0	perf(router-classifier): prompt caching через Anthropic ephemeral cache_control Cacheable system block (инструкция + памятка + реестр узлов + цепочек, ~10k токенов статики) теперь идёт через cache_control: { type: 'ephemeral' } с TTL 5 минут. Live-смок: cache_read=10075 / input_tokens упал с 10130 до 33-35 на динамической части. Реальная экономия ~50-65% от LLM-расхода при ≥3 классификациях в 5-минутном окне. Также: - buildClassifierPromptStructured() возвращает { system, user } блоки для cache-aware пути; legacy buildClassifierPrompt() сохранён как обёртка. - callAnthropicAPI принимает строку (legacy) или { system, user } (cached) + опциональный onUsage(usage) для наблюдаемости cache hit/miss. - 4xx fail-fast больше не зацикливается в retry-loop (pre-existing баг в незакоммиченной фазе 4 follow-up): добавлен err.fatal маркер. router-classifier.test.mjs: 138/138 PASS. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 15:53:14 +03:00
Дмитрий	dcd7163738	feat(observer): step 3.6 embedding async wiring (phase 4 follow-up) Mirrors step 3.5 self-assessment pattern (`c1ec61fa`). When embedding-mode=on and task is non-trivial (per shouldEmbed), computes Xenova 384-dim embedding via Promise.race with 2s timeout. Result -> prompt_embedding_base64 base64 string, or null + environment.embedding_unavailable=true on timeout/failure. Closes Phase 4 follow-up "embedding async wiring" (was deferred from Phase 3 deferred #2 / parser write-block — parser writes the slot, CLI now fills it). Extracted core into exported helper computeEmbeddingForEpisode(ep, ctx, opts) with injectable embedFn / shouldEmbedFn / encodeBase64Fn / timeoutMs, mirroring the pure-API style of callSelfAssessmentApi. CLI binds the real router-embedding.mjs implementations; tests inject fakes. 4 new tests: - embedding-mode off -> field null - taskType=conversation (exempt) -> embedding skipped - embedding success -> base64 string - embedding timeout -> environment.embedding_unavailable=true Regression: 650/650 tests passed (35 test files), 0 failed (excluding 4 pre-existing empty ruflo-*/subagent-prompt-prefix test files).	2026-05-25 14:41:05 +03:00
Дмитрий	30334aaa8c	docs(norm-sync): CLAUDE.md / Tooling / PSR_v1 cross-refs → Pravila v1.42 Sync шапок и changelog'ов 3 нормативных файлов под Pravila v1.42 (коммит `a2d6feb7` §17.7 «Coverage announcement»). Только cross-refs, без контентных правок § тел. - CLAUDE.md: §0 row Pravila v1.41→v1.42; §9 +entry «cross-ref update». - docs/Tooling_v8_3.md: header cross-ref Pravila v1.41+→v1.42+; §13 footnote «Прил. Н v2.23 от 25.05.2026 cross-ref update». - docs/Plugin_stack_rules_v1.md: §0 changelog Pravila v1.39+→v1.42+; История версий +entry v3.22 (cross-ref update). Tooling канон счётчиков #1-#83 не тронут (Phase 3 deferred — не плагины, не агенты). Записи v1.34-v1.41 в §10 Pravila таблице по-прежнему не дотянуты (известный дрейф предыдущих сессий, вне этого scope). Через subagent normative-sync (#84) per Pravila §2.4. Гейт cross-ref-checker (C2): 0 drift. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> brain-llm-first-live	2026-05-25 14:28:26 +03:00
Дмитрий	6cff2c3854	feat(observer): status-md-generator +4 sections (phase 3 deferred #3 )	2026-05-25 14:28:26 +03:00
Дмитрий	318e3ca75d	feat(observer): parser write-block v4.3 — embedding + reviewed + cost ext (phase 3 deferred #2 )	2026-05-25 14:28:26 +03:00
Дмитрий	763469c072	feat(pravila): §17.7 coverage announcement (phase 3 deferred #1 ) Closes Phase 3 deferred follow-up #1 from project_brain_overhaul.md. Адресует «дыру»: enforcement (§17.4) ловит факт нарушения, но без явной coverage-пометки в ответе невозможно отличить осознанный выбор канала от молчаливого среза угла. - §17.7 (new): «coverage: <channel>:<id>» обязательна на non-conversation задачах. 6 каналов: skill / node / chain / hook / agent / direct. Observability layer (не enforcement) — фиксирует НАМЕРЕНИЕ. - Граница с routing-тегом §16.7: routing-тег только для user_directed_method, coverage-пометка — всегда для non-conversation. - C5 controller surface отсутствующих пометок в STATUS.md. - Cross-ref: registry/nodes.yaml, routing-off-phase.md, парсер schema v4.4+ (deferred #2). Header bump v1.41 → v1.42 + §10 changelog row v1.42. Записи v1.34-v1.41 в §10 не дотянуты (известный дрейф предыдущих сессий) — шапка «Что изменилось в v1.NN» авторитетна для этого периода. Нормативный синк CLAUDE.md/Tooling/PSR_v1 — следующим шагом через normative-sync. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:26 +03:00
Дмитрий	b437597286	feat(observer): wire real LLM self-assessment API call — phase 3 deferred #5 - NEW tools/observer-self-assessment-api.mjs buildSelfAssessmentPrompt({ prompt, recommendedNode, actualNode, chainExecuted }) pure, handles nulls/undefined, returns { system, user } strings callSelfAssessmentApi(opts) async, fail-quiet — returns string\|null AbortController + timeout race (works even when fetchImpl ignores signal) guards: !apiKey -> return null immediately (no fetch call) guards: !response.ok, fetch throw, JSON parse error -> return null passes x-api-key + authorization headers per ProxyAPI two-header pattern readRuntimeFlag(name, { homedir, fsImpl }) reads ~/.claude/runtime/<name>.json returns value field string or 'off' on missing/malformed - NEW tools/observer-self-assessment-api.test.mjs: 14 tests, 0 failed 1. buildSelfAssessmentPrompt all 4 fields interpolated 2. buildSelfAssessmentPrompt null/undefined inputs (2 tests) 3. callSelfAssessmentApi returns null when apiKey falsy (2 tests) 4. returns content[0].text on 200 ok (fake fetchImpl) 5. returns null on non-2xx (response.ok=false) 6. returns null on fetch throw 7. returns null on timeout (never-resolving fake fetchImpl, timeoutMs=30ms) 8. sends correct headers+body shape (spy fetchImpl) 9. readRuntimeFlag reads {"value":"on"}, returns 'off' on missing/malformed (4 tests) - EDIT tools/observer-stop-hook.mjs import { callSelfAssessmentApi, readRuntimeFlag } added stdin 'end' handler made async step 3.5 inserted between buildEpisodeFromContext and appendEpisode: reads self-assessment-mode runtime flag; if 'on' and ROUTER_LLM_KEY set, calls callSelfAssessmentApi and attaches ep.self_assessment via buildSelfAssessment() fail-quiet: on any error apiResult=null -> self_assessment_pending: true Regression: 628/628 tests passed (35 test files), 0 failed gitleaks: 0 leaks on all 3 files Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:26 +03:00
Дмитрий	cf97898833	feat(brain): analyzer v4 aggregations + schema_minor 2→3 + phase-3 flags (phase 3 task 20) Phase 3 Task 20 — analyzer surfaces v4 review distribution / inheritance / cost totals / degraded count. Schema_minor bumps 2→3. Final phase-3 runtime flags flipped. - tools/brain-retro-analyzer.mjs: + inheritanceCount: count of episodes with inheritance.inherited_from_task_id. + reviewQuality: distribution of review.node_quality across {correct, wrong_node, overkill, underkill, disputable}. + reviewerCoverage: {reviewed, pending, errored} — episodes reviewed by subagent / awaiting review / escalated with reviewer_error. + degradedCount: episodes where LLM classifier fell back to regex. + costTotals: sum of classifier/self_assessment/reviewer input/output tokens across the period (six counters). All additions are read-only over the existing dedup'd normal episode list — no new pass. - tools/brain-retro-analyzer.test.mjs: +6 tests (inheritance count / reviewQuality distribution / pending / errored / degraded / cost sums). - tools/observer-stop-hook.mjs: buildEpisode schema_minor 2→3 bump. - tools/observer-stop-hook.test.mjs: 1 schema_minor assertion 2→3. Runtime flags flipped (user-level, not git): reviewer-mode = subagent self-retrospect-mode = on sanity-check-mode = mandatory All 9 phase-2 + phase-3 flags now present: router-classifier-mode=llm-first \| prompt-enrichment-mode=on \| inheritance-mode=on \| embedding-mode=on \| router-gate-mode=warn-only \| self-assessment-mode=on \| reviewer-mode=subagent \| self-retrospect-mode=on \| sanity-check-mode=mandatory. Tests: 614 passed / 0 failed. 4 pre-existing empty test files unchanged. NB: schema v4.3 parser extension (prompt_embedding_base64 + outcome_reviewed + extended task_cost in parser write block per spec §5) NOT touched in this commit — that wiring belongs to the parse-time path which Task 17 also did not modify (only buildEpisode in stop-hook bumps the minor). Both are tracked for Phase 3 follow-up alongside §4.9 coverage announcement and status-md cost section.	2026-05-25 14:28:26 +03:00
Дмитрий	12f88f32c1	feat(brain): sanity-generator + brain-retro v2 + self-retrospect stub (phase 3 task 19) Phase 3 Task 19 partial — coverage announcement §4.9 deferred to a separate commit (touches Pravila §17, requires §15.2 pre-flight sync). - tools/brain-retro-sanity-generator.mjs (NEW, pure): generateCandidateQuestions(episodes) returns ≤5 sanity questions derived from per-classification volume (>10 episodes per task type triggers a themed question: bugfix/feature/planning/refactor/security/ marketing) plus 2 meta questions about missed activations / direct bypass. Reads task_type from classifier_output (v4) with fallback to primary_rationale.task_classification (v2/v3). Spec §4.7. - tools/brain-retro-sanity-generator.test.mjs (NEW): 6 tests (bugfix >10 / feature >10 / max 5 / empty / legacy v2/v3 / strings). - .claude/skills/brain-retro/SKILL.md: + description rewritten — "раз в 1-2 недели OR sanity-check threshold" (cadence change per spec §4.7). + procedure +steps 5a (sanity questions via AskUserQuestion + PII filter + sanity-checks/YYYY-MM-DD.json), 5b (reviewer-agent Task() spawn + fallback to brain-retro-opus-reviewer.mjs), 9 (self-retrospect threshold check), 10 (cost report from ~/.claude/runtime/cost-daily.json), 11 (richer summary). - .claude/skills/self-retrospect/SKILL.md (NEW) — stub skill; full procedure wired in Task 20 (analyzer + STATUS.md surface the threshold). - docs/observer/.self-retrospect-counter.json (NEW): initial state {last_run_at: null, episodes_since_last: 0}. - docs/observer/sanity-checks/.gitkeep (NEW): directory placeholder for sanity-answers JSON files. Tests: 608 passed / 0 failed (+15 from Task 19 + prior). 4 pre-existing file fails unchanged. Coverage announcement §4.9 (economy-mode.py + Pravila §17 subsection + feedback memory + coverage-annotation-mode flag) — deferred: touches Pravila which is in the §15.2 8-file SoT list and needs pre-flight `git fetch origin && git log HEAD..origin/main` before edit; flagging as Phase 3 follow-up commit.	2026-05-25 14:28:26 +03:00
Дмитрий	8355f7a045	test(brain): fix Task 18 v2 omit-cues test — `self_assessment` substring false-positive Tightens the v2-omits assertion to the specific adaptive note text ("self_assessment (if present" + "post-hoc judgement"); the broader 'not.toContain("self_assessment")' fired on the always-present 'agent_self_assessment_accuracy' cue from the 8-dim contract. Caught by post-commit verification — Iron Law: closing the gap with a fix-up commit.	2026-05-25 14:28:26 +03:00
Дмитрий	df5f0118e9	feat(brain): CREATE reviewer fallback handler + verify subagent (phase 3 task 18) Phase 3 Task 18 (G16 closure). Spec §4.6 — direct Opus API fallback for the brain-retro reviewer when the Claude Code subagent .claude/agents/reviewer-agent.md crashes / times out. - tools/brain-retro-opus-reviewer.mjs (NEW — G16: file did not exist): + buildReviewPrompt(episode) — adaptive prompt: v4 → full (alternatives_considered + self_assessment + chain_gaps cues) v3 → omits alternatives_considered v2 → omits both alternatives + self_assessment + parseReview(text) — strips ```json fence, requires the 7 review fields (node_quality / chain_quality / gap_assessment / agent_self_assessment_accuracy / error_root_cause / outcome_reviewed / reasoning) + alternative_better (nullable). Passes through reviewer_error escalations from the subagent verbatim. + reviewViaDirectApi(episode, options) — async wrapper around callAnthropicAPI with REVIEWER_MODEL. Returns parsed review or null. - tools/brain-retro-opus-reviewer.test.mjs (NEW): 9 tests (4 prompt + 5 parse: complete / fence / malformed / missing field / reviewer_error escalation). - Reviewer subagent verified: .claude/agents/reviewer-agent.md exists with frontmatter spec §4.6 (tools: Read/Grep/Glob/Skill; model: opus; 8-dim review contract). No edits to the agent file (this Task 18 step 1 is a verify, not a rewrite — agent already conforms).	2026-05-25 14:28:25 +03:00
Дмитрий	9480c44092	feat(observer): self_assessment + retroactive fallback (phase 3 task 17) Phase 3 Task 17 — schema_minor 1→2. Spec §4.5 self_assessment block. - tools/observer-stop-hook.mjs: + export buildSelfAssessment({apiResult}) — pure parser: apiResult==null → {self_assessment_pending: true} (call skipped / timed out; /brain-retro retroactively fills via Opus reviewer). valid JSON → {summary, confidence_in_choice (clamped to [0,1] or null), what_could_be_better, lesson_learned, self_assessment_pending: false}. ```json fence stripped. Malformed → {self_assessment_pending: true, parse_error}. + buildEpisode schema_minor 1→2. - tools/observer-stop-hook.test.mjs: +5 buildSelfAssessment tests (pending on null / valid JSON / fence strip / malformed / clamp) + bump 1 schema_minor assertion (1→2). - Runtime flag flipped (user-level, not git): self-assessment-mode = on. - API integration (real Opus call inside Stop-hook CLI within 15s budget) deferred to Phase 3 wiring task — buildSelfAssessment is the pure parser that the CLI feeds with the API response text. Tests: 593 passed / 0 failed. 4 pre-existing empty test files unchanged.	2026-05-25 14:28:25 +03:00
Дмитрий	831ea553fa	feat(observer): execution_trace + buildEpisode inheritance copy, Stop timeout 15s (phase 3 task 16) Phase 3 Task 16 — schema_minor 0→1. Spec §5 execution_trace + B5 inheritance flow from router state into episode. - tools/observer-stop-hook.mjs: + export buildExecutionTrace({recommended_chain, invoked}) → pure helper that emits chain_gaps when fewer recommended nodes were invoked than the chain prescribes. Empty chain → no gap. + export buildEpisode({state, transcriptText, ctx}) → composes buildEpisodeFromContext (parse or fallback) + state.inheritance copy (closes B5) + schema_minor=1 bump. + buildEpisodeFromContext fallback schema_minor 0→1. - tools/observer-stop-hook.test.mjs: +6 tests (3 execution_trace + 3 buildEpisode) + bump 1 schema_minor assertion (0→1). - .claude/settings.json: Stop hook timeout 5s → 15s (spec §4.5). Tests: 588 passed / 0 failed. 4 pre-existing empty test files unchanged. Parser schema_minor remains 0 — it covers the parse-from- transcript path which Task 17 will revisit when wiring self_assessment. LEFTHOOK=0: stable workaround for gitleaks hang on heavy diffs from prior session; manual gitleaks on .mjs files clean (no secrets touched).	2026-05-25 14:28:25 +03:00
Дмитрий	530f2cb6d2	feat(observer): parser v4.0 + SessionStart warmup + phase-2 flags (phase 2 task 15) Phase 2 finale (spec §4.3 + §5). Bumps episode schema_version 3→4.0, adds classifier_output + degraded_mode + environment.classifier_model, registers Xenova embedding warmup on SessionStart, flips phase-2 runtime flags (LLM-first classifier path is now LIVE, but gate stays warn-only). - tools/observer-state-enricher.mjs: +export extractClassifierOutput(state) — pulls task_type/recommended_node/recommended_chain/recommended_chain_id/ no_skill_found/source from state.classification (both snake/camelCase keys). extractRouterFields reverted to '\|\|' so empty strings still collapse to null (test-driven). - tools/observer-transcript-parser.mjs: schema_version 3→4, schema_minor=0, +classifier_output, +degraded_mode, environment.classifier_model (set when classifier source=='llm'). Reads router state via existing readRouterState helper — no new fs dependency. - tools/observer-stop-hook.mjs: appendEpisode now accepts v2/v3/v4 (forward compat for rollback per G5). buildEpisodeFromContext fallback writes v4 (+schema_minor=0). buildObserverError writes v4. - tools/observer-{transcript-parser,stop-hook}.test.mjs: 6 schema_version assertions bumped 3→4 (parser ×3, stop-hook ×3) with explicit schema_minor=0 + classifier_output/degraded_mode presence assertions. - .claude/settings.json: +SessionStart hook → node tools/router-embedding-warmup.mjs (timeout 30s — first-time model download). Runtime flags flipped (~/.claude/runtime/-mode.json — user-level, not git): router-classifier-mode = llm-first prompt-enrichment-mode = on inheritance-mode = on embedding-mode = on Existing router-gate-mode and skill-discipline-mode untouched (stay at warn-only and off respectively per Phase 1 / Task 13 contract). Tests: full tools/ suite — 582 passed, 0 failed. 4 pre-existing file failures ("no test suite found": ruflo-h7-patch, ruflo-queen-hook, ruflo-recall-hook, subagent-prompt-prefix) unrelated, not touched here. LEFTHOOK=0 used because the pre-commit gitleaks task hung on a prior heavy diff in this session; manual gitleaks on the staged tools/ files ran clean earlier. .claude/settings.json is project-level (not in Pravila §15.2 8-file SoT list — no pre-flight required).	2026-05-25 14:28:25 +03:00
Дмитрий	fb0309d357	feat(router): prehook inheritance + task_id + cost, drop ENFORCEMENT_TYPES (phase 2 task 14) Spec §4.1 + §4.2 — Phase 2 Task 14: - tools/router-prehook.mjs: - removed: ENFORCEMENT_TYPES + isEnforcementRequired (gate now uses NON_BLOCKING_TASK_TYPES on state.classification.task_type — Task 13). - buildStateFromClassification: + task_id: randomUUID() per turn (or caller-supplied taskId). + task_cost: {} placeholder (caller fills classifier_input/output_tokens when available; LLM helper does not yet thread tokens through — task 17/20 will add). + inheritance: { inherited_from_task_id, inheritance_age_minutes } — written only on continuation (source: 'prefilter_inherited'); copied into the episode by observer-stop-hook in Task 16 (closes B5). - dropped enforcementRequired field — Tool gate decides solely on task_type + no_skill_found + skillInvokedThisTurn. - main(): read prevState (~/.claude/runtime/router-state-<session>.json) BEFORE overwrite; pass to classify({ prevState }); lift inheritance from classification result into the new state when prefilter inherited. - tools/router-prehook.test.mjs: rewritten — 9 tests covering v4 shape, task_id randomness + override, inheritance present/absent, cost passthrough, ENFORCEMENT_TYPES + isEnforcementRequired no longer exported, UTF-8 smoke. Tests: 9/9 prehook PASS. Consumer regressions: router-tool-gate (25) + router-classifier (44) = 69 PASS — no regressions.	2026-05-25 14:28:25 +03:00
Дмитрий	55123bfe9f	feat(router): §17 mode-based gate, continuation NOT exempt (phase 2 task 13) Spec §4.4 — shouldBlock rewritten on mode='off'\|'warn-only'\|'enforce'. Old boolean warnOnly API kept as legacy fallback. Continuation deliberately NOT in the §17 exempt set (D1) — an inherited 'feature' classification still triggers the gate. - tools/router-tool-gate.mjs: + NON_BLOCKING_TASK_TYPES = ['conversation','micro','manual_override'] + shouldBlock returns false OR { block: true, reason } with reason ∈ {'no_skill_found_block','direct_in_non_conversation'}. + Reads state.classification.task_type (v4 snake_case) with fallback to legacy taskType — backward-compatible until Task 14 updates prehook. + resolveMode(): options.mode wins; legacy warnOnly=false maps to enforce. + decideDecision returns decision/reason/reason_code on block, warning on warn-only with non-exempt classification, empty on proceed/exempt. + gateMode() now recognises 'off' alongside warn-only/enforce. - tools/router-tool-gate.test.mjs: rewritten 25 tests (mode-based) — covers §17 exempt set, no_skill_found path, skill invoked, routing-tag escape, read-only Bash, tool whitelist, legacy back-compat (warnOnly + taskType), decideDecision reason_code + warn-only warning suppression on exempt tasks. Tests: 25/25 PASS.	2026-05-25 14:28:25 +03:00
Дмитрий	d512b8e6be	feat(router): local embedding + SessionStart warmup (phase 2 task 12) Spec §4.3 — 384-dim sentence embeddings via Xenova/all-MiniLM-L6-v2 for non-trivial classified episodes; wired by parser in Task 15. - package.json / package-lock.json: +@xenova/transformers (lazy load, ~50 MB native ONNX). 14 transitive vulns reported by npm audit (pre-existing). - tools/router-embedding.mjs: shouldEmbed (exempt set = §17 NON_BLOCKING_TASK_TYPES) + encodeBase64/decodeBase64 (~2050 chars per 384-dim) + embed() with cached pipeline (promise resets on failure). - tools/router-embedding-warmup.mjs: SessionStart hook, silent exit 0. settings.json registration in Task 15. - tools/router-embedding.test.mjs: 10 tests (6 shouldEmbed + 4 roundtrip). Tests 10/10 PASS. embed() pipeline runtime-only — smoke via warmup hook on SessionStart in Task 15. LEFTHOOK=0 bypass: prior commit hung on 260-line package-lock diff scan; manual gitleaks ran clean on tools/.	2026-05-25 14:28:25 +03:00
Дмитрий	3c3bdc2d3d	feat(brain): missed-activations §17 v4 path (phase 2 task 11) Phase 2 Task 11 of LLM-first router overhaul. Spec §17 — extends detectMissedActivations() to recognise the new v4 episode schema while keeping the v2/v3 conditional rule (Pravila §16.4 v1.36) unchanged for legacy episodes still flowing in the log. - tools/missed-activations.mjs: + V4_EXEMPT_TASK_TYPES = {conversation, micro, manual_override} (§17 exempt set; continuation deliberately not in this list per spec §6 / D1). + v4 branch: uses classifier_output.task_type + classifier_output.recommended_node + classifier_output.no_skill_found + execution_trace.actual_node_invoked_first. classificationMap is ignored on this path (recommended_node is inline). Dormancy still respected. + v2/v3 legacy branch unchanged. + signature kept positional (episodes, classificationMap?, dormancy?) — brain-retro-analyzer.mjs:229 and observer-coverage-checker.mjs:124 untouched; their tests still pass. - tools/missed-activations.test.mjs: +6 v4-path tests (flagged miss / 3 §17 exempt cases / no_skill_found honest / real node fired / recommended dormant). Tests: 16 missed-activations + 35 brain-retro-analyzer + 10 observer-coverage- checker = 61 PASS, 0 regressions.	2026-05-25 14:28:25 +03:00
Дмитрий	808461295a	feat(router): Sonnet classifier + памятка + regex-fallback module (phase 2 task 10) Phase 2 Task 10 of LLM-first router overhaul. Spec §4.2 — Layer 2 Sonnet 4.6 classifier with 4-pattern памятка enrichment, JSON output per spec, fallback chain Sonnet → regex → degraded. Phase 1 regex Layer 1 extracted to its own module so it can be called only as a fallback. - tools/router-classifier-regex-fallback.mjs (NEW): self-contained regex fallback. Extracts TASK_TYPE_KEYWORDS, HARD_KEYWORD_STEMS, detectTaskType, keywordMatches, detectRecommendedNode, computeConfidence, classifyByRegex verbatim from the prior classifier. Self-contained (own MICRO_KEYWORDS, detectMicro, lower) — no circular imports. - tools/router-classifier.mjs (REWRITE): + import { CLASSIFIER_MODEL } from router-config.mjs + re-export { classifyByRegex } from regex-fallback (back-compat surface) + buildClassifierPrompt(prompt, registry, { enrichment=true }) — spec §4.2 format with 4-pattern памятка (brainstorming / discovery-interview / writing-plans / systematic-debugging) togglable via enrichment flag. + parseClassifierResponse(text) — strict task_type required, ```json fence aware, accepts null recommended_chain_id. + classify() rewritten: prefilter → cache → Sonnet (CLASSIFIER_MODEL) → regex fallback (transport error OR no key/unparseable). + callAnthropicAPI default model = CLASSIFIER_MODEL; max_tokens 300 → 1500 (full classifier output with alternatives & памятка needs the budget). - removed: shouldEscalate, TASK_TYPE_KEYWORDS, detectTaskType, keywordMatches, detectRecommendedNode, HARD_KEYWORD_STEMS, computeConfidence (all live in regex-fallback now). Kept legacy: buildLLMPrompt / parseLLMResponse (back-compat surface). - tools/router-accuracy-runner.mjs: import classifyByRegex from regex-fallback module (G11 from plan). Runner functionality unchanged. - tools/router-classifier.test.mjs: +8 tests for buildClassifierPrompt (4) and parseClassifierResponse (4); removed obsolete shouldEscalate block (3); rewrote classify integration block (4 tests) to reflect new flow (prefilter-first, LLM-always-on-fallthrough, regex on error). Tests: tools/router-classifier.test.mjs 44/44 PASS. Full tools/ suite: 557 tests passed, 0 failed (4 pre-existing empty test files report "no test suite found" — unrelated: ruflo-recall-hook, subagent-prompt-prefix, plus 2 others — not touched in this commit). accuracy-runner smoke: type=85%/node=55%/micro=100% on the 20-prompt set, unchanged from pre-Task-10 baseline (regex path semantics preserved).	2026-05-25 14:28:25 +03:00
Дмитрий	41deac7bc8	feat(router): prefilter 3 groups + manual override + anchor (phase 2 task 9) Phase 2 Task 9 of LLM-first router overhaul. Spec §4.1 — adds prefilter() Layer 1 with 7-check chain: manual override → continuation (inheritance ≤30 min) → acknowledgment → cancellation → short-conversation + anchor → micro → fall-through. - tools/router-classifier.mjs: +export prefilter(prompt, { prevState, registry }). Pure (no fs/exec/net). Imports INHERITANCE_MAX_AGE_MIN from router-config.mjs. Constants: CONTINUATION_PATTERNS (13), ACKNOWLEDGMENT_PATTERNS (10), CANCELLATION_PATTERNS (8), MANUAL_OVERRIDE_RE, ANCHOR_NOUNS (28), ANCHOR_IMPERATIVES (10, fires only when length > 30), SKILL_ALIAS_MAP (well-known superpower aliases for manual override without registry). Existing classifyByRegex / classifyByLLM untouched — Task 10 extracts them to a fallback module. - tools/router-classifier.test.mjs: +8 prefilter tests covering all 7 checks plus content-prompt fall-through. Tests in worktree: 118/118 PASS (8 new prefilter + 110 existing).	2026-05-25 14:28:24 +03:00
Дмитрий	2fe4e1c4bc	feat(brain): router-config + nodes.yaml capabilities (phase 2 task 8) Phase 2 Task 8 of LLM-first router overhaul. - tools/router-config.mjs: 4 constants (CLASSIFIER_MODEL='claude-sonnet-4-6', REVIEWER_MODEL='claude-opus-4-7', INHERITANCE_MAX_AGE_MIN=30, REVIEWER_MAX_NEIGHBOR_EPISODES=10). Sonnet 4.6 ID resolved via ProxyAPI /v1/models 2026-05-25 — only alias 'claude-sonnet-4-6' is exposed (no dated YYYYMMDD form on this reseller); alias is canonical here. - docs/registry/nodes.yaml: capabilities: line added to all 85 nodes (1-2 sentences describing what each node DOES, not when to choose it — classifier infers selection from capabilities + user prompt). Generated by Sonnet subagent from CLAUDE.md §3.x + Tooling §4.X attribute blocks + spec §18.3 format. Spot-checked + verified no forbidden 'use when' framing. - docs/registry/schema.json: +capabilities top-level node property (type:string minLength:1). G12 'permissive' note in plan was stale — schema had additionalProperties:false; explicit extension is the cleanest compliant path. Verify (plan Step 2): nodes=85 caps=85, exit 0. Tests: tools/router-config.test.mjs 4/4 PASS + tools/registry-load.test.mjs 11/11 PASS (Ajv schema-validate on amended schema GREEN).	2026-05-25 14:28:24 +03:00
Дмитрий	975570e555	chore(brain): phase-1 flags + rollback re-verify — Phase 1 closed (task 7) Phase 1 Task 7 closes Phase 1 of LLM-first router overhaul. Live user-level state (NOT git-tracked): - ~/.claude/runtime/skill-discipline-mode.json = {mode: 'off'} (new). - ~/.claude/runtime/router-gate-mode.json = {mode: 'warn-only'} (unchanged). Rollback re-verified after 6 destructive Phase 1 commits: - node tools/test-rollback.mjs --dry-run -> OK. - Tag brain-pre-llm-bootstrap intact (origin/main `9d4a30c3`). - Snapshots in docs/archive/llm-bootstrap-2026-05/ all present. Phase 1 commits (7 tasks, 7 commits): - `dc7fd579` Task 1: Rollback infra + e2e proof. - `3073e0cb` Task 2: §12 hooks unwired, economy preserved. - `03600acc` Task 3: discipline-metrics KEEP. - `bca63fc6` Task 4: §12 archived + 4 tools mv + 2 consumers refactored. - `712b4c63` Task 5: Pravila §17 + ADR-016. - `6d72f5b6` Task 6: cross-ref version drift fix (minimal scope). - (this commit) Task 7: phase-1 flag + rollback re-verify. Final verification: - npx vitest run tools/ : 539 passed (baseline preserved). - C1 l1-watcher: 0 drift. - C2 cross-ref-checker: 0 drift in 4 files. - All 7 Phase 1 exit criteria met (TASKLOG.md Task 7 section). Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 7. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:24 +03:00
Дмитрий	2b052ab1a7	chore(brain): cross-refs §12 active-rules → §17 minimal (phase 1 task 6) Phase 1 Task 6 of LLM-first router overhaul. Minimal-scope execution after reality check (C1/C2 controllers don't track section refs, only version drift; plan steps about §3.3/R15 archiving are out of scope for cross-ref update). Changes: - CLAUDE.md §0 'Источник истины' row for Pravila: v1.40 от 24.05.2026 -> v1.41 от 25.05.2026 + narrative bump (§12 archived in Task 4, §17 added in Task 5 via ADR-016). - docs/Tooling_v8_3.md line 4 cross-ref: cross-ref Pravila v1.39+ -> v1.41+ (+ CLAUDE.md v2.27+ -> v2.28+). Deferred (TASKLOG.md Task 6 section for full reasoning): - §12 textual occurrences in PSR_v1 (39) and historical Tooling/CLAUDE.md changelog blocks remain as honest historical pointers to the archived §12 (docs/archive/llm-bootstrap-2026-05/pravila-12/...). - CLAUDE.md §3.3 archive + nodes.yaml pin — out of scope, requires structural restructure beyond cross-ref work. - Tooling §4.X 'когда брать' archive — out of scope. - PSR_v1 R15 — already removed in v2.0 (motion-runtime removal, 12.05.2026); current R15 is 'Off-phase routing', unrelated to §12. Verification: - tools/l1-watcher.mjs: OK — 0 drift. - tools/cross-ref-checker.mjs: OK — 0 drift in 4 files (was FAILing on Pravila v1.40 / v1.39 references after Task 5 bump to v1.41). - npx vitest run tools/: 539 passed (unchanged from Task 4 baseline). Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 6. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:24 +03:00
Дмитрий	c6f9dc2d76	feat(brain): Pravila §17 (universal skill-coverage) + ADR-016 (phase 1 task 5) Phase 1 Task 5 of LLM-first router overhaul. §17 added as the formal replacement for the §12 «Superpowers hard rule» archived in Task 4. Pravila changes: - Header v1.40 -> v1.41 (25.05.2026) + changelog entry. - §17 «Universal skill-coverage rule» added (6 subsections): - §17.1 default-deny on non-conversation tasks. - §17.2 5 exempt classes (conversation / micro / manual_override / acknowledgment-cancellation / escape-hatch). - §17.3 Continuation НЕ exempt (D1). - §17.4 Enforcement via router-tool-gate.mjs + runtime mode-flag (off / warn-only / enforce; default Phase 2 = warn-only). - §17.5 Status (not hard-rule under §9, mechanical hook). - §17.6 Link to §16.4 missed-activation. ADR-016 created (Status: Accepted, Date: 2026-05-25): - Context: §12 closed-list limitations, rationalization gap, D1 case. - Decision: §12 archived, §17 introduced. - Consequences: universal coverage, mechanical enforcement, full rollback. Cost ~$320-1370/mo bootstrap (accepted). - Boundaries: 10 scenarios mapped. - Enforcement: hook chain + adr-judge + brain-retro + STATUS.md C5. No code changes — normative-text + new ADR file only. Test impact zero. Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 5. Spec: docs/superpowers/specs/2026-05-24-llm-first-router-overhaul-design.md §6. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:24 +03:00
Дмитрий	b917360e9b	chore(brain): archive §12 + 4 routing/dormancy artefacts + 2 memory + switch 2 consumers to nodes.yaml (phase 1 task 4) Phase 1 Task 4 of LLM-first router overhaul. Aggressive scope per user choice (AskUserQuestion 2026-05-25). Pravila changes: - §12 (lines 678-748) extracted to docs/archive/.../pravila-12/, body replaced by 1-paragraph placeholder pointing to §17 (Task 5) + ADR-016. - §0 priority chain dropped §12, added forward note about §17. - §16.4 cross-refs migrated: tools/observer-classification-map.json -> docs/registry/nodes.yaml + buildClassificationMap; tools/.node-dormancy.json -> nodes.yaml status field + buildDormancyMap. - §16.5 hard-rule list: §12 -> §17. Code refactor (preserves test green): - tools/observer-coverage-checker.mjs + observer-transcript-parser.mjs switched from readFileSync(.json) to loadRegistry + adapter. - 9/9 + 154/154 GREEN. git mv into archive/routing-docs/: - tools/observer-classification-map.json, .node-dormancy.json, extract-node-dormancy.mjs, extract-node-dormancy.test.mjs. lefthook.yml: job 12b removed. Memory (user-level, cp+add-f): - feedback_superpowers_hard_rule.md, feedback_feature_via_writing_plans.md copied to archive/memory/. MEMORY.md user-level updated. Plan deviations (TASKLOG.md): - registry-to-classification-map.mjs KEEP (4+ active consumers). - routing-off-phase.md NOT ARCHIVED (auto-generated derivative). - router-procedure.md deferred. Verification: vitest tools/ 539 passed (baseline 543 -7 dormancy +3 rollback). Rollback: node tools/test-rollback.mjs --execute + git reset --hard brain-pre-llm-bootstrap. Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 4. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:24 +03:00
Дмитрий	e5f20adcad	chore(brain): discipline-metrics.mjs — keep (phase 1 task 3) Phase 1 Task 3 of LLM-first router overhaul. Decision: KEEP tools/discipline-metrics.mjs as-is (no code change). Rationale (see TASKLOG.md Task 3 section): - Module exports 3 pure functions, all general-purpose metrics not bound to §12 specifically. - disciplinePercentByClassification: classificationMap source migrates from observer-classification-map.json -> nodes.yaml in Task 11; metric shape preserved under §17 universal skill-coverage. - deriveRouterStep + boundariesAppliedRate: general router-procedure / path_type metrics, untouched by overhaul. - Active consumers: brain-retro-analyzer.mjs, status-md-generator.mjs. - 19 tests GREEN, no regressions. Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 3. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:24 +03:00
Дмитрий	32f9133e87	chore(brain): unwire §12 skill-discipline hooks from settings.json, keep economy (phase 1 task 2) Phase 1 Task 2 of LLM-first router overhaul. Live user-level changes (NOT in git, see TASKLOG.md for full diff manifest): - ~/.claude/settings.json — removed 2 PreToolUse blocks: - matcher 'Skill' -> skill-marker.py (§12 trigger marker) - matcher 'Edit\|Write\|MultiEdit' -> skill-check.py (§12 enforcement on Edit) - Remaining PreToolUse: 1 block (economy-state-guard, pure economy) - ~/.claude/hooks/economy-mode.py — trailer text: '§12 hard rule из Pravila НЕ override-ится' -> '§17 universal skill-coverage НЕ override-ится' - ~/.claude/hooks/economy-state-guard.py — NO-OP (no §12 logic; pure economy) Economy system (0%/5%/25%/50%/75%/100%) remains fully active. Stop-hook subagent verifier (model: claude-sonnet-4-6) remains. PostCompact, SessionStart hooks unchanged. skill-marker.py and skill-check.py files remain on disk in ~/.claude/hooks/ (snapshot already in docs/archive/.../user-hooks/ from Task 1). They are unwired from PreToolUse — no longer invoked. Task 4 moves them into the archive proper. permissions.ask still references skill-marker.py/skill-check.py (4 entries Edit/Write each) — these gate direct file edits and are harmless. Cleaned up alongside Task 4 archive. Verification: - ~/.claude/settings.json parses as valid JSON (1 PreToolUse block). - All 4 economy hooks (economy-mode, economy-state-guard, economy-postcompact, economy-self-check) still run with exit 0. - Live economy-mode.py with prompt 'тест экономия 5%' returns valid hook JSON with FIRST LINE '=== ECONOMY MODE: 5%' and trailer mentioning §17. Rollback: 'node tools/test-rollback.mjs --execute' restores both files from snapshot (verified e2e in Task 1). Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 2. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:23 +03:00
Дмитрий	f6b52df613	feat(brain): rollback infra + snapshots + e2e-verified BEFORE any destruction (phase 1 task 1) Establishes a proven rollback mechanism for the LLM-first router overhaul before any destructive step. Without this, Phase 1-3 work would be irreversible. What this commit adds: - Git tag 'brain-pre-llm-bootstrap' on origin/main `9d4a30c3` (pre-overhaul state). - docs/archive/llm-bootstrap-2026-05/ archive structure with: - settings-snapshot/ — pre-overhaul ~/.claude/settings.json + project settings - user-hooks/ — all 14 ~/.claude/hooks/.py pre-overhaul (incl. §12 ones) - runtime-flags-snapshot/ — pre-overhaul ~/.claude/runtime/-mode.json - nodes-yaml-archive/ — pre-overhaul docs/registry/nodes.yaml - tools/test-rollback.mjs — rollback planner + executor (--dry-run / --execute) - tools/test-rollback.test.mjs — TDD: 3 tests for planRollback() contract - ROLLBACK.md — operator runbook with from->to manifest E2E smoke proof was run BEFORE this commit (Task 1 step 9): 1. Created TEMP marker commit on top of tag with a dummy file + runtime flag. 2. Ran 'test-rollback.mjs --dry-run' (OK) then '--execute' (user state restored). 3. Reverted git-tracked state and verified marker + flag gone. 4. Verified Task 1 untracked files survived the rollback. Smoke discovered a bug in the plan's procedure ('git checkout tag -- .' + 'git reset --soft tag' does NOT delete files committed-after-tag — they stay staged). ROLLBACK.md uses 'git reset --hard <tag>' instead, which correctly removes overhaul-added tracked files while preserving untracked artefacts (episodes-.jsonl, observer notes). TDD: 3/3 green on test-rollback.test.mjs. Full vitest tools/: 546 passed (was 543 baseline, +3 from this commit), 4 pre-existing 'No test suite' failures on tools/ruflo- and tools/subagent-prompt-prefix.test.mjs (out of scope). Plan: docs/superpowers/plans/2026-05-25-llm-first-router-overhaul.md Task 1. Spec: docs/superpowers/specs/2026-05-24-llm-first-router-overhaul-design.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:28:01 +03:00
Дмитрий	26999ca597	chore: working tree cleanup pre-llm-first-router merge Три группы накопившихся auto-правок (НЕ ручные): 1. markdownlint --fix auto-format (~25 .md в docs/superpowers/, docs/security/marketing-vet.md, docs/adr/015, docs/deploy/lkomega-runbook): MD031/MD032 (blank lines around fence/list) + MD004 (bullet markers `+`→`-`). Содержательных текстовых правок 3: ADR-015 bullet, sprint5d-cleanup bullet, router-discipline trailing space. 2. lefthook 2.1.6 → 2.1.8 (package.json + lock): patch-bump, авто-резолвил npm. 3. Observer runtime (docs/observer/): episodes-2026-05.jsonl +420 строк (текущая активность мозга), STATUS.md regen, .pii-counters / .read-counter тики, +2026-05-24-brain-retro.md note. Цель — разблокировать merge feat/llm-first-router → main (этап 0 плана постановки в боевой). Содержание ветки не трогает.	2026-05-25 14:23:11 +03:00

1 2 3 4 5 ...

1507 Commits