breakpilot-compliance

Author	SHA1	Message	Date
Benjamin Admin	dfb2c6dfdb	Merge remote-tracking branch 'origin/feat/iace-machinery-playbooks' into feat/customer-mission-2-multicert	2026-06-28 09:08:54 +02:00
Benjamin Admin	16d6ad4122	feat(knowledge): CE conformity + technical documentation playbook draft (machinery) 5th machinery-safety playbook, capability ce_conformity_assessment_and_technical_ documentation — referenced by the ISO27001->CRA+MaschinenVO transition pattern and listed as content-missing. Covers MaschVO conformity assessment (Annex XI), technical file (Annex IV), EU declaration (Annex V) and CE marking; notes the CRA<->MaschinenVO integrated technical file. status: draft, with canonical_action verb. New file only -> non-runtime, no deploy, conflict-free ride-along. capability_id unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 09:02:06 +02:00
Benjamin Admin	3856bb3a4f	feat: Customer Mission #2 — the company arrives with a PROFILE, not a journey Second mission, deliberately different from #1: a highly-certified company (ISO 9001 + ISO 27001 + ISO 14001 + TISAX + CE + PSIRT) asking „what do WE still need for the CRA?". Stresses Mission #1's one open seam (Scope → Journey) and proves the reframe with the real engines: - The start is a Company Capability Profile (certs aggregated), NOT a single cert→target journey. Certifications are OBSERVATIONS feeding the profile. - Evidence is target-relative: ISO 14001 is in the profile but irrelevant to the CRA; PSIRT covers two CRA-delta capabilities. More evidence = smaller delta (12 → 9). - The „journey" is the computed delta (Profile, Target) — not a thing a selector picks. This SHRINKS Mission #1's jump: the seam is profile-intake + target-pick, not a journey-matcher engine. There is no „ISO 27001 → CRA"; only „Profile → CRA". Records the 5 per-mission selection-rationale questions (which journey/why/decisive info/model-extended?/new-parameter?). Selector input = (Company Profile, Target), which collapses the 2^N cert-combination explosion. Non-runtime (reference_scenarios + tests only) -> no deploy. 6 tests pass; check-loc 0.	2026-06-28 09:00:51 +02:00
Benjamin Admin	0b962b41fa	feat(knowledge): 4 machinery-safety implementation playbook drafts (Reasoning delegation) Fulfils the board delegation Reasoning -> IACE (line 45): expert FIRST DRAFTS for the 4 MaschinenVO capabilities the Reference-Suite playbook dashboard lists as "content missing": machine_safety_risk_assessment (ISO 12100), mechanical_safety_and_guards (ISO 14120/14119/13850/13849), operating_instructions_and_safety_information (ISO 12100 6.4 / IEC 82079), protection_against_corruption_of_safety_functions (MaschVO Annex III 1.1.9 = the CRA<->MaschinenVO cyber-safety bridge). Schema per knowledge/implementation_playbooks/README.md. status: draft (expert draft, non-normative). Includes the optional canonical_action verb-formulation (capability-is- a-verb experiment). New files only -> non-runtime, no deploy, conflict-free ride-along. Capability ids unchanged (Execution registry contract). Owner verifies + integrates. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 08:47:56 +02:00
pilotadmin	b6c400902e	Merge pull request 'feat: Customer Mission #1 — the platform as one connected expert system (end-to-end)' (#29 ) from feat/customer-mission-1 into main	2026-06-28 08:40:12 +02:00
Benjamin Admin	98f67e75d9	feat(mission): Customer Mission #1 — the platform as one connected expert system (end-to-end) Turn the architecture inside-out: instead of refining classes/registries/journeys, force the whole platform to behave as ONE expert system and run a real consulting project end-to-end — measuring how often the consultant has to "jump" (special-case glue instead of a clean engine-to-engine handoff). A Reference Scenario asks "is the knowledge correct?"; a Customer Mission asks "can a customer WORK with it?". This is the last big architecture test before broad corpus expansion. - reference_scenarios/mission_machine_builder.py: a synthetic machine builder (ISO9001 + ISMS + CE + PLC + remote maintenance + cloud + 80 devs + EU; no real names) asks "what must I do in the next 6 months?". Runs the REAL engines: Regulatory Map -> Journey selection -> Capability Delta (RS-005) -> Roadmap (leverage) -> Playbooks -> Evidence -> Verification -> Completeness, and produces the 6-month consulting answer ("the top-5 measures close 9/16 = 56%, starting with the ones that satisfy CRA AND MaschinenVO at once"). - Flow-Continuity audit (the actual test): 5 CLEAN, 2 JUMPS, 2 deliberate DEPENDENCIES. The two real seams: (1) Scope -> Journey (no `certs x targets -> journeys` selector engine; the data exists in transitions.yaml, only the selection is glue); (2) Evidence -> Verification (parked, Vision V2). The two dependencies (cert->capability map @Execution, corpus_status curation) are intended ownership boundaries, not architecture breaks. - Finding: the platform carries the WHOLE consulting flow end-to-end. Once the Scope->Journey selector exists, the foundation is essentially done — from there the work is knowledge, not architecture. 4 end-to-end tests (mission runs, exactly two known jumps, full flow present, no real company names). check-loc 0. Non-runtime harness -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 08:39:26 +02:00
pilotadmin	f652e2d4ed	Merge pull request 'feat: Domain Vocabulary — identity vs representation + regulation aliases (fixes KPI normalization)' (#28 ) from feat/domain-vocabulary into main	2026-06-28 08:12:15 +02:00
Benjamin Admin	ecae5bc7f1	feat(vocabulary): Domain Vocabulary — identity vs representation; regulation aliases fix the KPI normalization Before the next Journey: the LANGUAGE. With 5 knowledge objects but no vocabulary, the same reise gets named four different ways (ISO9001->MaschinenVO vs Quality Management->Product Safety vs ...). The spec answers ONE question: which terms are IDENTITIES and which are REPRESENTATIONS of the same meaning? - spec docs-src/architecture/domain-vocabulary-spec-v1.md (PROPOSAL): identity hierarchy (Requirement RQ / Capability MCAP [Registry 2C] / regulation-source-target / Journey Class MJRN [PROVISIONAL] / Journey instance / Playbook MPLB); canonical name + aliases; capability vocabulary = the Capability Registry (not rebuilt); reorder Vocabulary -> Transition #2 -> #3 -> Rule of Three. - knowledge/vocabulary/regulations.yaml: regulation/standard IDENTITIES (id + canonical + aliases). SOLVES the regulation-ID normalization the KPIs flagged: CRA == "Cyber Resilience Act" == "Regulation (EU) 2024/2847" all resolve to `cra`; ISO9001/QMS -> iso9001; etc. Shared artifact (@Legal-KG/@Execution please adopt). - knowledge/vocabulary/journey_classes.yaml (PROVISIONAL): clusters our transitions into classes (Information Security -> Product Cybersecurity; Quality Management -> Product Compliance/Safety). Finding: ISO9001->MaschinenVO is an INSTANCE of an existing class (like ISO9001->CRA, ISO13485->MDR), not a new kind -> avoids duplication. Journey Class is a new abstraction -> its own Rule of Three (no MJRN minting yet). - reference suite: both KPIs now read aliases from regulations.yaml instead of hard-coded maps; the "Regelwerk-ID-Normalisierung" line flips TODO -> PASS. KPI numbers unchanged (vocab is a superset). - Side effect = Requirements Intelligence: a Tender "Security Patch Procedure" resolves to MCAP-0017. 7 vocabulary tests (17 with domain programs), check-loc 0. Knowledge data + spec + reference harness = non-runtime -> no deploy (ADR-001). No new module, no runtime change, no minting (Freeze). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 08:11:30 +02:00
pilotadmin	23a6f02ec2	Merge pull request 'docs: sharpen Journey canonicalization gate (two conditions + diverse transitions + model-change balance)' (#27 ) from docs/journey-canon-criteria into main	2026-06-28 07:43:13 +02:00
Benjamin Admin	4a7412e4f2	docs(spec): sharpen Journey canonicalization gate — two conditions, diverse transitions, model-change balance User 2026-06-28: canonicalization is NOT just "3 transitions built". Two conditions: 1. >= 3 deliberately DIFFERENT transitions (the more different the character, the stronger the evidence — not three similar security transitions): ISO27001->CRA (security->cyber), ISO9001-> MaschinenVO (QM->product safety), TISAX->CRA (automotive security->cyber). 2. NO structural extension of the Journey model in the last two transitions (or only clearly justified, general extensions). Per-transition maturity test: "did the MODEL need extending, or were only DATA added?" — tracked as a balance sheet. Only when both hold (3 diverse + model stable in the last two) -> rename Transition Pattern -> Journey, ratify ADR-011, derive renderers. Matches the pattern at Compiler / Layout families / Master Controls: become the standard only after proving stable under DIFFERENT loads. Non-runtime -> no deploy. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 07:42:26 +02:00
pilotadmin	0cb224a7f1	Merge pull request 'docs: Journey model — Accepted as Concept, Pending Canonicalization (Rule of Three)' (#26 ) from docs/journey-provisional-acceptance into main	2026-06-28 07:32:55 +02:00
Benjamin Admin	d44f3672be	docs(spec): Journey model — Accepted as Concept, Pending Canonicalization (Rule of Three) User decision (2026-06-28): provisional acceptance. Journey is now the preferred way of THINKING, but the persisted artifact stays "Transition Pattern" — NO rename, NO migration, NO runtime change. Per the Rule of Three, Journey becomes the official primary entity only after it proves itself on >=3 distinct transitions (1. ISO27001->CRA done, 2. ISO9001->MaschinenVO, 3. TISAX->CRA). Only then: rename to Journey, ratify ADR-011, derive renderers officially. Erst beweisen, dann kanonisieren — as with Master Controls/Capabilities. Also makes the two-axis separation durable (the most valuable finding): Atomic Requirement -> Capability -> Journey (transition axis) vs Capability -> Playbook (implementation axis). Journey belongs to the transition; Playbook stays capability-owned, referenced by any number of journeys. We do NOT force-unify. Non-runtime doc -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 07:32:14 +02:00
pilotadmin	c98500c303	Merge pull request 'docs: Journey model spec (PROPOSAL) — validate transition is the unit, rest are renderers' (#25 ) from spec/journey-model into main	2026-06-28 07:16:34 +02:00
Benjamin Admin	4efbfa45c4	docs(spec): move Journey model spec to repo-root docs-src/architecture (correct location, fixes ADR links) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 07:15:01 +02:00
Benjamin Admin	86a783e72f	docs(spec): Journey model PROPOSAL — validate "transition is the unit, rest are renderers" Transition Pattern, Playbook and Reference Scenario describe the same transition from different angles. The hypothesis: a single underlying object, the Journey, is the unit of knowledge; everything else is a renderer ("rendered, not modeled"). Per the user's request this is a SPECIFICATION to validate the assumption BEFORE any code — not a runtime module, not new architecture, decision pending. - Conceptual Journey schema: identity (from->to) + source_variants + likely_covered[] + delta[] + rejected_assumptions; questions/measures/evidence/verification are DERIVED. Truth hierarchy: Atomic Requirement -> Capability -> Journey (x Company context on instantiation). - Renderers: Transition Pattern (= the curated Journey core), Interview, Roadmap, Reference Scenario, Evidence view, role views (Sales/Auditor/Developer/GF) — four views, one Journey. - GROUNDED validation against the real ISO 27001 -> CRA artifacts (TP-ISO27001-CRA-v1, the sbom/CVD playbooks, RTS-001/002/003). Honest finding: the unification HOLDS with two refinements that AVOID premature abstraction: 1. Reference Scenario = Journey x Company context + expected outcome (no duplicated transition data). 2. Playbook = a CAPABILITY renderer (reused across journeys), NOT a Journey renderer — stays capability-owned (ADR-004); the Journey aggregates, it does not own. => a transition is curated exactly once; ISO9001->MaschinenVO becomes one object, not four. - Proposes the principle "rendered, not modeled" alongside computed-not-stored / derived-not-curated. No code, no runtime change (Freeze). Non-runtime doc -> no deploy (ADR-001). If accepted -> ADR-011. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-28 07:14:15 +02:00
pilotadmin	1054facffa	Merge pull request 'feat: Operational Knowledge — the transition is the unit + Transition Coverage KPI' (#24 ) from feat/transition-coverage into main	2026-06-27 23:49:41 +02:00
Benjamin Admin	18f5d0cb05	feat(programs): Operational Knowledge — the transition is the unit + Transition Coverage KPI Customers don't buy "EMV domain"; they buy "we have ISO 9001, help us with the CRA". The sellable unit of knowledge is the TRANSITION (from -> to), not the law and not the capability. This reframes the backlog from "model EMV next" to "the top demanded transitions". No new runtime framework (ADR-010). - knowledge/programs/transitions.yaml: the Operational Knowledge backlog — the ~20-30 actually demanded transitions (of ~N*(N-1) possible) with priority. ISO27001->CRA, ISO9001->CRA, ISO9001->MaschinenVO (all 5-star), IEC62443->CRA, TISAX->CRA, ISO27001/IEC62443->NIS2, ISO14001->Umweltrecht. - Transition Coverage KPI (reference suite, computed-not-stored): per transition a status DERIVED from the transition-pattern corpus (reviewed/validated/proven -> Gold, draft -> 🟡, none -> ⚪). Honest current state: ISO27001->CRA ✅ reviewed, ISO9001->CRA 🟡 draft, rest ⚪. Highest-priority gap = ISO9001->MaschinenVO (the next Track-B work) — a far stronger product indicator than "EMV 30% modelled". - Three knowledge layers documented: Regulatory -> Operational (transitions/playbooks/deltas, the biggest differentiator) -> Verification (Vision V2). A domain is a TRANSITION PROGRAM with two tracks: Track A breadth (model sources, @Legal-KG/@Execution) + Track B product (transitions/playbooks/RTS per source, @Reasoning). - ADR-010: the transition is the unit of knowledge; Transition Coverage KPI; three layers; two tracks. 10 program/transition-contract tests, check-loc 0. Knowledge data + ADR + reference harness = non-runtime -> no deploy (ADR-001). No new module, no runtime change. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 23:48:45 +02:00
pilotadmin	a2403eaed9	Merge pull request 'feat: open Domain Knowledge Program v1 (7-stage production line + per-domain KPI)' (#23 ) from feat/domain-knowledge-program-v1 into main	2026-06-27 18:50:12 +02:00
Benjamin Admin	1a9439d013	feat(programs): open Domain Knowledge Program v1 — 7-stage production line + per-domain KPI The real bottleneck is domain MODELLING. Phase B is organized as one program with sub-programs per domain, each run through the SAME 7-stage production line. No new runtime framework, no new module (ADR-009, Freeze v1.0) — only program data + a derived reporting view. - Customer enters by INDUSTRY, not regulation: Industry -> Domain Model -> Requirement Sources -> Requirements -> Capabilities -> ... -> Completeness. - 7-stage checklist identical for every domain (Domain Model / Requirement Sources / Capability Registry / Transition Patterns / Playbooks / Reference Scenarios / Completeness) with per-stage ownership. README generalized to the framework. - Each domain lists typical_requirement_sources + typical_certifications -> pre-onboarding capability HYPOTHESIS (the ETO insight; feeds Company 2A as inferred, never confirmed). - Backlog v1 (by customer value): 1 Industrial Automation, 2 Environmental, 3 Automotive, 4 Medical, 5 Energy. Five domain-definition shells (environmental restructured to the unified shape, law-first preserved). - Per-domain KPI is DERIVED from the real corpus (computed-not-stored; sources modelled / transition patterns / playbooks / reference scenarios), NOT a curated number. Reference suite renders maturity bars: Industrial Automation 43% (3/7 sources) leads, Environmental 0% (work ahead). Backlog (value) and KPI (corpus state) are deliberately separated. - ADR-009: Domain Knowledge Program framework. Honest known refinement: regulation-ID normalization (CRA vs Cyber Resilience Act) aliased in the KPI. 7 program-contract tests (backlog order + industry-first + derived-not-stored), check-loc 0. Knowledge data + ADR + reference harness = non-runtime -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 18:49:06 +02:00
pilotadmin	c737e1ad7d	Merge pull request 'feat: start the Environmental Knowledge Program (law-first domain, not architecture)' (#22 ) from feat/environmental-knowledge-program into main	2026-06-27 14:36:54 +02:00
Benjamin Admin	9c02c2c4a2	feat(programs): start the Environmental Knowledge Program — domains, not architecture The architecture is stable; from here the value comes from DOMAINS, not more software. Phase B is organized as law-first Domain Knowledge Programs, each delivering the same production line: Corpus -> Obligations -> Capabilities -> Transition Patterns -> Playbooks -> Reference Scenarios -> Completeness. No new runtime framework (Freeze v1.0). - knowledge/programs/README.md: reusable Domain Program blueprint (production line, per-stage ownership, law-first ordering, planned programs Environmental/Automotive/IEC62443/Functional-Safety). - knowledge/programs/environmental.yaml: the Environmental domain as DATA. Law-first: B1 Environmental Regulatory Corpus (water/chemicals/emissions/energy/waste/product-responsibility — law + obligations only) -> B2 Capability Model -> B3 Transition Patterns (ISO 14001 -> corpus, built LAST). ISO 14001 is a source state, NOT the domain. - Ownership handoffs: B1 -> Legal Knowledge, B2 -> Compliance Execution, B3+/playbooks/reference -> Reasoning. Coordinate via the board; no session builds another's artifacts. - reference suite: "Domain Knowledge Programs" section renders the program stages + a measurable Completeness baseline (6 areas, 0 assessed today) that flips automatically as stages land. - ADR-008: from architecture to domains; Phase B as law-first programs; architecture frozen. 6 program-contract tests (law-first order + ownership pinned), check-loc 0. Knowledge data + ADR + reference harness = non-runtime -> no deploy (ADR-001). No new module, no runtime change. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 14:36:03 +02:00
pilotadmin	c4e9ca8f4d	Merge pull request 'feat: Regulatory Completeness Engine (auditable coverage, not confidence)' (#21 ) from feat/regulatory-completeness into main	2026-06-27 14:17:07 +02:00
Benjamin Admin	aa99111a87	feat(completeness): Regulatory Completeness Engine — auditable coverage, not confidence Phase A½. The move from feature to product development: for every assessment, answer "how sure are we that this answer is COMPLETE?" — different from confidence. The product never claims full coverage; it makes its own knowledge state transparent and auditable. Shows what we do NOT know and why. - compliance/completeness/: assess_completeness(identified, corpus_status, uncertain, assumptions, assessed_obligations) -> CompletenessReport. Separates IDENTIFIED from ASSESSED (validated corpus AND determined applicability) and justifies every gap. Two kinds of open: corpus gap (future_corpus) and applicability uncertainty (query_required + deciding question, e.g. Data Act / generates_usage_data). - The metric is COUNTS, never a single percentage: "Identifiziert N · bewertet M · offen K · Unsicherheiten U · Begründung ja" + an honest audit statement. - ADR-007: auditable honesty; phase order A factory -> A½ Completeness -> B new domains; the transparency selling point. Deterministic, no LLM; corpus status + obligation count injected. - reference suite: "Regulatory Completeness" section runs an industrial-dishwasher assessment (assessed CRA/MaschinenVO; open EMV/Environmental=future_corpus, Data Act=query_required) and notes Environmental flips open->validated automatically once the corpus lands. 11 completeness tests (54 with adjacent modules), mypy --strict clean (15 files), check-loc 0. Product code with no app caller + ADR/reference = non-runtime -> no deploy (ADR-001). Freeze-safe. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 14:16:12 +02:00
pilotadmin	0b0d262462	Merge pull request 'feat: Knowledge Intake — classify + impact triage before extraction' (#20 ) from feat/knowledge-intake into main	2026-06-27 13:59:47 +02:00
Benjamin Admin	07e392913f	feat(knowledge-intake): classify a document + assess its impact before extraction Phase A1. The real knowledge production is not writing — it is TARGETED UPDATING: when 20 documents arrive, which 5 change our knowledge and which 15 are ignorable? Before the parser, Knowledge Intake classifies a new document (no content extraction) and intersects its signals with an index of the existing knowledge to emit a Knowledge Package (an impact analysis). - compliance/knowledge_intake/: build_knowledge_index(patterns, playbooks, reference_scenarios, obligation_index) + assess_document_impact(descriptor, index) -> KnowledgePackage. Deterministic, NO content extraction, NO LLM. Surfaces affected capabilities / playbooks / transition patterns / reference scenarios / (injected) obligations, whether it is a new domain, and a triage level (HIGH / LOW / NONE / NEW_DOMAIN) with a recommendation. - ADR-006: Knowledge Intake = classify + impact before extraction; full factory Intake -> Package -> Parser -> Draft -> Review -> Published; phase order A1 Intake / A2 Draft / A3 Review. - reference suite: "Knowledge Intake" section triages 3 example documents (CRA SBOM-FAQ -> high, 14C/2PB/3RTS/2Obl; environmental guidance -> new_domain; marketing blog -> ignorable). Section lives in _helpers.py to keep generate.py under the 500-LOC budget. - Honest known refinement surfaced by intake: regulation-ID normalization (CRA vs Cyber Resilience Act). 10 intake tests (60 with the adjacent modules), mypy --strict clean (16 files), check-loc 0. Product code with no app caller + ADR/reference = non-runtime -> no deploy (ADR-001). Freeze-safe. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 13:58:59 +02:00
pilotadmin	d51bcd77c7	Merge pull request 'feat: Knowledge Production — Playbook Draft Generator (prepare deterministically, curate)' (#19 ) from feat/knowledge-production into main	2026-06-27 13:32:26 +02:00
Benjamin Admin	b6cfc0a503	feat(knowledge-production): Playbook Draft Generator — prepare the corpus deterministically The bottleneck is not content, it is knowledge PRODUCTION. Instead of writing 200 playbooks by hand, generate drafts deterministically from data the software already owns, then have an expert review them. Mirrors the legal pipeline (Gesetz -> Parser -> Obligation -> Review) for BreakPilot's own knowledge: new Capability -> Registry -> Transition Pattern -> Playbook Draft Generator -> Expert Review -> versioned Playbook. - compliance/knowledge_production/: generate_playbook_draft(capability, requirement, control_links) + drafts_from_pattern(pattern) -> one PlaybookDraft per delta capability. Owned fields (why / closes_regulations / expected_evidence / typical_controls) are assembled with per-field provenance; the practitioner know-how (tools / process_steps / how_others) is left as an explicit TODO. - DraftStatus lifecycle (Freigabestatus): draft_generated -> in_review -> reviewed -> validated -> proven. Deterministic, NO LLM in the core (any model enrichment stays offline/advisory/propose-only). - ADR-005: extends "the engine does not change, the corpus grows" with "and the corpus is not written by hand — it is deterministically prepared, then curated". - reference suite: "Knowledge Production" section turns the convergence pattern into 12 auto-assembled drafts (why/closes/evidence filled, tools/steps TODO) -> review 12 drafts, don't write 12 playbooks. 10 tests (50 with playbook/optimization/transition/company), mypy --strict clean, check-loc 0. Product code with no app caller + ADR/reference = non-runtime -> no deploy (ADR-001). Freeze-safe. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 13:31:31 +02:00
pilotadmin	1e1689f1f2	Merge pull request 'feat: Implementation Playbooks (Berater renderer over the Capability spine)' (#18 ) from feat/implementation-playbooks into main	2026-06-27 10:38:57 +02:00
Benjamin Admin	78f0ffa9de	feat(playbook): Implementation Playbooks — the Berater renderer ("wie komme ich dort hin?") Roadmap item 4. After WHAT applies / WHAT is missing / WHICH first, the GF asks HOW. The Implementation Playbook renders, for one capability, the full journey — why / which regulations it closes / tools / process / evidence / controls — and chains the Optimization Roadmap into per-measure playbooks. Another renderer over the same Capability spine (ADR-003/004), not a new engine: ~95% of the data already exists, it just needs a different rendering. - compliance/playbook/: build_playbook() + playbooks_for_plan() (chains optimization -> playbook, acyclic; reuses leverage for "closes which regulations"). Capabilities without curated content render as honest status:missing stubs — the content-owed signal. - knowledge/implementation_playbooks/: curated knowledge layer (Reasoning Knowledge Acquisition), two deep expert drafts (SBOM, CVD/PSIRT, status draft, expert-draft-not-normative) + README. The bottleneck is now CONTENT, not software; Playbook (own knowledge) != regulatory domain. - ADR-004: Implementation Playbooks = renderer + knowledge layer; content is the bottleneck. - reference suite: "Implementation Playbook" section renders the SBOM journey + Roadmap->Playbook table (high-leverage caps flagged "fehlt (Inhalt)" — content backlog, highest leverage first). - refactor: extracted markdown helpers to reference_scenarios/_helpers.py to keep generate.py under the 500-LOC budget. 9 playbook tests (40 with optimization+transition+company), mypy --strict clean, check-loc 0. Product code with no app caller + knowledge/ADR/reference = non-runtime -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 10:38:13 +02:00
pilotadmin	50d88d611d	Merge pull request 'feat: Regulatory Optimization (Roadmap renderer over the Capability Delta)' (#17 ) from feat/regulatory-optimization into main	2026-06-27 09:50:18 +02:00
Benjamin Admin	cfafa31ea2	feat(optimization): Regulatory Optimization — Roadmap/Management renderer over the Capability Delta Roadmap item 5. GAP analysis and measure-prioritisation are the SAME computation: Required − Known = the Capability Delta. The Capability Delta Engine (RS-005) computes it once; renderers read that ONE delta. Interview Renderer (missing info → questions) was already built; this adds the Roadmap/Management Renderer (missing capabilities → measures ranked by regulatory leverage). - compliance/optimization/: regulatory_leverage() + select_within_budget() (pure leverage math) + roadmap_from_delta(assessment, ...) — the keystone binding optimization to the RS-005 delta (dependency optimization → transition_reasoning, acyclic; the delta engine stays hermetic). leverage(measure) = number of regulatory requirements it closes at once (e.g. patch management → CRA+MaschinenVO+IEC62443+ISO27001 = 4). No new corpus, no new meta-model class (freeze v1.0). - Welt-1 honesty: percentages are exact count ratios over the IDENTIFIED requirements (the known delta), never "% gesetzeskonform". - reference suite: "Regulatory Optimization" section runs the SAME convergence delta → ranked measures + budget answer + the management sentence "of N identified requirements you close M with the top-K measures (X%) — highest regulatory leverage". - ADR-003: Capability Delta Engine — one delta, many renderers; rename Gap → Capability Delta. 13 optimization tests (31 with transition+company), mypy --strict clean, check-loc 0. Product code with no app caller + ADR/reference = non-runtime → no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 09:49:38 +02:00
pilotadmin	ffff9bb592	Merge pull request 'feat: extend Reference Transition Scenarios to multi-regulation (CRA + MaschinenVO)' (#16 ) from feat/rts-multi-regulation into main	2026-06-27 09:26:40 +02:00
Benjamin Admin	a0f72fc39b	feat(rts): extend Reference Transition Scenarios to multi-regulation (CRA + MaschinenVO) Roadmap item 2: the RTS now pin MaschinenVO + convergence Expected Outcomes, so the convergence USP is a living regression, not just a one-off section. - RTS-003 (machine + ISMS, networked): full multi-regulation archetype — maschinenvo expected_delta + convergence expected_multi_target (links TP-ISO27001-CRA-MaschinenVO-v1). Generator runs the convergence pattern through RS-005: 4/4 machine-safety delta MISSING + 4/4 expected multi-target caps converge. PASS. - RTS-001 (component): MaschinenVO modeled as `uncertain` (a pure component is usually not a machine; deciding question is_safety_component) — engine must never assert it applies. Honest, parallel to the Data-Act handling. - RTS-002 (machine, QMS-only): MaschinenVO `applies` (is_machine) but LOW convergence — no ISMS means the cyber side is entirely delta, so few caps are shared. The honest contrast that the convergence USP rewards companies who already run an ISMS. - generator: per-RTS maschinenvo/convergence Soll-Ist checks; convergence pattern run once and reused. Data Act stays `uncertain` everywhere, never asserted. All 3 RTS PASS. 18 tests (transition+company), mypy --strict clean, check-loc 0. Non-runtime (knowledge + reference harness) -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 09:26:01 +02:00
pilotadmin	5fde7690a5	Merge pull request 'feat: first Regulatory Convergence Pattern (ISO27001 -> CRA + MaschinenVO)' (#15 ) from feat/regulatory-convergence-cra-maschinenvo into main	2026-06-27 09:14:56 +02:00
Benjamin Admin	66be23f0c4	feat(convergence): first Regulatory Convergence Pattern (ISO27001 -> CRA + MaschinenVO) The first multi-regulation pattern: each capability declares `covers_targets`, so we can answer the convergence USP — "which capability satisfies CRA AND MaschinenVO at once?" - knowledge: transition_pattern_iso27001_to_cra_maschinenvo_v1.yaml (pattern_type: regulatory_convergence, status draft). The cyber-safety bridge = MaschinenVO Annex III 1.1.9 "protection against corruption" overlapping CRA integrity. 4 convergence capabilities cover BOTH; 5 CRA-only; 3 MaschinenVO-only. - product: compliance/transition_reasoning/convergence.py — regulatory_convergence() pure/deterministic/computed-not-stored, no new graph/class (freeze v1.0 untouched). No app caller yet -> non-runtime, no deploy (ADR-001). - reference suite: Cross-Regulation Capability Mapping section renders the customer sentence "von N neuen Massnahmen erfuellen M gleichzeitig CRA und MaschinenVO". - README: term -> Regulatory Transition / Convergence Pattern; covers_targets documented. - tests: test_regulatory_convergence (18 transition+company pass), mypy --strict clean. Curated expert knowledge, AI first draft (L1/draft) — Annex/Article refs indicative, review_required by a machinery-safety expert. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 09:12:30 +02:00
pilotadmin	caa9b8b609	Merge pull request 'docs(knowledge): Reference Transition Scenarios (RTS) + ISO9001->CRA' (#14 ) from feat/reference-transition-scenarios into main	2026-06-27 08:46:57 +02:00
Benjamin Admin	f78e03bd0a	docs(knowledge): Reference Transition Scenarios (RTS-001..003) + ISO9001->CRA pattern Three ANONYMIZED reference transition scenarios (no real company names stored) = canonical regression scenarios that test the KNOWLEDGE, not just the engine. Each pins an Expected Outcome (expected_likely_covered + expected_delta); every commit must reproduce it (identical or better). - RTS-001 automotive supplier (TISAX+ISO27001) -> CRA: mature ISMS, standard CRA delta. - RTS-002 classic machine builder (ISO9001) -> CRA: only process discipline -> MUCH larger delta (10 missing vs 3 covered). New TP-ISO9001-CRA-v1 pattern (different shape). - RTS-003 networked machine builder (ISMS) -> CRA: highlights the Data Act. Data Act is modelled as UNCERTAIN (a hypothesis), never a fixed gilt/gilt-nicht: the generator checks the engine SURFACES the uncertainty + the deciding question (generates_usage_data) and never wrongly ASSERTS applicability. All three RTS PASS. Non-runtime knowledge + reference harness -> no deploy (ADR-001). Names deliberately absent. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 08:46:20 +02:00
pilotadmin	5412864705	Merge pull request 'docs(knowledge): TKP 4-level lifecycle + enrichments + ISMS->TISAX (genericity)' (#13 ) from feat/transition-knowledge-levels-tisax into main	2026-06-27 08:29:33 +02:00
Benjamin Admin	0da093c046	docs(knowledge): TKP 4-level lifecycle + 3 enrichments + ISMS->TISAX (genericity proof) Transition KNOWLEDGE Patterns (renamed term -- curated knowledge, not an algorithm): - 4 maturity levels: draft -> reviewed -> validated (domain expert) -> proven (field). "approved" dropped; target is validated. TP-ISO27001-CRA set to reviewed (L2). - 3 enrichments per pattern: confidence_source: relationship (curated, not an LLM estimate -> computed-not-stored); why_asked (customer-facing: why the source does not suffice here); dropped_if (what makes the question unnecessary). Applied to TP-ISO27001-CRA. - New TP-ISMS-TISAX (draft): different character -- info-security module mostly covered; delta is automotive-specific (prototype protection, TISAX labels, VDA ISA self-assessment, ENX assessment, Art. 28 data protection). Proves the architecture is GENERIC, not CRA-tailored. - Reference scenario 4 generalized to loop over ALL patterns through RS-005: both carried (CRA 17->17, TISAX 13->13) -> a living genericity + regression test for every future pattern. Non-runtime knowledge + reference harness -> no deploy (ADR-001). Next: ISO9001->IATF16949. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 08:29:30 +02:00
pilotadmin	3199d0d90e	Merge pull request 'docs(knowledge): TP-ISO27001->CRA gold standard + RS-005 reference scenario' (#12 ) from feat/transition-pattern-gold-standard into main	2026-06-27 08:12:32 +02:00
Benjamin Admin	4bfd552da7	docs(knowledge): TP-ISO27001->CRA gold standard + reference scenario (RS-005 regression) (1) Harden the first Transition Pattern to the gold-standard template per quality checklist: versioned transition_goal (ISO27001:2022 -> CRA, applies 2027-12-11), source_state_variants (certified/isms_introduced/expired/limited_scope), each likely_covered assumption with a typed relationship (supports\|partially_supports, never equivalent) + verification + rationale (the Warum) + an auditor-checkable reviewable_claim, delta as missing-capability + needed-info, an explicit rejected_assumptions section, and a determinism_goal. README schema updated to match. (2) New Reference-Suite scenario 4 (Transition): the generator READS the pattern YAML and runs it through the RS-005 Planning Engine + Company 2A -> coverage + question requests. Proves the architecture fully carries the pattern (17 caps -> 17 coverage + 17 requests; 9 HIGH delta = the real CRA gaps, 8 probably-covered from the ISMS). Now a living regression test: every future pattern runs through the same engine. Non-runtime knowledge + reference harness -> no deploy (ADR-001). Next: ISMS->TISAX once approved. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 08:11:42 +02:00
pilotadmin	cb18eac7ec	Merge pull request 'docs(knowledge): Transition Pattern ISO27001->CRA v1 (knowledge base)' (#11 ) from feat/knowledge-transition-iso27001-cra into main	2026-06-27 07:51:37 +02:00
Benjamin Admin	bea8559f78	docs(knowledge): first Transition Pattern ISO27001 -> CRA (curated knowledge base) Reasoning session's new Knowledge Acquisition responsibility (re-charter): build and curate the Transition Knowledge Base under backend-compliance/knowledge/transition_patterns/ (beside reasoning/, not under it -- it is knowledge, not an engine). First professional pattern TP-ISO27001-CRA-v1 (status: draft): separates what a mature ISMS likely covers at the ORG level (probably_covered, needs product-level confirmation, never auto-"erfuellt") from the CRA-specific delta with no ISO 27001 analogue (SBOM, support period + secure signed updates, coordinated vulnerability disclosure, Art. 14 authority reporting, product cyber risk assessment, CE conformity / technical documentation). Expert draft, not a normative proof; review_required before customer use. Non-runtime knowledge -> no deploy (ADR-001). Next: ISMS->TISAX. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:50:42 +02:00
pilotadmin	81f8b56b48	Merge pull request 'feat(transition): RS-005 v0 Transition Planning Engine (+ spec v1.3)' (#10 ) from feat/transition-reasoning-v0 into main	2026-06-27 07:37:53 +02:00
Benjamin Admin	db2efe9f52	docs(spec): Transition Reasoning v1.3 — Planning Engine / QuestionRequest / Renderer split Aligns the spec with RS-005 v0: the Transition Planning Engine owns the INFORMATION GAPS (TransitionQuestionRequest), not the questions. Chain: Planning Engine -> TransitionQuestionRequest -> Question Renderer (RS-005.1) -> Interview. RS-005.1 (renderer/templates) deliberately deferred; GeneratedQuestion reframed as the renderer's output (a swappable policy layer), not part of the engine. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:37:50 +02:00
Benjamin Admin	77de7e794c	feat(transition): Transition Reasoning v0 (RS-005) — Transition Planning Engine Second reasoning mode, scope per user: the engine owns the INFORMATION GAPS, not the questions. assess_transition(context, target_requirements, company_profile) emits ranked TransitionQuestionRequest {capability, control, reason, question_intent, expected_evidence, priority, information_gain} -- NOT rendered question text. Rendering (intent+subject->sentence) is a separate swappable layer (RS-005.1), not here. Consumes the Company Capability Profile (2A) as "have" + injected TargetRequirement (Execution-owned placeholder) as "required" -- no required-capability data in product code (EMPTY_REQUIREMENTS, mocks only in tests). A certification-derived capability is probably_covered (Welt 1) -> a confirmation request, never already_covered/"erfuellt". Deterministic, computed-not-stored, no percentages. Activates 2A/2C/RCI (first consumer of the Company profile). Freeze-respecting: additive package, no new graph/base class/meta-model class. 9 tests, mypy --strict clean, LOC ok. No endpoint/UI/RAG; question rendering deliberately deferred to RS-005.1. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:31:11 +02:00
pilotadmin	5e735e9e56	Merge pull request 'docs(spec): Transition Reasoning v1.2 (generate-from-controls + AI-drafted curated library)' (#9 ) from feat/transition-reasoning-v11 into main	2026-06-27 07:12:35 +02:00
Benjamin Admin	24fdde89c6	docs(spec): Transition Reasoning v1.2 — questions generated from controls + AI-drafted curated library v1.1: interview questions are GENERATED from the existing (Master) Controls, not hand-written. Three building blocks: Control->question_intent (corpus/Execution), ~30-40 Master Question Templates (Reasoning), Transition-Prioritization (certs decide which generated questions can be skipped; 217->19 funnel, reuses Company 2A + cert map). v1.2: knowledge production. LLMs produce the first expert DRAFT (the prioritization per transition); BreakPilot reviews + versions + OWNS the canonical library (in Git, not the AI; model-independent, MDQ-00127 v4). Offline multi-model workflow, NOT runtime (deterministic-first: LLM offline-propose, never online-mutate). Hard boundary: the library is an expert DRAFT, not a normative/legal proof -- "cert probably covers X" is Welt-1 (ClaimCoverage), never "erfuellt" (anti-fake-evidence). Reframes the 100 seed questions as validation/template-extraction set. Spec only, no code; non-runtime docs -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:11:53 +02:00
pilotadmin	f3d3255de1	Merge pull request 'docs(spec): Transition Reasoning v1 + MDQ Registry + ADR-002' (#8 ) from feat/transition-reasoning-spec into main	2026-06-27 07:03:43 +02:00
Benjamin Admin	fe21c2f487	docs(spec): Transition Reasoning spec v1 + MDQ Registry + ADR-002 Second reasoning mode (extends, does not replace): BreakPilot answers MIGRATION questions (start state -> target state -> delta), not regulation Q&A. New package compliance/transition_reasoning/ (spec only). Transition Reasoning is RCI generalized; reuses Company 2A (have), Master Capability Registry (MCAP) and RCI. MDQ Registry = 4th identity-machine instance (after Master Controls/Obligations/ Capabilities): every Master Delta Question is a versioned, identifiable knowledge unit (verifies MCAP, supports obligations, transition patterns, evidence types, information gain, confidence impact, follow-up). Transition Patterns hold only MDQ references -> reuse across transitions. Delta interview = information-gain optimization, not a sequential questionnaire. ADR-002: transitions are DATA (patterns + capability/MDQ knowledge), never engine or metamodel extensions. 100 seed questions captured as v1. Spec only (no code; freeze-respecting: additive package, no new graph/base class/ meta-model class). Non-runtime docs -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:03:42 +02:00

1 2 3 4 5 ...

1634 Commits