breakpilot-compliance

Author	SHA1	Message	Date
pilotadmin	d51bcd77c7	Merge pull request 'feat: Knowledge Production — Playbook Draft Generator (prepare deterministically, curate)' (#19 ) from feat/knowledge-production into main	2026-06-27 13:32:26 +02:00
Benjamin Admin	b6cfc0a503	feat(knowledge-production): Playbook Draft Generator — prepare the corpus deterministically The bottleneck is not content, it is knowledge PRODUCTION. Instead of writing 200 playbooks by hand, generate drafts deterministically from data the software already owns, then have an expert review them. Mirrors the legal pipeline (Gesetz -> Parser -> Obligation -> Review) for BreakPilot's own knowledge: new Capability -> Registry -> Transition Pattern -> Playbook Draft Generator -> Expert Review -> versioned Playbook. - compliance/knowledge_production/: generate_playbook_draft(capability, requirement, control_links) + drafts_from_pattern(pattern) -> one PlaybookDraft per delta capability. Owned fields (why / closes_regulations / expected_evidence / typical_controls) are assembled with per-field provenance; the practitioner know-how (tools / process_steps / how_others) is left as an explicit TODO. - DraftStatus lifecycle (Freigabestatus): draft_generated -> in_review -> reviewed -> validated -> proven. Deterministic, NO LLM in the core (any model enrichment stays offline/advisory/propose-only). - ADR-005: extends "the engine does not change, the corpus grows" with "and the corpus is not written by hand — it is deterministically prepared, then curated". - reference suite: "Knowledge Production" section turns the convergence pattern into 12 auto-assembled drafts (why/closes/evidence filled, tools/steps TODO) -> review 12 drafts, don't write 12 playbooks. 10 tests (50 with playbook/optimization/transition/company), mypy --strict clean, check-loc 0. Product code with no app caller + ADR/reference = non-runtime -> no deploy (ADR-001). Freeze-safe. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 13:31:31 +02:00
pilotadmin	1e1689f1f2	Merge pull request 'feat: Implementation Playbooks (Berater renderer over the Capability spine)' (#18 ) from feat/implementation-playbooks into main	2026-06-27 10:38:57 +02:00
Benjamin Admin	78f0ffa9de	feat(playbook): Implementation Playbooks — the Berater renderer ("wie komme ich dort hin?") Roadmap item 4. After WHAT applies / WHAT is missing / WHICH first, the GF asks HOW. The Implementation Playbook renders, for one capability, the full journey — why / which regulations it closes / tools / process / evidence / controls — and chains the Optimization Roadmap into per-measure playbooks. Another renderer over the same Capability spine (ADR-003/004), not a new engine: ~95% of the data already exists, it just needs a different rendering. - compliance/playbook/: build_playbook() + playbooks_for_plan() (chains optimization -> playbook, acyclic; reuses leverage for "closes which regulations"). Capabilities without curated content render as honest status:missing stubs — the content-owed signal. - knowledge/implementation_playbooks/: curated knowledge layer (Reasoning Knowledge Acquisition), two deep expert drafts (SBOM, CVD/PSIRT, status draft, expert-draft-not-normative) + README. The bottleneck is now CONTENT, not software; Playbook (own knowledge) != regulatory domain. - ADR-004: Implementation Playbooks = renderer + knowledge layer; content is the bottleneck. - reference suite: "Implementation Playbook" section renders the SBOM journey + Roadmap->Playbook table (high-leverage caps flagged "fehlt (Inhalt)" — content backlog, highest leverage first). - refactor: extracted markdown helpers to reference_scenarios/_helpers.py to keep generate.py under the 500-LOC budget. 9 playbook tests (40 with optimization+transition+company), mypy --strict clean, check-loc 0. Product code with no app caller + knowledge/ADR/reference = non-runtime -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 10:38:13 +02:00
pilotadmin	50d88d611d	Merge pull request 'feat: Regulatory Optimization (Roadmap renderer over the Capability Delta)' (#17 ) from feat/regulatory-optimization into main	2026-06-27 09:50:18 +02:00
Benjamin Admin	cfafa31ea2	feat(optimization): Regulatory Optimization — Roadmap/Management renderer over the Capability Delta Roadmap item 5. GAP analysis and measure-prioritisation are the SAME computation: Required − Known = the Capability Delta. The Capability Delta Engine (RS-005) computes it once; renderers read that ONE delta. Interview Renderer (missing info → questions) was already built; this adds the Roadmap/Management Renderer (missing capabilities → measures ranked by regulatory leverage). - compliance/optimization/: regulatory_leverage() + select_within_budget() (pure leverage math) + roadmap_from_delta(assessment, ...) — the keystone binding optimization to the RS-005 delta (dependency optimization → transition_reasoning, acyclic; the delta engine stays hermetic). leverage(measure) = number of regulatory requirements it closes at once (e.g. patch management → CRA+MaschinenVO+IEC62443+ISO27001 = 4). No new corpus, no new meta-model class (freeze v1.0). - Welt-1 honesty: percentages are exact count ratios over the IDENTIFIED requirements (the known delta), never "% gesetzeskonform". - reference suite: "Regulatory Optimization" section runs the SAME convergence delta → ranked measures + budget answer + the management sentence "of N identified requirements you close M with the top-K measures (X%) — highest regulatory leverage". - ADR-003: Capability Delta Engine — one delta, many renderers; rename Gap → Capability Delta. 13 optimization tests (31 with transition+company), mypy --strict clean, check-loc 0. Product code with no app caller + ADR/reference = non-runtime → no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 09:49:38 +02:00
pilotadmin	ffff9bb592	Merge pull request 'feat: extend Reference Transition Scenarios to multi-regulation (CRA + MaschinenVO)' (#16 ) from feat/rts-multi-regulation into main	2026-06-27 09:26:40 +02:00
Benjamin Admin	a0f72fc39b	feat(rts): extend Reference Transition Scenarios to multi-regulation (CRA + MaschinenVO) Roadmap item 2: the RTS now pin MaschinenVO + convergence Expected Outcomes, so the convergence USP is a living regression, not just a one-off section. - RTS-003 (machine + ISMS, networked): full multi-regulation archetype — maschinenvo expected_delta + convergence expected_multi_target (links TP-ISO27001-CRA-MaschinenVO-v1). Generator runs the convergence pattern through RS-005: 4/4 machine-safety delta MISSING + 4/4 expected multi-target caps converge. PASS. - RTS-001 (component): MaschinenVO modeled as `uncertain` (a pure component is usually not a machine; deciding question is_safety_component) — engine must never assert it applies. Honest, parallel to the Data-Act handling. - RTS-002 (machine, QMS-only): MaschinenVO `applies` (is_machine) but LOW convergence — no ISMS means the cyber side is entirely delta, so few caps are shared. The honest contrast that the convergence USP rewards companies who already run an ISMS. - generator: per-RTS maschinenvo/convergence Soll-Ist checks; convergence pattern run once and reused. Data Act stays `uncertain` everywhere, never asserted. All 3 RTS PASS. 18 tests (transition+company), mypy --strict clean, check-loc 0. Non-runtime (knowledge + reference harness) -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 09:26:01 +02:00
pilotadmin	5fde7690a5	Merge pull request 'feat: first Regulatory Convergence Pattern (ISO27001 -> CRA + MaschinenVO)' (#15 ) from feat/regulatory-convergence-cra-maschinenvo into main	2026-06-27 09:14:56 +02:00
Benjamin Admin	66be23f0c4	feat(convergence): first Regulatory Convergence Pattern (ISO27001 -> CRA + MaschinenVO) The first multi-regulation pattern: each capability declares `covers_targets`, so we can answer the convergence USP — "which capability satisfies CRA AND MaschinenVO at once?" - knowledge: transition_pattern_iso27001_to_cra_maschinenvo_v1.yaml (pattern_type: regulatory_convergence, status draft). The cyber-safety bridge = MaschinenVO Annex III 1.1.9 "protection against corruption" overlapping CRA integrity. 4 convergence capabilities cover BOTH; 5 CRA-only; 3 MaschinenVO-only. - product: compliance/transition_reasoning/convergence.py — regulatory_convergence() pure/deterministic/computed-not-stored, no new graph/class (freeze v1.0 untouched). No app caller yet -> non-runtime, no deploy (ADR-001). - reference suite: Cross-Regulation Capability Mapping section renders the customer sentence "von N neuen Massnahmen erfuellen M gleichzeitig CRA und MaschinenVO". - README: term -> Regulatory Transition / Convergence Pattern; covers_targets documented. - tests: test_regulatory_convergence (18 transition+company pass), mypy --strict clean. Curated expert knowledge, AI first draft (L1/draft) — Annex/Article refs indicative, review_required by a machinery-safety expert. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 09:12:30 +02:00
pilotadmin	caa9b8b609	Merge pull request 'docs(knowledge): Reference Transition Scenarios (RTS) + ISO9001->CRA' (#14 ) from feat/reference-transition-scenarios into main	2026-06-27 08:46:57 +02:00
Benjamin Admin	f78e03bd0a	docs(knowledge): Reference Transition Scenarios (RTS-001..003) + ISO9001->CRA pattern Three ANONYMIZED reference transition scenarios (no real company names stored) = canonical regression scenarios that test the KNOWLEDGE, not just the engine. Each pins an Expected Outcome (expected_likely_covered + expected_delta); every commit must reproduce it (identical or better). - RTS-001 automotive supplier (TISAX+ISO27001) -> CRA: mature ISMS, standard CRA delta. - RTS-002 classic machine builder (ISO9001) -> CRA: only process discipline -> MUCH larger delta (10 missing vs 3 covered). New TP-ISO9001-CRA-v1 pattern (different shape). - RTS-003 networked machine builder (ISMS) -> CRA: highlights the Data Act. Data Act is modelled as UNCERTAIN (a hypothesis), never a fixed gilt/gilt-nicht: the generator checks the engine SURFACES the uncertainty + the deciding question (generates_usage_data) and never wrongly ASSERTS applicability. All three RTS PASS. Non-runtime knowledge + reference harness -> no deploy (ADR-001). Names deliberately absent. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 08:46:20 +02:00
pilotadmin	5412864705	Merge pull request 'docs(knowledge): TKP 4-level lifecycle + enrichments + ISMS->TISAX (genericity)' (#13 ) from feat/transition-knowledge-levels-tisax into main	2026-06-27 08:29:33 +02:00
Benjamin Admin	0da093c046	docs(knowledge): TKP 4-level lifecycle + 3 enrichments + ISMS->TISAX (genericity proof) Transition KNOWLEDGE Patterns (renamed term -- curated knowledge, not an algorithm): - 4 maturity levels: draft -> reviewed -> validated (domain expert) -> proven (field). "approved" dropped; target is validated. TP-ISO27001-CRA set to reviewed (L2). - 3 enrichments per pattern: confidence_source: relationship (curated, not an LLM estimate -> computed-not-stored); why_asked (customer-facing: why the source does not suffice here); dropped_if (what makes the question unnecessary). Applied to TP-ISO27001-CRA. - New TP-ISMS-TISAX (draft): different character -- info-security module mostly covered; delta is automotive-specific (prototype protection, TISAX labels, VDA ISA self-assessment, ENX assessment, Art. 28 data protection). Proves the architecture is GENERIC, not CRA-tailored. - Reference scenario 4 generalized to loop over ALL patterns through RS-005: both carried (CRA 17->17, TISAX 13->13) -> a living genericity + regression test for every future pattern. Non-runtime knowledge + reference harness -> no deploy (ADR-001). Next: ISO9001->IATF16949. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 08:29:30 +02:00
pilotadmin	3199d0d90e	Merge pull request 'docs(knowledge): TP-ISO27001->CRA gold standard + RS-005 reference scenario' (#12 ) from feat/transition-pattern-gold-standard into main	2026-06-27 08:12:32 +02:00
Benjamin Admin	4bfd552da7	docs(knowledge): TP-ISO27001->CRA gold standard + reference scenario (RS-005 regression) (1) Harden the first Transition Pattern to the gold-standard template per quality checklist: versioned transition_goal (ISO27001:2022 -> CRA, applies 2027-12-11), source_state_variants (certified/isms_introduced/expired/limited_scope), each likely_covered assumption with a typed relationship (supports\|partially_supports, never equivalent) + verification + rationale (the Warum) + an auditor-checkable reviewable_claim, delta as missing-capability + needed-info, an explicit rejected_assumptions section, and a determinism_goal. README schema updated to match. (2) New Reference-Suite scenario 4 (Transition): the generator READS the pattern YAML and runs it through the RS-005 Planning Engine + Company 2A -> coverage + question requests. Proves the architecture fully carries the pattern (17 caps -> 17 coverage + 17 requests; 9 HIGH delta = the real CRA gaps, 8 probably-covered from the ISMS). Now a living regression test: every future pattern runs through the same engine. Non-runtime knowledge + reference harness -> no deploy (ADR-001). Next: ISMS->TISAX once approved. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 08:11:42 +02:00
pilotadmin	cb18eac7ec	Merge pull request 'docs(knowledge): Transition Pattern ISO27001->CRA v1 (knowledge base)' (#11 ) from feat/knowledge-transition-iso27001-cra into main	2026-06-27 07:51:37 +02:00
Benjamin Admin	bea8559f78	docs(knowledge): first Transition Pattern ISO27001 -> CRA (curated knowledge base) Reasoning session's new Knowledge Acquisition responsibility (re-charter): build and curate the Transition Knowledge Base under backend-compliance/knowledge/transition_patterns/ (beside reasoning/, not under it -- it is knowledge, not an engine). First professional pattern TP-ISO27001-CRA-v1 (status: draft): separates what a mature ISMS likely covers at the ORG level (probably_covered, needs product-level confirmation, never auto-"erfuellt") from the CRA-specific delta with no ISO 27001 analogue (SBOM, support period + secure signed updates, coordinated vulnerability disclosure, Art. 14 authority reporting, product cyber risk assessment, CE conformity / technical documentation). Expert draft, not a normative proof; review_required before customer use. Non-runtime knowledge -> no deploy (ADR-001). Next: ISMS->TISAX. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:50:42 +02:00
pilotadmin	81f8b56b48	Merge pull request 'feat(transition): RS-005 v0 Transition Planning Engine (+ spec v1.3)' (#10 ) from feat/transition-reasoning-v0 into main	2026-06-27 07:37:53 +02:00
Benjamin Admin	db2efe9f52	docs(spec): Transition Reasoning v1.3 — Planning Engine / QuestionRequest / Renderer split Aligns the spec with RS-005 v0: the Transition Planning Engine owns the INFORMATION GAPS (TransitionQuestionRequest), not the questions. Chain: Planning Engine -> TransitionQuestionRequest -> Question Renderer (RS-005.1) -> Interview. RS-005.1 (renderer/templates) deliberately deferred; GeneratedQuestion reframed as the renderer's output (a swappable policy layer), not part of the engine. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:37:50 +02:00
Benjamin Admin	77de7e794c	feat(transition): Transition Reasoning v0 (RS-005) — Transition Planning Engine Second reasoning mode, scope per user: the engine owns the INFORMATION GAPS, not the questions. assess_transition(context, target_requirements, company_profile) emits ranked TransitionQuestionRequest {capability, control, reason, question_intent, expected_evidence, priority, information_gain} -- NOT rendered question text. Rendering (intent+subject->sentence) is a separate swappable layer (RS-005.1), not here. Consumes the Company Capability Profile (2A) as "have" + injected TargetRequirement (Execution-owned placeholder) as "required" -- no required-capability data in product code (EMPTY_REQUIREMENTS, mocks only in tests). A certification-derived capability is probably_covered (Welt 1) -> a confirmation request, never already_covered/"erfuellt". Deterministic, computed-not-stored, no percentages. Activates 2A/2C/RCI (first consumer of the Company profile). Freeze-respecting: additive package, no new graph/base class/meta-model class. 9 tests, mypy --strict clean, LOC ok. No endpoint/UI/RAG; question rendering deliberately deferred to RS-005.1. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:31:11 +02:00
pilotadmin	5e735e9e56	Merge pull request 'docs(spec): Transition Reasoning v1.2 (generate-from-controls + AI-drafted curated library)' (#9 ) from feat/transition-reasoning-v11 into main	2026-06-27 07:12:35 +02:00
Benjamin Admin	24fdde89c6	docs(spec): Transition Reasoning v1.2 — questions generated from controls + AI-drafted curated library v1.1: interview questions are GENERATED from the existing (Master) Controls, not hand-written. Three building blocks: Control->question_intent (corpus/Execution), ~30-40 Master Question Templates (Reasoning), Transition-Prioritization (certs decide which generated questions can be skipped; 217->19 funnel, reuses Company 2A + cert map). v1.2: knowledge production. LLMs produce the first expert DRAFT (the prioritization per transition); BreakPilot reviews + versions + OWNS the canonical library (in Git, not the AI; model-independent, MDQ-00127 v4). Offline multi-model workflow, NOT runtime (deterministic-first: LLM offline-propose, never online-mutate). Hard boundary: the library is an expert DRAFT, not a normative/legal proof -- "cert probably covers X" is Welt-1 (ClaimCoverage), never "erfuellt" (anti-fake-evidence). Reframes the 100 seed questions as validation/template-extraction set. Spec only, no code; non-runtime docs -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:11:53 +02:00
pilotadmin	f3d3255de1	Merge pull request 'docs(spec): Transition Reasoning v1 + MDQ Registry + ADR-002' (#8 ) from feat/transition-reasoning-spec into main	2026-06-27 07:03:43 +02:00
Benjamin Admin	fe21c2f487	docs(spec): Transition Reasoning spec v1 + MDQ Registry + ADR-002 Second reasoning mode (extends, does not replace): BreakPilot answers MIGRATION questions (start state -> target state -> delta), not regulation Q&A. New package compliance/transition_reasoning/ (spec only). Transition Reasoning is RCI generalized; reuses Company 2A (have), Master Capability Registry (MCAP) and RCI. MDQ Registry = 4th identity-machine instance (after Master Controls/Obligations/ Capabilities): every Master Delta Question is a versioned, identifiable knowledge unit (verifies MCAP, supports obligations, transition patterns, evidence types, information gain, confidence impact, follow-up). Transition Patterns hold only MDQ references -> reuse across transitions. Delta interview = information-gain optimization, not a sequential questionnaire. ADR-002: transitions are DATA (patterns + capability/MDQ knowledge), never engine or metamodel extensions. 100 seed questions captured as v1. Spec only (no code; freeze-respecting: additive package, no new graph/base class/ meta-model class). Non-runtime docs -> no deploy (ADR-001). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 07:03:42 +02:00
pilotadmin	e4695cf289	Merge pull request 'docs(adr): ADR-001 Runtime Deploy Policy' (#7 ) from feat/adr-runtime-deploy-policy into main	2026-06-27 06:51:31 +02:00
Benjamin Admin	d72dcbacfb	docs(adr): ADR-001 Runtime Deploy Policy A dev deploy must always have a verifiable runtime effect. Deploy only on runtime/API/data-model/reasoning/security changes; docs, reference suites, ADRs, board and ownership texts are merged to origin/main but NOT pushed to dev (no Orca build). Keeps the CI/CD history meaningful: every build == a runtime change. Architecture/release decision (not a developer convention) -> own folder docs-src/architecture/adr/. Non-runtime: this commit triggers no deploy, per its own policy. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-27 06:51:00 +02:00
pilotadmin	8a51db92ed	Merge pull request 'feat: reference scenario suite v1' (#6 ) from feat/reference-scenario-suite into main	2026-06-26 23:07:50 +02:00
Benjamin Admin	16371f2909	feat(reference): Reference Scenario Suite v1 (living regression reference, not docs) Three real customer scenarios driven through the DEPLOYED engines (scope/map/ interpretation, RCI, company 2A, capability registry). Each scenario emits an Architecture Coverage table DERIVED from the real run, so cells flip automatically as domains land (e.g. Sz2/Environmental UNSUPPORTED -> PASS). The roll-up answers "is BreakPilot better than six months ago" by real customer situations, not LOC. Gaps captured as epics (NOT implemented): RS-001 Interpretation Pattern Library, RS-002 Environmental Corpus, RS-003 Capability Linking (cap<->MCAP) + Company-Gap, RS-004 MaschinenVO/EMV Registry Linking. reference_scenarios/generate.py = reproducible source (ruff/mypy-exempt, NOT product code, not imported by the app); reference_scenario_suite_v1.md = generated artifact. No new product code; CRA patterns deliberately NOT built — the suite is now the measure. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 22:48:27 +02:00
Benjamin Admin	c7339e68df	docs: Architekturprinzip — Ownership auf BEZIEHUNGEN, nicht Knoten + cap.=kanonische ID User-Reframe (die eigentliche Reife): nicht „Session X besitzt Knoten Y", sondern jede Session besitzt KANTEN. Edge-Ownership-Tabelle: Feature/Cert->Cap = S3 · Cap->Obligation/Procedure/ Control/Evidence = S2 · Citation-Span->Legal-Basis = S1. Kein Owner hält alle ein+ausgehenden Kanten eines Knotens. `cap.` = kanonische ID auf obligation_id-Niveau. Capability = EINZIGER Knoten über 3 Welten (Recht/Produkt/Nachweis) = semantischer Mittelpunkt. Künftiger Vertrag: Confidence/Disambiguierung bei mehreren Capabilities = Domaene 3, Domaene 2 vertraut geliefertem cap.X. Domaene 2 ruht stabil bis Wake-up. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 22:11:43 +02:00
Benjamin Admin	06efb9e61b	Merge origin/main (`ed64d929`) in ownership-resolution + reasoning#1	2026-06-26 21:55:02 +02:00
Benjamin Admin	aaacec087c	feat: Ownership-Konflikt #1 RESOLVED (Capability = geteilter Knoten) + Reasoning#1 Re-Link User-Entscheidung: Feature->Capability + Certificate->Capability = Session 3 (Domaene 3), NICHT Compliance. Capability = GETEILTER Knoten: eingehende Kanten (Feature/Cert->Cap) = Domaene 3 · Knoten+IDs+ausgehende Kanten (Cap->Obligation/Procedure/Control/Evidence) = Domaene 2. Expliziter Vertrag: „Domaene 2 besitzt NIE Wissen, welche Produkte/Zertifikate welche Capabilities brauchen." + Ownership-Tabelle in session_ownership_model_v1.md. Reasoning#1 (Domaene 2, Registry-Kanonisierung): obligations/proposed_obligation_canonical_map.json — 5 machine_* -> cra_machinery (re-link, Ziele validiert), 7 data_act_/cra_ pending (Regulierung nicht geschnitten). RE-LINK, kein Re-Mint. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 21:54:07 +02:00
pilotadmin	ed64d92904	Merge pull request 'feat: master capability registry foundation' (#5 ) from feat/master-capability-registry into main CI / detect-changes (push) Successful in 11s Details CI / branch-name (push) Has been skipped Details CI / guardrail-integrity (push) Has been skipped Details CI / secret-scan (push) Has been skipped Details CI / dep-audit (push) Has been skipped Details CI / sbom-scan (push) Has been skipped Details CI / build-sha-integrity (push) Successful in 7s Details CI / validate-canonical-controls (push) Successful in 6s Details CI / loc-budget (push) Successful in 17s Details CI / go-lint (push) Has been skipped Details CI / python-lint (push) Has been skipped Details CI / nodejs-lint (push) Has been skipped Details CI / nodejs-build (push) Has been skipped Details CI / test-go (push) Has been skipped Details CI / iace-gt-coverage (push) Has been skipped Details CI / test-python-backend (push) Successful in 26s Details CI / test-python-document-crawler (push) Has been skipped Details CI / test-python-dsms-gateway (push) Has been skipped Details	2026-06-26 21:50:42 +02:00
Benjamin Admin	6ccc6c87c1	feat(capability): Master Capability Registry v0 (Phase 2C, Compliance Execution domain) Third instance of the identity-machine pattern (after Master Controls and Master Obligations). New compliance/capability/ package: MasterCapability with stable MCAP ids, CapabilityCandidate minting, seven typed relation types, a VERSIONED derivation policy, and identity lifecycle (merge/split/deprecate/redirect with provenance). Stored: identities, sources, relationship types, policy versions, lifecycle events, provenance. Derived (never stored): confidence/status via evaluate_relation under a policy version. Hard rule (structurally guarded): a certification alone can never yield CONFIRMED — only CONFIRMS + concrete artifact (or expert) does. Built from the Reasoning session per user directive but this IS the Compliance Execution model (Execution owns Capability) — handed off via the board. Metadata-first: CapabilityRelation is registry metadata, NOT a new meta-model class (freeze v1.0 untouched). No Company-Gap, no real ISO/cert mappings, no UI/RAG, no generic canonicalization engine. 11 tests; mypy --strict clean; LOC ok. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 21:35:12 +02:00
pilotadmin	7eb7f61483	Merge pull request 'feat: company capability profile foundation' (#4 ) from feat/company-intelligence-2a into main CI / detect-changes (push) Successful in 14s Details CI / branch-name (push) Has been skipped Details CI / guardrail-integrity (push) Has been skipped Details CI / secret-scan (push) Has been skipped Details CI / dep-audit (push) Has been skipped Details CI / sbom-scan (push) Has been skipped Details CI / build-sha-integrity (push) Successful in 10s Details CI / validate-canonical-controls (push) Successful in 5s Details CI / loc-budget (push) Successful in 20s Details CI / go-lint (push) Has been skipped Details CI / python-lint (push) Has been skipped Details CI / nodejs-lint (push) Has been skipped Details CI / nodejs-build (push) Has been skipped Details CI / test-go (push) Has been skipped Details CI / iace-gt-coverage (push) Has been skipped Details CI / test-python-backend (push) Successful in 23s Details CI / test-python-document-crawler (push) Has been skipped Details CI / test-python-dsms-gateway (push) Has been skipped Details	2026-06-26 15:13:21 +02:00
Benjamin Admin	8c893ca783	feat(company): Company Intelligence 2A — Company Capability Profile foundation HEAD of the spine Company->Capability->Product->Regulation->Obligation->Procedure ->Evidence. New compliance/company/ package: CompanyContext container + a four-state trust model (declared/inferred/confirmed/unknown). Hard rule (structural): a certification yields at most an INFERRED candidate and is never auto-treated as CONFIRMED/"erfuellt". A certification produces evidence-of- capability; only real ExistingEvidence promotes a capability to CONFIRMED. Ownership: Reasoning owns the container + trust-state; the Certification->Capability mapping is Execution's domain, consumed via an injected contract. No mapping data in product code (tests inject mocks). No endpoint/UI/RAG/new regs/controls; no meta-model classes (freeze v1.0 untouched). 8 tests; mypy --strict clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 14:59:42 +02:00
pilotadmin	d1383227b2	Merge pull request 'feat: regulatory change intelligence foundation' (#3 ) from feat/regulatory-change-intelligence into main CI / detect-changes (push) Successful in 12s Details CI / branch-name (push) Has been skipped Details CI / guardrail-integrity (push) Has been skipped Details CI / secret-scan (push) Has been skipped Details CI / dep-audit (push) Has been skipped Details CI / sbom-scan (push) Has been skipped Details CI / build-sha-integrity (push) Successful in 8s Details CI / validate-canonical-controls (push) Successful in 5s Details CI / loc-budget (push) Successful in 21s Details CI / go-lint (push) Has been skipped Details CI / python-lint (push) Has been skipped Details CI / nodejs-lint (push) Has been skipped Details CI / nodejs-build (push) Has been skipped Details CI / test-go (push) Has been skipped Details CI / iace-gt-coverage (push) Has been skipped Details CI / test-python-backend (push) Successful in 24s Details CI / test-python-document-crawler (push) Has been skipped Details CI / test-python-dsms-gateway (push) Has been skipped Details	2026-06-26 14:01:48 +02:00
Benjamin Admin	a5687bbc65	feat(rci): Regulatory Change Intelligence foundation (delta over the stored map) RCI/Delta as a read-/reasoning layer ON TOP of the product-first pipeline. Answers "what changes relative to my existing Regulatory Map?" — NOT "what does the new law say in general". No UI, no ingestion (newsletter/mailbox), no RAG, no new regulations/controls, no legal evaluation outside the stored map. - 4 core objects (compliance/rci/schemas.py): ComplianceBaseline (snapshot of profile + map + registry obligations + required/present evidence), RegulatoryChange (simulated/provided INPUT), ObligationDelta (delta_type NEW\|CHANGED\|REMOVED\| ALREADY_COVERED\|NEEDS_REVIEW\|NOT_APPLICABLE), ChangeImpactSummary. delta_type is a THIRD vocabulary, disjoint from ClaimCoverage (Welt 1) and ComplianceStatus (Welt 2). - create_baseline() snapshots the existing pipeline once; assess_change() computes deltas deterministically against the snapshot (no re-evaluation). - 12 tests = the 5 acceptance questions (affects product? new/changed? already covered by evidence? needs human review? not relevant?) + repeal/uncertain-reg/ missing-evidence/boundary. Existing pipeline tests stay green; mypy clean; LOC ok. - App/reasoning types only — no compliance-meta-model classes (freeze v1.0 untouched). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 13:45:23 +02:00
pilotadmin	da466b3821	Merge pull request 'feat(ai-sdk): IACE hazard-engine quality + offline proposer (Session 4)' (#2 ) from feat/iace-gt-warewashing into main CI / detect-changes (push) Successful in 10s Details CI / branch-name (push) Has been skipped Details CI / guardrail-integrity (push) Has been skipped Details CI / secret-scan (push) Has been skipped Details CI / dep-audit (push) Has been skipped Details CI / sbom-scan (push) Has been skipped Details CI / build-sha-integrity (push) Successful in 7s Details CI / validate-canonical-controls (push) Successful in 8s Details CI / loc-budget (push) Successful in 21s Details CI / go-lint (push) Has been skipped Details CI / python-lint (push) Has been skipped Details CI / nodejs-lint (push) Has been skipped Details CI / nodejs-build (push) Has been skipped Details CI / test-go (push) Successful in 1m1s Details CI / iace-gt-coverage (push) Successful in 19s Details CI / test-python-backend (push) Successful in 24s Details CI / test-python-document-crawler (push) Has been skipped Details CI / test-python-dsms-gateway (push) Has been skipped Details	2026-06-26 11:48:09 +02:00
pilotadmin	eca8ec43c5	Merge pull request 'feat(reasoning): product-first regulatory pipeline — Profile → Navigator → Scope → Map → Interpretation' (#1 ) from feat/regulatory-reasoning-engine into main	2026-06-26 11:47:18 +02:00
Benjamin Admin	37c9b8e773	docs: Domaene-2 Wake-up-Trigger + erster Folgeauftrag Feature Coverage Report User-Praezisierung: Domaene 2 ruht NICHT unbestimmt. Wake-up-Trigger (EINER reicht): Feature Graph>=200 Features · Span-Anker verfuegbar · neue Regulierung ingestiert · Runtime kennt neue Evidence-Typen. Erster Folgeauftrag (gated auf Feature Library v1): FEATURE COVERAGE REPORT = Wissenslueckenanalyse pro Feature (Feature->cap.*->Obligation-> Procedure->Evidence -> Coverage %; zeigt fehlende Capability/Procedure/Evidence je Feature). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 11:24:06 +02:00
Benjamin Admin	50ae9e94d1	feat(interpretation-in-map): judge a customer interpretation within the map (step 5) Thin adapter — it judges the customer's reading WITHIN the already-built RegulatoryMap, it does not assess abstract legal questions and it is not RCI. - Reuses the existing assess_interpretation (no new legal reasoning); the 6 verdicts (plausible/too_narrow/too_broad/partially_correct/unsupported/uncertain) pass through unchanged. - Restricts affected_regulations/affected_obligations to those present in the map (intersection); links to the map's uncertain regulations. - Touched unsupported domains (wastewater/chemicals/...) are reported as future_corpus_domains (future_corpus_needed) — never pseudo-evaluated. - Customer-readable explanation ("Ihre Interpretation ist wahrscheinlich zu eng. … Betroffen in Ihrer Map: CRA."). - POST /reasoning/interpretation-in-map (renders the map, then interprets). - 7 tests; 63 green (existing reasoning MVP stays green), mypy clean, LOC ok. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:58:00 +02:00
Benjamin Admin	429ac957c1	docs: Feature Knowledge Graph + Sequenz (Domaene 3 Rename + Feature Library; Domaene 2 STOPP #59 ) User-Entscheidung: Domaene 3 = „Feature Knowledge Graph" (Kunden kaufen Features, nicht Capabilities — Advisor beginnt bei „Fernwartung", nicht „cap.transport_encryption"). Besitzt zusaetzlich Feature Library (~200-400 Features) != Product Profile. Volle Pipeline Feature Library -> Product Profile -> Capabilities -> Obligations -> Procedures -> Controls -> Evidence. SEQUENZ: (1) cap.-Vertrag JETZT an Domaene 3 uebergeben (Multiplikator); (2) Domaene 3 Vollgas (Feature->cap.); (3) Domaene 2 STOPP bei #59 (Capability Registry STABIL, nur Bugfixes, bis Domaene 3 den realen Bedarf zeigt); (4) Domaene 1 Re-Ingest/Spans/Citation. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:45:46 +02:00
Benjamin Admin	9312ad18ef	feat(regulatory-map): customer-readable read-model over the scope (step 4) The Map Renderer explains the engine's state, it does not extend it. Pure composition of resolve_product_scope (scope verdict) + derive_obligations (registry-linked obligations + overlaps) into one RegulatoryMap. - product_summary, trigger_facts, applicable/uncertain/excluded regulations, unsupported_domains, overlaps (shared_obligations), shared_evidence, and a customer-readable executive_summary. - No own legal decisions: applicable/uncertain mirror the scope verdict exactly. - Obligations shown ONLY when registry-linkable (registry_anchor) — MaschinenVO/ EMV obligations are proposed, so they render empty + a note, never as linked. Overlaps/shared_evidence likewise filtered to registry-linked members. - Uncertain regulations link to the navigator question that would resolve them (RED -> has_radio_module, DataAct -> generates_usage_data). - Environmental appears only as unsupported_domain; executive_summary has NO percentage (counts + "no further regulations identified" instead). - POST /reasoning/regulatory-map (thin handler). Response types are presentation- level, not meta-model classes (freeze v1.0 untouched). - 9 tests; 56 green (existing reasoning MVP stays green), mypy clean, LOC ok. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:36:06 +02:00
Benjamin Admin	2063615d37	feat: Capability Registry v1 API-Vertrag (#59 ) + Ownership-Modell finalisiert #59 (geschaerft, User): capabilities.json -> capability_registry_v1 (contract_version 1.0): stabile `cap.*`-IDs (NIE umbenennen) + 5 Vertragsfelder (description/guidance_basis/ realizes_obligations/required_procedures/evidence_patterns), PRODUKTNEUTRAL (keine Features). = stabiler API-Vertrag fuer die Product->Compliance-Schnittstelle (Feature->Capability, Session 3 mappt read-only dagegen). session_ownership_model_v1.md RESOLVED: Legal-Owner = Re-Ingest-Session (vergibt KEINE obligation_id, nur citation_span->legal_basis) · 4. Session -> Quality & Validation (nur Tests, KEINE Daten) · Compliance 2 Branches DAUERHAFT (A=Build, B=Runtime). 4-Bibliotheken- Zielbild (Legal/Product/Capability/Evidence). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:35:49 +02:00
Benjamin Admin	4d225f73a8	feat(ai-sdk): coverage blind-spot proposer (P2 slice 6, type 4) Completes the proposer's four types. - FindCoverageGaps (proposer_coverage.go): deterministic — which EN ISO 12100 hazard groups A-G did the engine leave with zero hazards for this machine? An empty group is a structural blind-spot signal (the machine may truly lack it, or a pattern/GT case is missing). Useful with no model at all. - ProposeMissingHazards + BuildCoveragePrompt: optional LLM expansion of each gap into specific expected-but-missing hazards a safety assessor would name (propose-only, reuses LLMCompleter, degrades to nil on any error). - Wired into iace-audit propose -> audit-reports/coverage.{md,json}. On the dishwasher: D. Pneumatik (truly absent — nothing invented), E. Laerm (borderline), F. Ergonomie (a genuine gap: manual loading the engine did not produce). P3 (pin an accepted proposal into a GT case) remains as a human-in-the- loop follow-up. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:27:01 +02:00
Benjamin Admin	c13aa9183a	feat(ai-sdk): vocab->tag proposer (P2 slice 5, type 3) Extends Method C: for each unknown narrative token that pattern text names, suggest the keyword_dictionary tag = the RequiredComponentTags shared by the naming patterns (ranked by frequency, kept only when shared by >=40% of them, top 3). Surfaces real dictionary gaps like "zwischenkreis" -> stored_energy and "updates" -> has_software, which close coverage without hand-editing the dict. Two precision fixes to Method C while here: - patternsMentioning now matches WHOLE WORDS, not substrings — substring matching flagged fragments like "stehen" inside "entstehen" and produced nonsensical tag suggestions. - a token is only proposed with a tag if one is shared by >=40% of its naming patterns, so diffuse common verbs (spread across categories) drop out. Wired into iace-audit propose -> audit-reports/vocab.{md,json}. Residual common-verb noise is left to the human/LLM filter rather than a hand-grown stopword list. Type 4 (coverage blind spots) + P3 (pin accepted proposals into a GT case) remain for slice 6. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:27:01 +02:00
Benjamin Admin	662aec209a	feat(ai-sdk): foreign-framing proposer (P2 slice 4, type 2) Surfaces fired patterns whose zone names terms the machine's narrative never mentions — foreign framing that leaks through terms not yet in domainGateTerms (once a term is a gate term, the ghost-pattern invariant already fences it out). - FindFramingCandidates (proposer_framing.go): per fired pattern, zone terms with no narrative echo (minus a generic hazard-location stoplist). Echo matching is bidirectional to survive German compounding (narrative "Steuerung" echoes zone "Steuerungssystem"). Heuristic verdict foreign (fully orphan) / plausible (partial). Over-surfaces by design — human/LLM is the precision filter. - Wired into iace-audit propose -> audit-reports/framing.{md,json}, threshold via IACE_FRAMING_MIN_ORPHAN (default 0.6). Honest finding: genuine wrong-MACHINE framing (Walzen, Transportbaender) no longer fires thanks to the machine-type gate; the residual is mostly cyber/control patterns with generic-industrial zone vocabulary, candidates for re-framing. Proposal types 3-4 (vocab->tag, coverage blind spots) remain for slice 5. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:27:01 +02:00
Benjamin Admin	8440ddfecb	feat(ai-sdk): runnable iace-audit propose CLI + live LLM wiring (P2 slice 3) Makes the offline proposer runnable end-to-end. - BuildProposerInput (proposer_input.go): non-test engine->hazards path. The PatternMatch->Hazard converter is lifted out of the GT test files into production scope so both the tests and the CLI share one pipeline. - iace-audit propose <narrative.json> [<ground-truth.json>]: detect candidates -> GT-screen survivors (when a ground truth is given) -> judge (HeuristicJudge by default, LLMJudge over ollama when IACE_PROPOSE_LLM=1) -> write the human-review queue to audit-reports/proposals.{md,json}. Propose-only. Smoke run on a dishwasher narrative: 32 fired -> 3 candidates -> queue with a confident duplicate, a confident distinct, and one punted to the LLM judge; GT wall recall-safe. Live qwen is opt-in via env; the heuristic default keeps the tool runnable (and CI deterministic) without a model. Proposal types 2-4 (foreign-framing gates, vocab->tag, coverage blind spots) remain for slice 4. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:27:01 +02:00
Benjamin Admin	0ce4794767	feat(ai-sdk): pluggable LLM judgment over recall-safe dedup candidates (P2 slice 2) Adds the semantic judgement layer on top of the slice-1 detector + GT wall. DEV-TIME, propose-only — nothing mutates the library or runtime. - CandidateJudge interface with two implementations: HeuristicJudge (deterministic default/fallback, used in tests) and LLMJudge (offline, over the shared llm.ProviderRegistry via the LLMCompleter adapter). LLMJudge degrades to "uncertain" on any transport/parse error — it can never break a run. - BuildJudgePrompt: the ISO 12100 same-vs-distinct prompt, unit-tested deterministically even though the call is not. - RenderProposalQueue: markdown human-review queue with a suggested action per candidate (supersede / keep both / needs review). On real warewashing output the heuristic punts to "uncertain — needs the LLM judge" for exactly the two recall-safe near-dupes (HP807/HP033 update, HP101/HP096 winding-vs-friction), making the LLM's role explicit. All 3 GTs unaffected (read-only). Live qwen wiring + a CLI/file queue are slice 3. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-26 10:27:01 +02:00

1 2 3 4 5 ...

1609 Commits