feat(programs): open Domain Knowledge Program v1 — 7-stage production line + per-domain KPI

The real bottleneck is domain MODELLING. Phase B is organized as one program with sub-programs per
domain, each run through the SAME 7-stage production line. No new runtime framework, no new module
(ADR-009, Freeze v1.0) — only program data + a derived reporting view.

- Customer enters by INDUSTRY, not regulation: Industry -> Domain Model -> Requirement Sources ->
  Requirements -> Capabilities -> ... -> Completeness.
- 7-stage checklist identical for every domain (Domain Model / Requirement Sources / Capability
  Registry / Transition Patterns / Playbooks / Reference Scenarios / Completeness) with per-stage
  ownership. README generalized to the framework.
- Each domain lists typical_requirement_sources + typical_certifications -> pre-onboarding capability
  HYPOTHESIS (the ETO insight; feeds Company 2A as inferred, never confirmed).
- Backlog v1 (by customer value): 1 Industrial Automation, 2 Environmental, 3 Automotive, 4 Medical,
  5 Energy. Five domain-definition shells (environmental restructured to the unified shape, law-first
  preserved).
- Per-domain KPI is DERIVED from the real corpus (computed-not-stored; sources modelled / transition
  patterns / playbooks / reference scenarios), NOT a curated number. Reference suite renders maturity
  bars: Industrial Automation 43% (3/7 sources) leads, Environmental 0% (work ahead). Backlog (value)
  and KPI (corpus state) are deliberately separated.
- ADR-009: Domain Knowledge Program framework. Honest known refinement: regulation-ID normalization
  (CRA vs Cyber Resilience Act) aliased in the KPI.

7 program-contract tests (backlog order + industry-first + derived-not-stored), check-loc 0.
Knowledge data + ADR + reference harness = non-runtime -> no deploy (ADR-001).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
Benjamin Admin
2026-06-27 18:49:06 +02:00
parent c737e1ad7d
commit 1a9439d013
10 changed files with 312 additions and 159 deletions
+47 -40
View File
@@ -1,51 +1,58 @@
# Domain Knowledge Programsfrom architecture to domains
# Domain Knowledge Program — the production line for every domain
**The architecture is stable. From here the value comes from DOMAINS, not more software.**
The runtime architecture (scope, regulatory map, capability delta, optimization, playbooks, intake,
production, completeness) is frozen. A new regulatory domain is a **data + knowledge** project that
extends every existing view automatically — no new runtime framework (see ADR-008, Freeze v1.0).
**The architecture is stable. From here the value comes from DOMAIN MODELLING, not more software.**
The real bottleneck is no longer architecture or controls or even „knowledge" — it is **domain
modelling**. Phase B is therefore organised as ONE program with sub-programs per domain, each run
through the SAME production line. No new runtime framework (ADR-008/009, Freeze v1.0).
## The production line (reusable for EVERY domain)
## The customer enters by INDUSTRY, not by regulation
A customer never says „explain ISO 9001". They say „I build packaging machines" / „I'm an automotive
supplier" / „I build parking systems". So the pipeline starts at the industry:
```
Regulatory Corpus → Obligations → Capabilities → Transition Patterns → Playbooks → Reference Scenarios → Completeness
Industry → Domain Model → Requirement Sources → Requirements → Capabilities → … → Reality / Verification
```
Each domain delivers the SAME artifacts. Once they land, the existing engines automatically extend:
Scope · Gap · Capability Delta · Optimization · Playbooks · Reference Scenarios · Regulatory Completeness.
## The 7-stage checklist (identical for EVERY domain)
## Law-first (a deliberate ordering)
Start from the **law**, not from a management system. A management system (e.g. ISO 14001) is NOT
the domain — it is one possible source state. The customer asks *„welche Anforderungen gelten für mein
Produkt?"*, not *„wie komme ich von ISO 14001 weg?"*. So the order is:
```
Recht → Obligations → Capabilities → (Managementsystem) → Delta
```
A management-system→corpus transition pattern is built LAST, once BOTH sides are known.
## Ownership per stage (coordinate via the board, do not duplicate)
| Stage | Artifact | Owner |
| # | Stage | Owner |
|---|---|---|
| B1 Corpus + Obligations | legal sources, obligations registry | **Legal Knowledge / Obligation Registry** |
| B2 Capability Model | new capabilities in the Master Capability Registry | **Compliance Execution** |
| B3 Transition Patterns | source-state → corpus delta patterns | **Reasoning (Knowledge Acquisition)** |
| Playbooks | implementation playbooks per capability | **Reasoning** |
| Reference Scenarios | canonical regression + expected outcomes | **Reasoning** |
| Completeness | corpus-status registry per domain | **Reasoning / curation** |
| 1 | **Domain Model** (industry → what world is this?) | Reasoning / curation |
| 2 | **Requirement Sources** (which regulations/standards/specs apply) | Legal Knowledge |
| 3 | **Capability Registry** (capabilities the sources require) | Compliance Execution |
| 4 | **Transition Patterns** (source-state → domain delta) | Reasoning |
| 5 | **Playbooks** (how to implement each capability) | Reasoning |
| 6 | **Reference Scenarios** (canonical regression + expected outcomes) | Reasoning |
| 7 | **Completeness** (auditable coverage per domain) | Reasoning / curation |
## Programs (planned)
This is the scaling mechanism: every new domain reuses the same production line; the existing engines
(Scope, Gap, Capability Delta, Optimization, Playbooks, Reference, Completeness) extend automatically.
| Program | File | Status |
|---|---|---|
| Environmental Knowledge Program | `environmental.yaml` | started (B1 handed off) |
| Automotive Knowledge Program | _planned_ | — |
| OT / IEC 62443 Knowledge Program | _planned_ | — |
| Functional Safety Knowledge Program | _planned_ | — |
## A domain knows its typical sources → pre-onboarding HYPOTHESIS (the ETO insight)
Each program is a machine-readable definition (`*.yaml`) consumed by the reference suite to track its
progress; future sessions flip stage `status` as artifacts land, and the Completeness engine reports
the domain flipping `unsupported → validated` automatically.
Each domain definition lists `typical_requirement_sources` and `typical_certifications`. So before
onboarding, BreakPilot can say „this process world is *probably* present" — as a **hypothesis, not a
truth**. We don't want to know whether an automotive supplier has ISO 9001 (everyone does); we want
to know **which company capabilities are therefore probably already present** (feeds Company 2A as
`inferred`, never `confirmed`).
## Per-domain KPI — reproducible, not marketing
Progress per domain is **derived from the Regulatory Completeness Engine + the actual corpus**
(computed-not-stored): identified requirement sources · modelled capabilities · transition patterns ·
playbooks · passed reference scenarios · consciously declared corpus gaps. Rendered as a bar
(`Industrial ███████░░░ 70 %`). These are reproducible quality metrics — no curated numbers.
## Domain Knowledge Program v1 — backlog (by current customer value)
| Rank | Domain | File | Typical sources |
|---|---|---|---|
| 1 | **Industrial Automation** | `industrial_automation.yaml` | CRA · MaschinenVO · EMV · RED · Data Act · IEC 62443 · NIS2 |
| 2 | Environmental | `environmental.yaml` | Wasser · Chemikalien · Luft · Energie · Abfall · Produktverantwortung |
| 3 | Automotive | `automotive.yaml` | IATF · TISAX · UNECE R155/R156 · ASPICE · OEM-Lastenhefte |
| 4 | Medical | `medical.yaml` | MDR · IEC 62304 · ISO 14971 |
| 5 | Energy | `energy.yaml` | je nach Zielmarkt |
The work shifts decisively from software development to knowledge production; the competitive
advantage now comes from the quality and breadth of the modelled domains.