T

Benjamin Admin c39787ad96 fix(onboarding): separate observation vs requirement signals — a demanded SBOM is not a present SBOM

Semantic correction of the knowledge base BEFORE the empirical loop (#59) is built — otherwise the
Observation Store would learn from already-misclassified signals. The Silent Pass conflated two kinds of
signal into one: an OBSERVATION ("I saw an SBOM in the repo") and a REQUIREMENT ("a tender DEMANDS an
SBOM"). They were aliased to the same canonical id, so a tender clause read as "SBOM already present" and
suppressed the very question that should have been asked.

Fix — make the kind explicit and authoritative (no new architecture, data + thin wiring):
  - `kind` ∈ {observation, requirement} on ProducedSignal (producer may declare) and on the canonical
    SignalVocabularyEntry (AUTHORITATIVE — a mislabelled producer cannot collapse the two).
  - Vocabulary split: sbom_file_found → sbom_present (obs) + sbom_required (req);
    security_txt_or_cvd_policy → cvd_policy_present (obs) + psirt_required (req); add signed_updates_required.
    requirement signals are intentionally UNMAPPED in intake_signal_map (they describe a target, not state).
  - silent_intake() consumes ONLY kind==observation; requirement signals are preserved in
    `requirements_seen` (visible/auditable) but NEVER become a detected capability.
  - normalize_signals() stamps the vocabulary's kind onto every IntakeSignal; unknown ids still pass through.

This is the same Observation-vs-Requirement split the Requirements Verification Platform rests on:
observations are reality, requirements are targets, and their comparison is the delta. A tender / OEM spec /
law now produces requirement signals; scanners / repos / documents produce observation signals.

Tests: rewrote the two test_signal_producer cases that previously ASSERTED the bug (tender == repo) to pin
the correct split; regression — `requires_sbom` yields no capability + stays in requirements_seen while
`cyclonedx_found` still detects sbom_creation; endpoint-level regression that a tender requirement does not
auto-detect and the gap stays asked; vocabulary-kind-overrides-mislabelled-producer. 25 onboarding tests
pass, mypy --strict clean, demo runs, check-loc 0. Runtime effect → deploy + smoke. (Fix A; partial-vs-
detected decoupling follows as Fix B before #59.)

2026-06-28 15:52:50 +02:00

.claude

feat: wire breakpilot-compliance to Infisical for local dev

2026-06-22 21:00:58 +02:00

.gitea/workflows

ci(go-lint): golangci-lint v1.64.8 (go1.24) + new-from-merge-base (#32 )

2026-06-23 10:58:48 +00:00

.woodpecker

fix(ci): update Go to 1.24 for ai-compliance-sdk

2026-02-15 17:43:27 +01:00

admin-compliance

feat(ai-sdk): legal-corpus coverage + Phase-2 citation-graph assessment (#33 )

2026-06-24 06:37:22 +00:00

ai-compliance-sdk

feat(ai-sdk): ePrivacy/cookie topic — §25 TDDDG co-primary for cookie questions

2026-06-27 07:19:00 +02:00

backend-compliance

fix(onboarding): separate observation vs requirement signals — a demanded SBOM is not a present SBOM

2026-06-28 15:52:50 +02:00

breakpilot-compliance-sdk

docs: update service READMEs for refactor progress and stale phase references

2026-04-19 16:07:23 +02:00

compliance-tts-service

feat(p83): wire BUILD_SHA through all Dockerfiles + compose + CI check

2026-05-22 18:29:03 +02:00

consent-sdk

refactor(consent-sdk,dsms-gateway): split ConsentManager, types, and main.py

2026-04-18 08:42:32 +02:00

consent-tester

fix(consent-history): banner_provider als Fallback fürs CMP (#62 )

2026-06-13 17:03:44 +02:00

design/redesign

feat(redesign): Design-Tokens + Ebene-2 "Cyber trifft Safety" (additiv)

2026-06-18 16:49:04 +02:00

developer-portal

feat(p83): wire BUILD_SHA through all Dockerfiles + compose + CI check

2026-05-22 18:29:03 +02:00

docs-site

feat(p83): wire BUILD_SHA through all Dockerfiles + compose + CI check

2026-05-22 18:29:03 +02:00

docs-src

feat: Smart Onboarding Advisor — make the knowledge usable in onboarding (ADR-012)

2026-06-28 12:45:49 +02:00

document-crawler

feat(p83): wire BUILD_SHA through all Dockerfiles + compose + CI check

2026-05-22 18:29:03 +02:00

dsms-gateway

feat(iace): DSMS-CID-Badge im Tech-File-Export + aggregierter Bulk-Diff

2026-06-09 09:07:20 +02:00

dsms-node

feat(p83): wire BUILD_SHA through all Dockerfiles + compose + CI check

2026-05-22 18:29:03 +02:00

obligations

feat(rs-004): PROPOSED MaschinenVO obligation->capability linking (safety-expert input)

2026-06-28 09:08:54 +02:00

scripts

feat: Capability Registry v1 API-Vertrag (#59 ) + Ownership-Modell finalisiert

2026-06-26 10:35:49 +02:00

zeroclaw

feat(platform): live-wire AGB v2 + DSE v3 + Architektur-Tab (#29 )

2026-06-21 12:58:26 +00:00

.env.example

feat(infra): Qdrant + MinIO auf externe Hetzner-Services migrieren

2026-03-06 14:33:04 +01:00

.env.orca.example

chore: replace all Coolify references with Orca

2026-04-19 16:33:56 +02:00

.gitignore

docs: add root README, CONTRIBUTING, onboarding section, gitignore fixes

2026-04-19 16:09:28 +02:00

.gitleaks.toml

feat(platform): live-wire AGB v2 + DSE v3 + Architektur-Tab (#29 )

2026-06-21 12:58:26 +00:00

.infisical.json

feat: wire breakpilot-compliance to Infisical for local dev

2026-06-22 21:00:58 +02:00

AGENTS.go.md

fix: resolve CI failures in Python tests and admin-compliance build

2026-04-19 16:41:39 +02:00

AGENTS.python.md

fix: resolve CI failures in Python tests and admin-compliance build

2026-04-19 16:41:39 +02:00

AGENTS.typescript.md

docs(agents): require build + lint + test locally before pushing [guardrail-change]

2026-04-19 16:38:21 +02:00

CONTRIBUTING.md

chore: replace all Coolify references with Orca

2026-04-19 16:33:56 +02:00

docker-compose.hetzner.yml

feat(mcp): HTTP+Bearer CRA-MCP-Server für den Repo-Scanner + Finding-Adapter

2026-06-15 18:30:47 +02:00

docker-compose.orca.yml

chore: replace all Coolify references with Orca

2026-04-19 16:33:56 +02:00

docker-compose.yml

feat(cra): Pull-Flow — Findings vom Scanner-MCP ziehen + assessen

2026-06-15 19:05:44 +02:00

dse_criteria_backup.json

feat(platform): live-wire AGB v2 + DSE v3 + Architektur-Tab (#29 )

2026-06-21 12:58:26 +00:00

dse_criteria_changelog.json

feat(platform): live-wire AGB v2 + DSE v3 + Architektur-Tab (#29 )

2026-06-21 12:58:26 +00:00

INFISICAL_SETUP.md

feat: wire breakpilot-compliance to Infisical for local dev

2026-06-22 21:00:58 +02:00

Makefile

feat: wire breakpilot-compliance to Infisical for local dev

2026-06-22 21:00:58 +02:00

mkdocs.yml

docs(architecture): RAG retrieval engine architecture set (01-09)

2026-06-25 09:25:22 +02:00

README.md

feat: wire breakpilot-compliance to Infisical for local dev

2026-06-22 21:00:58 +02:00

REFACTOR_PLAYBOOK.md

docs: add root README, CONTRIBUTING, onboarding section, gitignore fixes

2026-04-19 16:09:28 +02:00

README.md

breakpilot-compliance

DSGVO/AI-Act compliance platform — 10 services, Go · Python · TypeScript

Overview

breakpilot-compliance is a multi-tenant DSGVO/EU AI Act compliance platform that provides an SDK for consent management, data subject requests (DSR), audit logging, iACE impact assessments, and document archival. It ships as 10 containerised services covering an admin dashboard, a developer portal, a Python/FastAPI backend, a Go AI compliance engine, TTS, and a decentralised document store on IPFS. Every service is deployed automatically via Gitea Actions → Orca on every push to main.

Architecture

Service	Tech	Port	Container
admin-compliance	Next.js 15	3007	bp-compliance-admin
backend-compliance	Python / FastAPI 0.123	8002	bp-compliance-backend
ai-compliance-sdk	Go 1.24 / Gin	8093	bp-compliance-ai-sdk
developer-portal	Next.js 15	3006	bp-compliance-developer-portal
breakpilot-compliance-sdk	TypeScript SDK (React/Vue/Angular/vanilla)	—	—
consent-sdk	JS/TS Consent SDK	—	—
compliance-tts-service	Python / Piper TTS	8095	bp-compliance-tts
document-crawler	Python / FastAPI	8098	bp-compliance-document-crawler
dsms-gateway	Python / FastAPI / IPFS	8082	bp-compliance-dsms-gateway
dsms-node	IPFS Kubo v0.24.0	—	bp-compliance-dsms-node

All containers share the external breakpilot-network Docker network and depend on breakpilot-core (Valkey, Vault, RAG service, Nginx reverse proxy).

Quick Start

Prerequisites: Docker, Go 1.24+, Python 3.12+, Node.js 20+, Infisical CLI

git clone ssh://git@gitea.meghsakha.com:22222/Benjamin_Boenisch/breakpilot-compliance.git
cd breakpilot-compliance

# One-time per machine: log in to the self-hosted Infisical instance
infisical login --domain https://secrets.meghsakha.com

# Start the full stack with secrets injected from Infisical (env=dev)
make dev

Secrets are pulled from Infisical (secrets.meghsakha.com) at runtime; .env files are not used. See INFISICAL_SETUP.md for full onboarding, and make help for the rest of the targets (dev-build, dev-down, secrets, secrets-set).

For the Orca/Hetzner production target (x86_64), use the override:

make dev ENV=prod  # or:
infisical run --env=prod -- docker compose -f docker-compose.yml -f docker-compose.hetzner.yml up -d

Development Workflow

Use feature branches off main. Supported prefixes: feat/, feature/, hotfix/.

git checkout main && git pull origin main
git checkout -b feat/my-change
# ... make changes ...
git push origin feat/my-change
# Open a PR → squash merge to main

Push to main triggers:

Gitea Actions — lint → test → validate (see CI Pipeline below)
Orca — automatic build + deploy (~3 min total)

Monitor status: https://gitea.meghsakha.com/Benjamin_Boenisch/breakpilot-compliance/actions

CI Pipeline

Defined in .gitea/workflows/ci.yaml.

Job	What it checks
`loc-budget`	All source files ≤ 500 LOC; soft target 300
`guardrail-integrity`	Commits touching guardrail files carry `[guardrail-change]`
`go-lint`	`golangci-lint` on `ai-compliance-sdk/`
`python-lint`	`ruff` + `mypy` on Python services
`nodejs-lint`	`tsc --noEmit` + ESLint on Next.js services
`test-go-ai-compliance`	`go test ./...` in `ai-compliance-sdk/`
`test-python-backend-compliance`	`pytest` in `backend-compliance/`
`test-python-document-crawler`	`pytest` in `document-crawler/`
`test-python-dsms-gateway`	`pytest test_main.py` in `dsms-gateway/`
`sbom-scan`	License + vulnerability scan via `syft` + `grype`
`validate-canonical-controls`	OpenAPI contract baseline diff

File Budget

Limit	Value	How to check
Soft target	300 LOC	`bash scripts/check-loc.sh`
Hard cap	500 LOC	Same; also enforced by `PreToolUse` hook + git pre-commit + CI
Exceptions	`.claude/rules/loc-exceptions.txt`	Require written rationale + `[guardrail-change]` commit marker

The .claude/settings.json PreToolUse hook blocks Claude Code from writing or editing files that would exceed the hard cap. The git pre-commit hook re-checks. CI is the final gate.

Links

	URL
Admin dashboard	https://admin-dev.breakpilot.ai
Developer portal	https://developers-dev.breakpilot.ai
Backend API	https://api-dev.breakpilot.ai
AI SDK API	https://sdk-dev.breakpilot.ai
Gitea repo	https://gitea.meghsakha.com/Benjamin_Boenisch/breakpilot-compliance
Gitea Actions	https://gitea.meghsakha.com/Benjamin_Boenisch/breakpilot-compliance/actions

Languages

TypeScript 39.8%

Python 35.3%

Go 22.4%

Shell 1.1%

PLpgSQL 0.7%

Other 0.4%