Files

T

Benjamin Admin 98d616d82b feat: Observation Model — the empirical learning unit, defined BEFORE persistence (Task 59a)

The learning point is not the hypothesis, it is the QUESTION — and confirmed/refuted is too coarse.
"partial, only critical suppliers" or "certified but not lived" are not "wrong", they are valuable
knowledge. So the chain is Hypothesis -> Question -> Observation -> (Review) -> Hypothesis, and the
observation model must be defined cleanly before any store/API (else thousands of too-coarse
observations get migrated later).

compliance/onboarding/observations.py:
  - ObservationType: confirmed / partial / refuted / not_applicable / unknown (richer than binary).
  - Observation: {hypothesis_id, capability, question, answer (free text), observation_type,
    scope_note ("only critical suppliers"), evidence_uploaded, reviewed, reviewed_by}.
  - empirical_distribution() -> a DISTRIBUTION (confirmed 61 / partial 31 / refuted 8), not one %.
  - empirical_confidence() -> (confirmed + 0.5*partial) / (confirmed+partial+refuted); n.a./unknown
    excluded; None until calibrated.
  - REVIEW GATE: only reviewed observations calibrate — a raw answer never changes a hypothesis (no
    learning from outliers).

Refactor: the hypothesis is now PURE curated knowledge — the binary observations counter and any
confidence are removed from CapabilityHypothesis and the YAML; confidence is COMPUTED from the separate
reviewed observation stream. Pure, mypy --strict clean. Persistence/aggregation/calibration are 59b/c/d.
Non-runtime -> no deploy. 12 tests pass, check-loc 0.

2026-06-28 13:31:43 +02:00

auth

…

classroom_engine

…

compliance

feat: Observation Model — the empirical learning unit, defined BEFORE persistence (Task 59a)

2026-06-28 13:31:43 +02:00

knowledge

feat: Observation Model — the empirical learning unit, defined BEFORE persistence (Task 59a)

2026-06-28 13:31:43 +02:00

middleware

…

migrations

fix(db): dedupe doc_check_controls 3x + unique constraint

2026-06-20 14:25:03 +02:00

reference_scenarios

feat: Certification Capability Hypotheses — capability-centric library + empirical confidence

2026-06-28 13:16:45 +02:00

scripts

chore(cra): align CRA module to the dev/demo tenant + demo-customer seed script

2026-06-14 15:52:49 +02:00

services

…

templates/gdpr

…

tests

feat: Observation Model — the empirical learning unit, defined BEFORE persistence (Task 59a)

2026-06-28 13:31:43 +02:00

consent_admin_api.py

…

consent_api.py

…

consent_client.py

…

database.py

…

Dockerfile

feat(audit): Screenshot+Tesseract-OCR Cookie-Extract als Vendor-Quelle C

2026-05-22 23:22:35 +02:00

gdpr_api.py

…

gdpr_export_service.py

…

main.py

feat(sdk): Kunden-Dokumente + CRA-Meldewesen, Screening aus Frontend genommen

2026-06-17 21:21:28 +02:00

migration_runner.py

…

mypy.ini

…

PHASE1_RUNBOOK.md

…

README.md

…

requirements-reranker.txt

…

requirements.txt

feat(cra): standalone CRA finding->Annex I risk mapper + MCP interface

2026-06-13 20:22:34 +02:00

README.md

backend-compliance

Python/FastAPI service implementing the DSGVO compliance API: DSR, DSFA, consent, controls, risks, evidence, audit, vendor management, ISMS, change requests, document generation.

Port: 8002 (container: bp-compliance-backend) Stack: Python 3.12, FastAPI, SQLAlchemy 2.x, Alembic, Keycloak auth.

Architecture

compliance/
├── api/            # Routers (thin, ≤30 LOC per handler)
├── services/       # Business logic
├── repositories/   # DB access
├── domain/         # Value objects, domain errors
├── schemas/        # Pydantic models, split per domain
└── db/models/      # SQLAlchemy ORM, one module per aggregate

The service follows this layered target structure but not all files are fully refactored yet. Phase 1 backlog is tracked in .claude/rules/loc-exceptions.txt (27 backend-compliance files currently excepted).

See ../AGENTS.python.md for the full convention and ../.claude/rules/architecture.md for the non-negotiable rules.

Run locally

cd backend-compliance
pip install -r requirements.txt
export COMPLIANCE_DATABASE_URL=...  # Postgres (Hetzner or local)
uvicorn main:app --reload --port 8002

Tests

pytest compliance/tests/ -v
pytest --cov=compliance --cov-report=term-missing

Layout: tests/unit/, tests/integration/, tests/contracts/. Contract tests diff /openapi.json against tests/contracts/openapi.baseline.json.

Public API surface

404+ endpoints across /api/v1/*. Grouped by domain: ai, audit, consent, dsfa, dsr, gdpr, vendor, evidence, change-requests, generation, projects, company-profile, isms. Every path is a contract — see the "Public endpoints" rule in the root CLAUDE.md.

Environment

Var	Purpose
`COMPLIANCE_DATABASE_URL`	Postgres DSN, `sslmode=require`
`KEYCLOAK_*`	Auth verification
`QDRANT_URL`, `QDRANT_API_KEY`	Vector search
`CORE_VALKEY_URL`	Session cache

Don't touch

Database schema, __tablename__, column names, existing migrations under migrations/. See root CLAUDE.md rule 3.