686834cea0
Build + Deploy / build-ai-sdk (push) Failing after 36s
Build + Deploy / build-developer-portal (push) Successful in 8s
Build + Deploy / build-tts (push) Successful in 7s
Build + Deploy / build-document-crawler (push) Successful in 7s
Build + Deploy / build-admin-compliance (push) Successful in 8s
Build + Deploy / build-backend-compliance (push) Successful in 8s
CI / nodejs-build (push) Successful in 3m14s
CI / dep-audit (push) Has been skipped
CI / sbom-scan (push) Has been skipped
CI / test-go (push) Failing after 46s
CI / test-python-backend (push) Successful in 43s
CI / test-python-document-crawler (push) Successful in 29s
CI / test-python-dsms-gateway (push) Successful in 30s
CI / validate-canonical-controls (push) Successful in 16s
Build + Deploy / build-dsms-gateway (push) Successful in 8s
Build + Deploy / build-dsms-node (push) Successful in 8s
CI / branch-name (push) Has been skipped
Build + Deploy / trigger-orca (push) Has been skipped
CI / guardrail-integrity (push) Has been skipped
CI / loc-budget (push) Failing after 17s
CI / secret-scan (push) Has been skipped
CI / go-lint (push) Has been skipped
CI / python-lint (push) Has been skipped
CI / nodejs-lint (push) Has been skipped
1. EU Institution Checks (Verordnung 2018/1725): - New doc_type "eu_institution" with 9 L1 + 15 L2 checks - Both German + English patterns (EU institutions are multilingual) - Auto-detection via "2018/1725", "EDSB", "EDPS" keywords - Correct article references (Art. 15 instead of 13, Art. 5 instead of 6) 2. Banner Check Integration: - banner_runner.py maps scan results to 36 L1/L2 structured checks - BannerCheckTab shows hierarchical ChecklistView with hints - 3-phase summary (cookies/scripts before/after consent) - /scan endpoint now includes structured_checks in response 3. JS-heavy Website Fixes (dm, Zalando, HWK): - dsi_helpers.py: goto_resilient (networkidle→domcontentloaded fallback) - try_dismiss_consent_banner before text extraction - PDF redirect detection (dm.de redirects to GCS PDF) 4. Caritas False Positive Fixes: - Phone regex allows parentheses: +49 (0)761 → now matches - "Recht auf Widerspruch" (3 words) + §23 KDG → matches Art. 21 - Church authorities: "Katholisches Datenschutzzentrum" recognized Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
35 lines
1.1 KiB
Python
35 lines
1.1 KiB
Python
"""
|
|
doc_checks — Legal document compliance checkers.
|
|
|
|
Provides checklists and functions for verifying legal documents
|
|
(DSI, AGB, Impressum, Cookie, Widerruf, Social Media, DSFA, EU Institution)
|
|
against their mandatory content requirements.
|
|
|
|
Two check levels:
|
|
L1 — "Is the mandatory field mentioned?"
|
|
L2 — "Is it correct/complete?"
|
|
"""
|
|
|
|
from .runner import check_document_completeness, classify_document_type
|
|
from .dse_checks import ART13_CHECKLIST
|
|
from .widerruf_checks import WIDERRUF_CHECKLIST
|
|
from .agb_checks import AGB_CHECKLIST
|
|
from .impressum_checks import IMPRESSUM_CHECKLIST
|
|
from .cookie_checks import COOKIE_CHECKLIST
|
|
from .social_media_checks import JOINT_CONTROLLER_CHECKLIST
|
|
from .dsfa_checks import DSFA_CHECKLIST
|
|
from .eu_institution_checks import EU_INSTITUTION_CHECKLIST
|
|
|
|
__all__ = [
|
|
"check_document_completeness",
|
|
"classify_document_type",
|
|
"ART13_CHECKLIST",
|
|
"WIDERRUF_CHECKLIST",
|
|
"AGB_CHECKLIST",
|
|
"IMPRESSUM_CHECKLIST",
|
|
"COOKIE_CHECKLIST",
|
|
"JOINT_CONTROLLER_CHECKLIST",
|
|
"DSFA_CHECKLIST",
|
|
"EU_INSTITUTION_CHECKLIST",
|
|
]
|