breakpilot-compliance

Author	SHA1	Message	Date
Benjamin Admin	fa45b5793c	feat: Control Library check via SQL (canonical_controls) instead of Qdrant. Complete rewrite of rag_document_checker.py: queries canonical_controls table (294K controls, 10K data_protection); filters by category + title keywords per document type; uses test_procedure field as actual check instructions; regex pre-check extracts key terms from procedure → fast match; LLM fallback only for regex misses (saves tokens); /no_think prefix for direct JSON output. SQL approach advantages: structured data with test_procedure, pass_criteria, fail_criteria; category filtering (data_protection, compliance, governance); no Qdrant API key issues; controls are actual check criteria, not general legal texts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 20:26:56 +02:00
Benjamin Admin	6da36d87c2	fix: Robust JSON parsing for LLM responses (handles unquoted keys, fallback extraction). LLM returns {fulfilled: true} instead of {"fulfilled": true}. Now fixes unquoted keys, True→true, and falls back to text-based boolean extraction when JSON parsing fails entirely. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 15:18:52 +02:00
Benjamin Admin	e50c4d659e	fix: Disable Qwen thinking mode for RAG checks (/no_think prefix). Qwen 3.5 uses all tokens for thinking, leaving the response empty. The /no_think prefix is now used to get direct JSON output. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 15:12:51 +02:00
Benjamin Admin	9f16e6d535	fix: Read Qwen response from 'thinking' field when 'response' is empty. Qwen 3.5 with latest Ollama returns structured thinking in a separate 'thinking' field, leaving 'response' empty. Now checks both fields. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 15:07:09 +02:00
Benjamin Admin	f4374cfe8d	feat: Semantic Qdrant search (embed query via bge-m3, vector search in local Qdrant). Replaces scroll+filter approach with proper semantic search: 1. Embed query via bp-core-embedding-service (bge-m3, 1024 dim); 2. Vector search in Qdrant (bp_compliance_datenschutz + bp_compliance_gesetze); 3. Sort by cosine similarity score; 4. No API key needed (local Qdrant on Mac Mini). Falls back gracefully: SDK first, then semantic Qdrant, then empty. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 14:46:06 +02:00
Benjamin Admin	7b8440191e	fix: Better error logging + increase LLM timeout to 120s for RAG check. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 14:33:58 +02:00
Benjamin Admin	510f513811	fix: Qdrant search uses chunk_text + section/category filter. Payload structure: chunk_text (not text), section (Article 13), category, regulation_id. Scrolls 100 points per collection, filters client-side against regulation keywords. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 14:28:32 +02:00
Benjamin Admin	b50c4ec940	fix: RAG checker falls back to local Qdrant when Go SDK returns 401. Go SDK points to external Qdrant (qdrant-dev.breakpilot.ai) with expired API key. Fallback: search directly in local Qdrant (bp-core-qdrant:6333), which has all collections: bp_compliance_datenschutz, bp_compliance_gesetze, atomic_controls_dedup. Search strategy: 1. Try Go SDK RAG endpoint (preferred, has embedding-based search); 2. Fallback: Qdrant scroll with text-based regulation filter. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 14:23:52 +02:00
Benjamin Admin	090da0f71b	feat: RAG-based document verification against 144K Control Library. New module rag_document_checker.py: searches RAG (Qdrant) for controls relevant to document type; filters by regulation (DSGVO Art.13, TDDDG §25, BGB §355 etc.); LLM (Qwen 3.5:35b) verifies each control against document text; returns fulfilled/missing with evidence text + severity; supports DSI, Cookie, Impressum, Widerruf, AGB, DSFA, AVV, Loeschkonzept. Integration in doc-check endpoint: regex checklist runs first (fast, deterministic); RAG checks run after (semantic, catches what regex misses); both results combined in single response. LLM prompt returns JSON: {fulfilled, evidence, issue, severity}. Think-tags stripped, JSON extracted from response. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-06 13:19:15 +02:00

9 Commits
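
The JSON handling described in commits 090da0f71b and 6da36d87c2 (strip think-tags, extract the JSON object, repair unquoted keys and Python-style True/False, then fall back to text-based boolean extraction) could look roughly like the sketch below. This is not the code from rag_document_checker.py: the function name parse_llm_verdict, the regular expressions, the "parsed" flag, and the default values in the fallback result are all assumptions made for illustration.

```python
import json
import re

# Hypothetical helpers; names and patterns are illustrative assumptions.
_THINK_RE = re.compile(r"<think>.*?</think>", re.DOTALL)
_UNQUOTED_KEY_RE = re.compile(r"([{,]\s*)([A-Za-z_][A-Za-z0-9_]*)(\s*:)")
_PY_LITERALS = {"True": "true", "False": "false", "None": "null"}


def parse_llm_verdict(raw: str) -> dict:
    """Best-effort parse of an LLM reply into a verdict dict."""
    # 1. Drop any <think>...</think> section the model may have emitted.
    text = _THINK_RE.sub("", raw).strip()

    # 2. Take the span from the first "{" to the last "}" as the JSON candidate.
    match = re.search(r"\{.*\}", text, re.DOTALL)
    if match:
        candidate = match.group(0)
        try:
            return json.loads(candidate)
        except json.JSONDecodeError:
            pass

        # 3. Repair pass, only reached when strict parsing failed:
        #    quote bare keys and map Python literals to JSON literals.
        #    Note: this may also rewrite matching text inside string values.
        repaired = _UNQUOTED_KEY_RE.sub(r'\1"\2"\3', candidate)
        repaired = re.sub(
            r"\b(True|False|None)\b",
            lambda m: _PY_LITERALS[m.group(1)],
            repaired,
        )
        try:
            return json.loads(repaired)
        except json.JSONDecodeError:
            pass

    # 4. Last resort: look for "fulfilled ... true" in the plain text.
    fulfilled = bool(re.search(r"fulfilled\W+true", text.lower()))
    return {
        "fulfilled": fulfilled,
        "evidence": "",
        "issue": "",
        "severity": "unknown",  # assumed placeholder value
        "parsed": False,        # assumed flag marking the text fallback
    }
```

Running the repair pass only after strict parsing fails keeps well-formed replies untouched; the text fallback is deliberately conservative and reports fulfilled as false unless the reply states otherwise.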