Benjamin_Boenisch
  • Joined on 2026-02-07
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 19:05:58 +00:00
e9ccd1e35c feat: add Kombi-Modus (PaddleOCR + Tesseract) for OCR Overlay
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-compliance 2026-03-12 18:55:30 +00:00
050f353192 feat(canonical-controls): Canonical Control Library — rechtssichere Security Controls
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 17:54:52 +00:00
d335a7bbf3 fix: use OCR word_box coordinates directly instead of fuzzy matching
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 16:46:23 +00:00
1f527fcd49 fix: split PaddleOCR boxes at leading ! for overlay word positioning
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 16:19:56 +00:00
8349c28f54 fix: paddle_direct reuses build_grid_from_words for correct overlay
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 16:10:13 +00:00
71a1b5f058 fix: paddle_direct groups words per row (matching _build_cells format)
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 15:56:26 +00:00
c743a38eaf fix: Paddle Direct keeps preprocessing (orient/deskew/dewarp/crop)
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-compliance 2026-03-12 15:44:32 +00:00
8442115e7c fix(rag): Fix bash compatibility + missing mkdir in phase functions
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 15:42:07 +00:00
90c1efd9b0 feat: Paddle Direct — 1-click OCR without deskew/dewarp/crop
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 15:19:21 +00:00
06d63d18f9 fix: generic fuzzy text matching for overlay word-box positioning
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-compliance 2026-03-12 15:14:54 +00:00
999cc81c78 feat(rag): Phase J — Security Guidelines & Standards (NIST, OWASP, ENISA)
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 15:08:21 +00:00
3e65b14b83 fix: split PaddleOCR boxes at IPA brackets for overlay positioning
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 15:00:10 +00:00
40ac593d28 fix: split PaddleOCR phrase boxes into per-word boxes for overlay slide
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-compliance 2026-03-12 14:56:30 +00:00
ff66612beb fix(rag): Make download failures non-fatal — prevent set -e from aborting entire ingestion
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 14:04:11 +00:00
ea69239e06 fix: word_boxes in words_first use absolute pixels (consistent with v2 grid)
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 13:52:26 +00:00
bb90d1ba94 fix: PaddleOCR engine forces words_first in frontend to match backend
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-compliance 2026-03-12 13:37:03 +00:00
42ec3cad6d feat(rag): Phase I DACH-Erweiterung — Gesetze, Templates, Urteile
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-lehrer 2026-03-12 13:29:01 +00:00
685d135be5 fix: downscale large images before PaddleOCR (Traefik 60s limit)
Benjamin_Boenisch pushed to main at Benjamin_Boenisch/breakpilot-compliance 2026-03-12 13:24:38 +00:00
9945a62a50 fix(rag): docker cp into /workspace_scripts, then copy at runtime
Benjamin_Boenisch pushed to coolify at Benjamin_Boenisch/breakpilot-core 2026-03-12 13:16:07 +00:00
778c44226e fix: expose port 8095 directly (bypass Traefik 60s timeout)