breakpilot-lehrer

Benjamin_Boenisch/breakpilot-lehrer

Fork 0

Files

T

History

Benjamin Admin 4a44ad7986

CI / go-lint (push) Has been skipped

Details

CI / python-lint (push) Has been skipped

Details

CI / nodejs-lint (push) Has been skipped

Details

CI / test-go-school (push) Successful in 26s

Details

CI / test-go-edu-search (push) Successful in 27s

Details

CI / test-python-klausur (push) Failing after 1m51s

Details

CI / test-python-agent-core (push) Successful in 18s

Details

CI / test-nodejs-website (push) Successful in 16s

Details

fix: hard-filter OCR words inside detected graphic regions

Run detect_graphic_elements() in the grid pipeline after image loading
and remove ALL words whose centroids fall inside detected graphic regions,
regardless of confidence. Previously only low-confidence words (conf < 50)
were removed, letting artifacts like "Tr", "Su" survive.

Changes:
- grid_editor_api.py: Import and call detect_graphic_elements() at Step 3a,
  passing only significant words (len >= 3) to avoid short artifacts fooling
  the text-vs-graphic heuristic. Hard-filter all words in graphic regions.
- cv_graphic_detect.py: Lower density threshold from 20% to 5% for large
  regions (>100x80px) — photos/illustrations have low color saturation.
  Raise page-spanning limit from 50% to 60% width/height.

Tested: 5 ground-truth sessions pass regression (079cd0d9, d8533a2c,
2838c7a7, 4233d7e3, 5997b635). Session 5997 now detects 2 graphic regions
and removes 29 artifact words including "Tr" and "Su".

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-22 10:18:23 +01:00

backend

fix: hard-filter OCR words inside detected graphic regions

2026-03-22 10:18:23 +01:00

docs

Initial commit: breakpilot-lehrer - Lehrer KI Platform

2026-02-11 23:47:26 +01:00

frontend

Initial commit: breakpilot-lehrer - Lehrer KI Platform

2026-02-11 23:47:26 +01:00

scripts

Initial commit: breakpilot-lehrer - Lehrer KI Platform

2026-02-11 23:47:26 +01:00

Dockerfile

perf(klausur-service): split Dockerfile into base + app layer

2026-02-26 17:43:24 +01:00

Dockerfile.base

feat: OCR pipeline step 8 — validation view with image detection & generation

2026-03-05 10:40:37 +01:00