Benjamin_Boenisch/breakpilot-lehrer

Benjamin Admin 606bef0591

CI / go-lint (push) Has been skipped

CI / python-lint (push) Has been skipped

CI / nodejs-lint (push) Has been skipped

CI / test-go-school (push) Successful in 1m14s

CI / test-go-edu-search (push) Successful in 26s

CI / test-python-klausur (push) Failing after 1m55s

CI / test-python-agent-core (push) Successful in 17s

CI / test-nodejs-website (push) Successful in 17s

fix(ocr-pipeline): overlap-based word assignment and empty row filtering

1. Word-to-column assignment now uses overlap-based matching instead of
   center-point matching. This fixes narrow page_ref columns losing
   their last digit (e.g. "p.59" → "p.5") when the digit's center
   falls slightly past the midpoint boundary into the next column.

2. Post-OCR empty row filter: rows where ALL cells have empty text
   are removed after OCR. This catches inter-row gaps that had stray
   Tesseract artifacts giving word_count > 0 but no actual content.
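
The two changes above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the `Box` and `Column` types, the function names `assign_column` and `drop_empty_rows`, the column name `text`, and all pixel coordinates are hypothetical, and treating whitespace-only cells as empty is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Box:
    """A recognized word with its horizontal extent in pixels (hypothetical type)."""
    text: str
    left: int
    right: int


@dataclass
class Column:
    """A detected column with its horizontal extent in pixels (hypothetical type)."""
    name: str
    left: int
    right: int


def horizontal_overlap(word: Box, col: Column) -> int:
    """Number of pixels the word box shares with the column, never negative."""
    return max(0, min(word.right, col.right) - max(word.left, col.left))


def assign_column(word: Box, columns: list[Column]) -> Optional[Column]:
    """Pick the column with the largest overlap; None if the word touches no column."""
    best = max(columns, key=lambda c: horizontal_overlap(word, c), default=None)
    if best is None or horizontal_overlap(word, best) == 0:
        return None
    return best


def drop_empty_rows(rows: list[list[str]]) -> list[list[str]]:
    """Keep a row only if at least one cell has non-whitespace text (assumption)."""
    return [row for row in rows if any(cell.strip() for cell in row)]


# A narrow page_ref column next to a wider column (made-up coordinates).
columns = [Column("page_ref", 100, 145), Column("text", 145, 400)]

# The trailing digit of "p.59" straddles the boundary but lies mostly in page_ref.
digit = Box("9", 136, 148)
assigned = assign_column(digit, columns)

# The middle row has only empty or whitespace cells and is filtered out.
rows = [["p.59", "house"], ["", "  "], ["p.60", ""]]
kept = drop_empty_rows(rows)
```

Here the digit shares 9 pixels with `page_ref` and 3 with `text`, so it stays in `page_ref`. A row with a single non-empty cell, such as the last one, is kept because the filter only removes rows where every cell is empty.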

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-03 11:00:29 +01:00

.claude

docs: update CLAUDE.md for direct MacBook development workflow

2026-02-25 23:09:42 +01:00

.gitea/workflows

feat: edu-search-service migrated, voice-service/geo-service removed

2026-02-15 18:36:38 +01:00

.woodpecker

feat: edu-search-service migrated, voice-service/geo-service removed

2026-02-15 18:36:38 +01:00

admin-lehrer

fix(ocr-pipeline): show sub-columns in reconstruction and LLM review steps