Commit Graph

6 Commits

Author SHA1 Message Date
BreakPilot Dev
70dd834137 feat(ocr): Add Ground Truth labeling UI for OCR comparison
Adds a step-through tool for creating 100% correct reference data (ground truth)
with position information. Users scan a page, review each vocabulary entry with
image crops, confirm or correct the OCR text, and save the result as JSON.

Backend: extract_entries_with_boxes() helper + 3 endpoints (extract-with-boxes,
ground-truth save/load). Frontend: GroundTruthPanel component with SVG overlay,
ImageCrop, keyboard shortcuts (Enter/Tab/arrows), and tab navigation in page.tsx.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-10 09:04:36 +01:00
BreakPilot Dev
2dd36099f1 feat(ocr): Add CV Document Reconstruction Pipeline for vocabulary extraction
New OCR method using classical Computer Vision: high-res rendering (432 DPI),
deskew, dewarp, binarization, projection-profile layout analysis, multi-pass
Tesseract OCR with region-specific PSM, and Y-coordinate line alignment.
Includes bugfix for convert_pdf_to_image call (line 869) and 39 unit tests.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-09 23:52:35 +01:00
BreakPilot Dev
2f8ffb7352 fix(ocr-compare): Replace Ollama call in grid analysis with heuristic from comparison results
Ollama crashes when two concurrent vision requests hit the 32B model
(compare-ocr + analyze-grid). The grid analysis was redundantly calling
Ollama again even though compare-ocr already extracted all vocabulary.

- compare-ocr now saves vocabulary in session for reuse
- analyze-grid builds grid from session data (no Ollama, instant response)
- Grid button disabled until comparison results are available
- Added export-to-editor functionality

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-09 18:08:13 +01:00
Benjamin Admin
c36af8d7d4 Restore OCR Compare page (ocr-compare/page.tsx)
Restores the full OCR comparison tool with vocab extraction,
green grid overlay, labeling, and LLM comparison features.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-09 10:56:03 +01:00
Benjamin Admin
ed275f4909 fix(admin-v2): Restore HEAD SDK files for compatibility with new pages
Restore the SDK context, types, and component files to the HEAD version
since newer pages (company-profile, import) depend on these API changes.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-09 10:18:39 +01:00
Benjamin Admin
bfdaf63ba9 fix: Restore all files lost during destructive rebase
A previous `git pull --rebase origin main` dropped 177 local commits,
losing 3400+ files across admin-v2, backend, studio-v2, website,
klausur-service, and many other services. The partial restore attempt
(660295e2) only recovered some files.

This commit restores all missing files from pre-rebase ref 98933f5e
while preserving post-rebase additions (night-scheduler, night-mode UI,
NightModeWidget dashboard integration).

Restored features include:
- AI Module Sidebar (FAB), OCR Labeling, OCR Compare
- GPU Dashboard, RAG Pipeline, Magic Help
- Klausur-Korrektur (8 files), Abitur-Archiv (5+ files)
- Companion, Zeugnisse-Crawler, Screen Flow
- Full backend, studio-v2, website, klausur-service
- All compliance SDKs, agent-core, voice-service
- CI/CD configs, documentation, scripts

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-09 09:51:32 +01:00