feat(ocr): Add Ground Truth labeling UI for OCR comparison

Adds a step-through tool for creating 100% correct reference data (ground truth)
with position information. Users scan a page, review each vocabulary entry with
image crops, confirm or correct the OCR text, and save the result as JSON.

Backend: extract_entries_with_boxes() helper + 3 endpoints (extract-with-boxes,
ground-truth save/load). Frontend: GroundTruthPanel component with SVG overlay,
ImageCrop, keyboard shortcuts (Enter/Tab/arrows), and tab navigation in page.tsx.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
BreakPilot Dev
2026-02-10 09:04:36 +01:00
parent d4a23e8d99
commit 8c77df494b
4 changed files with 872 additions and 3 deletions

View File

@@ -11,3 +11,5 @@ export { CellCorrectionDialog } from './CellCorrectionDialog'
export { BlockReviewPanel, BlockReviewSummary } from './BlockReviewPanel'
export type { BlockStatus, MethodResult, BlockReviewData } from './BlockReviewPanel'
export { GroundTruthPanel } from './GroundTruthPanel'