-
b697963186
fix: use Alpine-compatible addgroup/adduser flags in Dockerfiles
coolify
Sharang Parnerkar
2026-03-06 22:38:31 +01:00
-
14c8bb5da0
chore: LLM qwen3:30b-a3b → qwen3.5:35b-a3b
Benjamin Admin
2026-03-06 07:32:39 +01:00
-
4532f68173
fix: Word-Validation auf Segment-Woerter beschraenken
Benjamin Admin
2026-03-05 23:13:19 +01:00
-
391449fedf
fix: Seite an Sub-Headern segmentieren, groesstes Segment fuer Projektion
Benjamin Admin
2026-03-05 23:07:23 +01:00
-
cb2b924a7b
fix: word-coverage gap detection als Fallback bei Illustrationen
Benjamin Admin
2026-03-05 22:58:27 +01:00
-
8f3a50b981
fix: full-width Zeilen vor Spaltenerkennung maskieren
Benjamin Admin
2026-03-05 22:50:27 +01:00
-
0f821afb23
feat(sbom): Lehrer-spezifisch — 17 Core/Compliance-Eintraege entfernt, Beschreibungen angepasst
Benjamin Admin
2026-03-05 20:34:20 +01:00
-
2ad391e4e4
feat: Feinabstimmung mit 7 Schiebereglern fuer Deskew/Dewarp
Benjamin Admin
2026-03-05 18:22:33 +01:00
-
e0decac7a0
feat: Unified Inbox in Kommunikation-Navigation hinzugefuegt
Benjamin Admin
2026-03-05 18:04:30 +01:00
-
d39d249daa
feat: add pass 3 text-line regression to deskew pipeline
Benjamin Admin
2026-03-05 17:53:11 +01:00
-
538d5c732e
feat: two-pass deskew with wider angle range and residual correction
Benjamin Admin
2026-03-05 17:34:57 +01:00
-
b9c3c47a37
refactor: LLM Compare komplett entfernt, Video/Voice/Alerts Sidebar hinzugefuegt
Benjamin Admin
2026-03-05 17:34:54 +01:00
-
9912997187
refactor: Jitsi/Matrix/Voice von Core übernommen, Camunda/BPMN gelöscht, Kommunikation-Nav
Benjamin Admin
2026-03-05 17:01:47 +01:00
-
2ec4d8aabd
fix: JSX syntax — IIFE wrapping for vocabulary tab
Benjamin Admin
2026-03-05 17:01:33 +01:00
-
24366880ad
feat: vocab worksheet — full-quality images, insert triangles, dynamic columns
Benjamin Admin
2026-03-05 16:49:15 +01:00
-
20b341d839
fix: vocab worksheet fills full browser width, fix missing thumbnails
Benjamin Admin
2026-03-05 16:30:04 +01:00
-
d5be7b6f77
fix: vocab worksheet — wider table, show original pages, better layout
Benjamin Admin
2026-03-05 16:07:25 +01:00
-
b7ae36e92b
feat: use OCR pipeline instead of LLM vision for vocab worksheet extraction
Benjamin Admin
2026-03-05 15:35:44 +01:00
-
9ea77ba157
fix: Abschliessen button returns to session list on last pipeline step
Benjamin Admin
2026-03-05 15:05:48 +01:00
-
4f9cf3b9e8
fix: validation step buttons unreachable — reduce panel height + sticky bar
Benjamin Admin
2026-03-05 14:54:01 +01:00
-
b8a9493310
fix: deskew iterative — use vertical Sobel edges + vertical projection
Benjamin Admin
2026-03-05 14:23:43 +01:00
-
68a6b97654
fix: use gradient score instead of variance for iterative deskew
Benjamin Admin
2026-03-05 14:11:19 +01:00
-
af1b12c97d
feat: iterative projection-profile deskew (2-phase variance optimization)
Benjamin Admin
2026-03-05 13:46:44 +01:00
-
770aea611f
fix: correct example field (fixes iberqueren), disable cell-level bold
Benjamin Admin
2026-03-05 13:15:59 +01:00
-
1a2efbf075
fix: relative bold detection (page median), fix save/finish buttons
Benjamin Admin
2026-03-05 13:02:16 +01:00
-
cd12755da6
feat: OCR umlaut confusion correction + bold detection via stroke-width
Benjamin Admin
2026-03-05 12:06:57 +01:00
-
40cfc1acdd
fix: validation step — original image URL, white background, dynamic font size
Benjamin Admin
2026-03-05 11:40:24 +01:00
-
aa136a9f80
chore: add mflux model download script for off-peak scheduling
Benjamin Admin
2026-03-05 11:20:53 +01:00
-
e6858010c2
feat: RAG Chunk Browser — alle Collections + 59 EDPB/WP29/DSFA Eintraege
Benjamin Admin
2026-03-05 11:01:14 +01:00
-
1cc69d6b5e
feat: OCR pipeline step 8 — validation view with image detection & generation
Benjamin Admin
2026-03-05 10:40:37 +01:00
-
293e7914d8
feat: improved OCR pipeline session manager with categories, thumbnails, pipeline logging
Benjamin Admin
2026-03-05 09:44:38 +01:00
-
a58dfca1d8
fix: move char-confusion fix to correction step, add spell + page-ref corrections
Benjamin Admin
2026-03-05 00:26:13 +01:00
-
fd99d4f875
cleanup: remove sheet-specific code, reduce logging, document constants
Benjamin Admin
2026-03-05 00:04:02 +01:00
-
1e0c6bb4b5
feat: hybrid OCR — full-page for broad columns, cell-crop for narrow
Benjamin Admin
2026-03-04 23:38:44 +01:00
-
e6dc3fcdd7
fix: only replace phonetics in english field, fix grammar detection
Benjamin Admin
2026-03-04 23:19:03 +01:00
-
edbdac3203
fix: improve phonetic bracket replacement logic
Benjamin Admin
2026-03-04 23:13:34 +01:00
-
99573a46ef
debug: add phonetic bracket replacement logging
Benjamin Admin
2026-03-04 23:01:01 +01:00
-
6ad4b84584
fix: broaden phonetic bracket regex to catch Tesseract-garbled IPA
Benjamin Admin
2026-03-04 22:53:50 +01:00
-
f94a3836f8
fix: use Tesseract as default engine for cell-first OCR instead of RapidOCR
Benjamin Admin
2026-03-04 22:30:34 +01:00
-
34c649c8be
fix: send SSE keepalive events every 5s during batch OCR
Benjamin Admin
2026-03-04 22:21:14 +01:00
-
dd16c88007
fix: retry words request on 400/404 + add backend diagnostic logging
Benjamin Admin
2026-03-04 20:15:54 +01:00
-
9cbf0fb278
fix: Fake Compliance Advisor aus Lehrer KI-Admin entfernt
Benjamin Admin
2026-03-04 20:15:50 +01:00
-
90ecb46bed
fix: force 3x upscale for short RapidOCR crops + lower box_thresh
Benjamin Admin
2026-03-04 19:47:36 +01:00
-
bb0e23303c
debug: log RapidOCR upscale dimensions to verify scaling
Benjamin Admin
2026-03-04 18:18:03 +01:00
-
604da26b24
fix: upscale RapidOCR crops to min 150px (was 64px), matching Tesseract
Benjamin Admin
2026-03-04 17:38:06 +01:00
-
113a1c10e5
fix: add 3px cell padding + upscale small RapidOCR crops + diagnostic logging
Benjamin Admin
2026-03-04 16:45:59 +01:00
-
e4bdb3cc24
debug: add diagnostic logging to _ocr_cell_crop for empty cell investigation
Benjamin Admin
2026-03-04 16:35:33 +01:00
-
d0e7966925
fix: use header/footer row boundaries for _heal_row_gaps in cell-first OCR
Benjamin Admin
2026-03-04 15:44:13 +01:00
-
68d230c297
fix: use batch-then-stream SSE for cell-first OCR
Benjamin Admin
2026-03-04 14:51:55 +01:00
-
16dc77e5c2
chore: add migration 005_add_doc_type.sql
Benjamin Admin
2026-03-04 13:54:56 +01:00
-
29c74a9962
feat: cell-first OCR + document type detection + dynamic pipeline steps
Benjamin Admin
2026-03-04 13:52:38 +01:00
-
00a74b3144
revert: remove marker column OCR special handling
Benjamin Admin
2026-03-04 11:52:59 +01:00
-
489835a279
fix: detect red/coloured markers in OCR pipeline
Benjamin Admin
2026-03-04 11:38:12 +01:00
-
f0726d9a2b
fix: shrink overlapping neighbors after narrow column expansion
Benjamin Admin
2026-03-04 11:12:13 +01:00
-
ae1f9f7494
fix: expand narrow columns into neighbor space, not just gaps
Benjamin Admin
2026-03-04 10:49:10 +01:00
-
e4aff2b27e
fix: rewrite Method D to measure vertical column drift instead of text-line slope
Benjamin Admin
2026-03-04 10:31:19 +01:00
-
9dd77ab54a
fix: move column expansion AFTER sub-column split
Benjamin Admin
2026-03-04 10:07:40 +01:00
-
e426de937c
fix: expand narrow columns + lower dewarp thresholds for small angles
Benjamin Admin
2026-03-04 09:32:47 +01:00
-
0d3f001acb
fix: always include detections in dewarp response, even when no correction applied
Benjamin Admin
2026-03-04 09:05:43 +01:00
-
c484a89b78
fix: dewarp UI shows detection details, quality gate status, confidence bars
Benjamin Admin
2026-03-04 08:39:55 +01:00
-
d5f2ce4659
fix: Fabric.js v6 API compatibility + CLAUDE.md SSH commands
Benjamin Admin
2026-03-03 23:01:19 +01:00
-
ab3ecc7c08
feat: OCR pipeline v2.1 – narrow column OCR, dewarp automation, Fabric.js editor
Benjamin Admin
2026-03-03 22:44:14 +01:00
-
970ec1f548
docs: OCR-Pipeline v2.0.0 – alle Optimierungen 2026-03-03 dokumentiert
Benjamin Admin
2026-03-03 18:42:25 +01:00
-
a610bc75ba
fix: rename LLM-Korrektur to Korrektur in wizard stepper and types
Benjamin Admin
2026-03-03 17:56:46 +01:00
-
153f41358b
fix: remove stale allCells dependency in emptyCellIds memo
Benjamin Admin
2026-03-03 17:39:14 +01:00
-
d1c8075da2
fix: three OCR pipeline UX improvements
Benjamin Admin
2026-03-03 17:31:55 +01:00
-
f3d61a9394
fix: extend initial Tesseract scan to full image width for word detection
Benjamin Admin
2026-03-03 17:08:03 +01:00
-
ab2423bd10
fix: protect numbered list prefixes from 1→I confusion in char fix step
Benjamin Admin
2026-03-03 16:46:45 +01:00
-
b914b6f49d
fix(columns): extend rightmost column to full image width (w) not content right_x
Benjamin Admin
2026-03-03 16:25:07 +01:00
-
123b7ada0b
fix(columns): filter phantom narrow columns + rename step to OCR-Zeichenkorrektur
Benjamin Admin
2026-03-03 16:06:59 +01:00
-
cb61fab77b
fix(rows): filter artifact rows and heal gaps for full OCR height
Benjamin Admin
2026-03-03 15:38:58 +01:00
-
6623a5d10e
fix(columns): extend rightmost column to content right edge (right_x)
Benjamin Admin
2026-03-03 15:26:38 +01:00
-
21ea458fcf
feat(ocr-review): replace LLM with rule-based spell-checker (REVIEW_ENGINE=spell)
Benjamin Admin
2026-03-03 15:04:27 +01:00
-
b1f7fee284
fix(ocr-review): add pipe→1 as valid OCR correction in _is_spurious_change
Benjamin Admin
2026-03-03 14:50:16 +01:00
-
dc5d76ecf5
fix(llm-review): think=false und Logging in Streaming-Version fehlten
Benjamin Admin
2026-03-03 14:43:42 +01:00
-
1ac47cd9b7
fix(llm-review): JSON-Parse-Fehler durch Control-Zeichen beheben
Benjamin Admin
2026-03-03 14:37:16 +01:00
-
fa8e38db2d
fix(llm-review): Pre-Filter entfernt — alle Einträge ans LLM senden
Benjamin Admin
2026-03-03 14:29:46 +01:00
-
f1b6246838
fix(llm-review): Diagnose-Logging + think=false + <think>-Tag-Stripping
Benjamin Admin
2026-03-03 14:13:08 +01:00
-
2fce92d7b1
fix(llm-review): LLM übersetzt nicht mehr — nur noch OCR-Ziffernfehler
Benjamin Admin
2026-03-03 13:48:54 +01:00
-
7eb03ca8d1
fix(ocr-pipeline): IndentationError in auto-mode deskew block
Benjamin Admin
2026-03-03 13:21:49 +01:00
-
50e1c964ee
feat(klausur-service): OCR-Pipeline Optimierungen (Improvements 2-4)
Benjamin Admin
2026-03-03 13:13:20 +01:00
-
2e0f8632f8
feat(klausur): Handschrift entfernen + Klausur-HTR implementiert
Benjamin Admin
2026-03-03 12:04:26 +01:00
-
606bef0591
fix(ocr-pipeline): overlap-based word assignment and empty row filtering
Benjamin Admin
2026-03-03 11:00:29 +01:00
-
ccba2bb887
fix(ocr-pipeline): show sub-columns in reconstruction and LLM review steps
Benjamin Admin
2026-03-03 10:36:27 +01:00
-
ef6237ffdf
refactor(coolify): externalize postgres, qdrant, S3
Sharang Parnerkar
2026-03-03 09:23:32 +01:00
-
75bca1f02d
fix(ocr-cells): align cell bboxes exactly to column/row coordinates
Benjamin Admin
2026-03-03 09:21:56 +01:00
-
4d428980c1
refactor(word-step): make table fully generic and fix marker-only row filter
Benjamin Admin
2026-03-03 08:45:24 +01:00
-
dea3349b23
fix(ocr-pipeline): preserve sub-column data in vocab table display
Benjamin Admin
2026-03-03 08:06:15 +01:00
-
0d72f2c836
fix(sub-columns): protect sub-columns from column_ignore pre-filter
Benjamin Admin
2026-03-03 07:55:53 +01:00
-
d6a8c1d821
fix(streaming): include page_ref columns in SSE metadata
Benjamin Admin
2026-03-03 07:48:07 +01:00
-
6527beae03
fix(sub-columns): exclude header/footer words from alignment clustering
Benjamin Admin
2026-03-03 07:33:54 +01:00
-
3904ddb493
fix(sub-columns): convert relative word positions to absolute coords for split
Benjamin Admin
2026-03-02 19:16:13 +01:00
-
6e1a349eed
fix(tests): adjust word counts so 10% threshold works correctly
Benjamin Admin
2026-03-02 19:00:14 +01:00
-
7252f9a956
refactor(ocr-pipeline): use left-edge alignment approach for sub-column detection
Benjamin Admin
2026-03-02 18:56:38 +01:00
-
f13116345b
fix(tests): use correct bbox_pct dict format in _cells_to_vocab_entries tests
Benjamin Admin
2026-03-02 18:26:24 +01:00
-
991984d9c3
fix(tests): pass columns_meta arg to _cells_to_vocab_entries tests
Benjamin Admin
2026-03-02 18:23:55 +01:00
-
1a246eb059
feat(ocr-pipeline): generic sub-column detection via left-edge clustering
Benjamin Admin
2026-03-02 18:18:02 +01:00
-
0532b2a797
fix(ocr-pipeline): skip edge-touching gaps in header/footer detection
Benjamin Admin
2026-03-02 17:54:49 +01:00
-
f1fcc67357
fix(ocr-pipeline): clamp gap detection to img_h to avoid dewarp padding
Benjamin Admin
2026-03-02 17:06:58 +01:00
-
c8981423d4
feat(ocr-pipeline): distinguish header/footer vs margin_top/margin_bottom
Benjamin Admin
2026-03-02 16:55:41 +01:00