Benjamin Admin
2b1c499d54
fix(ocr-pipeline): filter OCR noise from image areas and artifacts
Two generic noise filters added to _ocr_single_cell():
1. Word confidence filter (conf < 30): removes low-confidence words
before text assembly. Catches trailing artifacts like "Es)" after
real text, and standalone noise from image edges.
2. Cell noise filter: clears cells whose entire text has no real
alphabetic word (>= 2 letters). Catches fragments like "E:", "3",
"u", "D", "2.77", "and )" from image areas, while keeping real
short words like "Ei", "go", "an".
Both filters apply to word-lookup AND cell-OCR fallback results.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-02 09:56:54 +01:00
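
The two filters described in the commit message could look roughly like the sketch below. This is not the code from the commit: the helper names, the word-dict shape (`text`/`conf` keys), and the constant name are assumptions made for illustration; only the threshold (conf < 30) and the "at least one alphabetic run of >= 2 letters" rule come from the message. Note that the sketch applies the letter-run rule literally, so a fragment such as "and )" would survive the second filter here and would have to be caught by the confidence filter or by an extra rule not shown.

```python
# Hypothetical sketch of the two noise filters; names are illustrative only.
MIN_WORD_CONF = 30  # threshold taken from the commit message (conf < 30 is dropped)


def filter_low_conf_words(words, min_conf=MIN_WORD_CONF):
    """Filter 1: drop words whose OCR confidence is below min_conf.

    Assumes each word is a dict with "text" and "conf" keys.
    """
    return [w for w in words if w.get("conf", 0) >= min_conf]


def has_real_word(text):
    """Return True if text contains a run of at least two consecutive letters."""
    run = 0
    for ch in text:
        run = run + 1 if ch.isalpha() else 0
        if run >= 2:
            return True
    return False


def clean_cell_text(words):
    """Apply filter 1, assemble the text, then apply filter 2 (cell noise)."""
    kept = filter_low_conf_words(words)
    text = " ".join(w["text"] for w in kept).strip()
    # Filter 2: clear the whole cell if no real alphabetic word remains.
    return text if has_real_word(text) else ""


if __name__ == "__main__":
    cell = [{"text": "Haus", "conf": 91}, {"text": "Es)", "conf": 12}]
    print(repr(clean_cell_text(cell)))                          # trailing artifact removed
    print(repr(clean_cell_text([{"text": "E:", "conf": 80}])))  # noise cell cleared
    print(repr(clean_cell_text([{"text": "Ei", "conf": 80}])))  # short real word kept
```

Running the confidence filter before text assembly means the cell-level check only sees words that already passed the threshold, which matches the order given in the commit message.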