Benjamin Admin
ed7fc99fc4
Improve syllable divider insertion for dictionary pages
Rewrite cv_syllable_detect.py with pyphen-first approach:
- Remove unreliable CV gate (morphological pipe detection)
- Strip existing pipes and re-syllabify via pyphen (DE then EN)
- Merge pipe-gap spaces where OCR split words at divider positions
- Guard merges with function word blacklist and punctuation checks
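The steps above can be sketched roughly as follows. This is a minimal illustration, not the code in cv_syllable_detect.py: the names `resyllabify_word`, `resyllabify_text`, `can_merge`, `merge_pipe_gaps` and the contents of `FUNCTION_WORDS` are hypothetical, and the merge rule (join two fragments only when a hyphenator places a break exactly at the join) is an assumption about how such a guard might work. Hyphenators are passed in as plain callables so the sketch runs without pyphen installed.

```python
import re
from typing import Callable, List, Sequence, Set

# A hyphenator takes a bare word and returns it with "|" at syllable breaks.
Hyphenator = Callable[[str], str]

# Illustrative blacklist only; the real list is not shown in the commit.
FUNCTION_WORDS = {"der", "die", "das", "und", "the", "a", "to", "of"}

# Runs of letters, optionally containing pipes left over from OCR.
WORD_RE = re.compile(r"[^\W\d_]+(?:\|[^\W\d_]+)*")


def resyllabify_word(word: str, hyphenators: Sequence[Hyphenator]) -> str:
    """Strip existing pipes, then keep the first hyphenation that splits."""
    bare = word.replace("|", "")
    for hyphenate in hyphenators:  # e.g. German first, then English
        result = hyphenate(bare)
        if "|" in result:
            return result
    return bare


def resyllabify_text(text: str, hyphenators: Sequence[Hyphenator]) -> str:
    """Re-syllabify every word in a cell, leaving other characters alone."""
    return WORD_RE.sub(
        lambda m: resyllabify_word(m.group(0), hyphenators), text
    )


def break_positions(hyphenated: str) -> Set[int]:
    """Character offsets (in the bare word) where a divider was placed."""
    positions: Set[int] = set()
    offset = 0
    for part in hyphenated.split("|")[:-1]:
        offset += len(part)
        positions.add(offset)
    return positions


def can_merge(left: str, right: str, hyphenators: Sequence[Hyphenator]) -> bool:
    """Guarded merge test for two fragments separated by a space."""
    # Punctuation check: both fragments must consist of letters only.
    if not (left.isalpha() and right.isalpha()):
        return False
    # Function-word check: never glue an article or particle onto a word.
    if left.lower() in FUNCTION_WORDS or right.lower() in FUNCTION_WORDS:
        return False
    # Assumed rule: the join must coincide with a syllable break.
    return any(
        len(left) in break_positions(h(left + right)) for h in hyphenators
    )


def merge_pipe_gaps(tokens: Sequence[str],
                    hyphenators: Sequence[Hyphenator]) -> List[str]:
    """Join adjacent fragments where OCR likely read a divider as a space."""
    merged: List[str] = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and can_merge(tokens[i], tokens[i + 1], hyphenators):
            merged.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged
```

With pyphen available, a German hyphenator could be built once with `de = pyphen.Pyphen(lang="de_DE")` and passed as `lambda w: de.inserted(w, hyphen="|")`, followed by an `en_US` equivalent to get the DE-then-EN order described in the commit.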
Add false-positive prevention:
- Pre-check: skip if fewer than 5% of cells already contain a | from OCR
- Call-site check: require article_col_index (der/die/das column)
- Prevents syllabification of synonym dictionaries and word lists
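The two checks above might look roughly like this. It is a sketch under stated assumptions: `pipe_ratio` and `should_syllabify` are hypothetical names, only `article_col_index` and the 5% threshold come from the commit message, and counting only non-empty cells in the denominator is a guess.

```python
from typing import Optional, Sequence


def pipe_ratio(cells: Sequence[str]) -> float:
    """Share of non-empty cells that already contain a "|" from OCR."""
    non_empty = [c for c in cells if c.strip()]
    if not non_empty:
        return 0.0
    return sum("|" in c for c in non_empty) / len(non_empty)


def should_syllabify(cells: Sequence[str],
                     article_col_index: Optional[int],
                     min_pipe_ratio: float = 0.05) -> bool:
    """Decide whether a page looks like a syllable-divided dictionary page."""
    # Call-site check: without a der/die/das column this is likely a
    # synonym dictionary or plain word list, so leave it untouched.
    if article_col_index is None:
        return False
    # Pre-check: too few existing dividers means the page probably
    # never had syllable marks in the first place.
    return pipe_ratio(cells) >= min_pipe_ratio
```

Requiring both signals keeps the dividers off pages that merely happen to contain a few stray pipe characters.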
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-24 19:44:29 +01:00