Fix syllable+IPA combination: strip bracket content before IPA guard

The _IPA_RE check in _syllabify_text() skipped any cell containing an IPA character. Once German IPA insertion had added a transcription such as [bɪltʃøn], the check blocked syllabification of the whole cell. The check now strips bracket content first, so programmatically inserted IPA no longer prevents syllable divider insertion in the surrounding text.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 00:03:10 +01:00
parent f860eb66e6
commit 525de55791
1 changed file with 5 additions and 2 deletions
@@ -150,8 +150,11 @@ def _syllabify_text(text: str, hyph_de, hyph_en) -> str:
    if not text:
        return text

-    # Skip cells that contain IPA transcription characters
-    if _IPA_RE.search(text):
+    # Skip cells that contain IPA transcription characters outside brackets.
+    # Bracket content like [bɪltʃøn] is programmatically inserted and should
+    # not block syllabification of the surrounding text.
+    text_no_brackets = re.sub(r'\[[^\]]*\]', '', text)
+    if _IPA_RE.search(text_no_brackets):
        return text

    # Phase 1: strip existing pipe dividers for clean normalization
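
The effect of the changed guard can be reproduced in isolation. The following is a minimal sketch, not the module's actual code: the real `_IPA_RE` pattern is not shown in this diff, so the pattern below (IPA Extensions block plus stress and length marks) is an assumption, and `_BRACKET_RE`, `old_guard`, and `new_guard` are hypothetical names introduced only for illustration.

```python
import re

# ASSUMPTION: stand-in for the module's _IPA_RE, whose real pattern is not
# shown in the diff. Matches the IPA Extensions block (U+0250..U+02AF) plus
# the primary stress mark (U+02C8) and the length mark (U+02D0).
_IPA_RE = re.compile(r'[\u0250-\u02AF\u02C8\u02D0]')

# Same bracket pattern as in the diff: '[' ... ']' with no nested ']'.
_BRACKET_RE = re.compile(r'\[[^\]]*\]')


def old_guard(text: str) -> bool:
    """Pre-fix behaviour: skip the cell if IPA appears anywhere in it."""
    return bool(_IPA_RE.search(text))


def new_guard(text: str) -> bool:
    """Post-fix behaviour: ignore bracketed spans, then look for IPA."""
    text_no_brackets = _BRACKET_RE.sub('', text)
    return bool(_IPA_RE.search(text_no_brackets))


cell = 'Bildchen [bɪltʃøn]'
print(old_guard(cell))        # True  -> cell was skipped entirely
print(new_guard(cell))        # False -> surrounding text gets syllabified
print(new_guard('ˈbɪltʃən'))  # True  -> bare IPA still skips the cell
```

Because the bracket pattern requires a closing `]`, an unterminated bracket such as `[bɪl` is left in place and still triggers the skip under this sketch.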