cf27a9530837e7f81fde6696c674f748c8887a0a
Replace projection-profile layout analysis with Tesseract word bounding box clustering to detect 5-column vocabulary layouts (page_ref, EN, DE, markers, examples). Falls back to projection profiles when < 3 clusters. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Description
No description provided
Languages
TypeScript
57.9%
Python
33.8%
Go
6.8%
C#
0.8%
Shell
0.2%
Other
0.3%