b914b6f49db8df3347bc2ac95bc8ba9b866dfbd1
right_x is the detected content boundary, which can still be several pixels short of actual text near the page margin. Since the page margin contains only whitespace, extending the last column's OCR crop to the full image width (w) is always safe and prevents right-edge text cutoff.

Affects three locations in detect_column_geometry():
- Word count logging loop
- ColumnGeometry boundary building (Step 8)
- Phantom filter boundary adjustment (Step 9)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
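The boundary change described in this commit can be illustrated with a minimal sketch. It is not the code from detect_column_geometry(): `ColumnSpan`, `column_spans`, and the pixel values are hypothetical names and numbers chosen for illustration, and the real ColumnGeometry type may carry different fields.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ColumnSpan:
    """Hypothetical stand-in for a column boundary (not the project's ColumnGeometry)."""
    x: int      # left edge of the OCR crop, in pixels
    width: int  # width of the OCR crop, in pixels


def column_spans(starts: List[int], last_end: int) -> List[ColumnSpan]:
    """Build one span per column start.

    Inner columns end where the next column begins. The last column ends
    at `last_end`, which the caller chooses: the detected content boundary
    (right_x) or the full image width (w).
    """
    spans = []
    for i, x in enumerate(starts):
        is_last = i == len(starts) - 1
        end = last_end if is_last else starts[i + 1]
        spans.append(ColumnSpan(x=x, width=end - x))
    return spans


# Assumed example values: two columns on a 1000 px wide page where the
# detected content boundary stops 20 px short of the image edge.
right_x, w = 980, 1000
before = column_spans([40, 520], last_end=right_x)  # last crop stops at right_x
after = column_spans([40, 520], last_end=w)         # last crop runs to the image edge
```

With these assumed values, the last column's crop grows from 460 px to 480 px wide, while the first column is unchanged. Only the final span depends on the choice between right_x and w, which is why the fix touches just the places that compute the last column's right edge.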
Languages: TypeScript 60.2%, Python 32.9%, Go 5.5%, C# 0.8%, CSS 0.2%, Other 0.3%