Benjamin Admin e426de937c
fix: expand narrow columns + lower dewarp thresholds for small angles
Two fixes for the edge case where residual shear pushes content out of
narrow columns (marker, page_ref):

1. Column expansion (Step 10): After detection, narrow columns (<10% of
   content width) expand into adjacent whitespace gaps, claiming up to
   40% of each gap but never extending past the nearest word in the
   neighbor column. This gives marker/page_ref columns breathing room.

2. Dewarp sensitivity: Lower minimum angle from 0.15° to 0.08°, lower
   ensemble min confidence from 0.5 to 0.35, lower final threshold
   from 0.5 to 0.4, and skip quality gate for small corrections
   (<0.5°) where projection variance change is negligible.
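
The two rules above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the `Column` type, the function names, the `(x_start, x_end)` word representation, and the exact comparison operators at the boundaries are all assumptions. Only the numeric values (10%, 40%, 0.08°, 0.35, 0.4, 0.5°) come from the description above.

```python
from dataclasses import dataclass, field

# Values taken from the commit message; all names below are hypothetical.
NARROW_FRACTION = 0.10          # column is "narrow" below 10% of content width
GAP_CLAIM_FRACTION = 0.40       # a narrow column may claim up to 40% of a gap
MIN_ANGLE_DEG = 0.08            # was 0.15
ENSEMBLE_MIN_CONFIDENCE = 0.35  # was 0.5
FINAL_THRESHOLD = 0.4           # was 0.5
SMALL_CORRECTION_DEG = 0.5      # below this, the quality gate is skipped


@dataclass
class Column:
    x0: float
    x1: float
    words: list = field(default_factory=list)  # (x_start, x_end) per word


def expand_narrow_columns(columns, content_width):
    """Return new columns where narrow ones grow into adjacent gaps."""
    cols = sorted(columns, key=lambda c: c.x0)
    out = []
    for i, col in enumerate(cols):
        x0, x1 = col.x0, col.x1
        if (x1 - x0) < NARROW_FRACTION * content_width:
            if i > 0:
                left = cols[i - 1]
                gap = col.x0 - left.x1
                if gap > 0:
                    # Right edge of the nearest word in the left neighbor.
                    nearest = max((w[1] for w in left.words), default=left.x1)
                    x0 = max(col.x0 - GAP_CLAIM_FRACTION * gap,
                             min(nearest, col.x0))
            if i < len(cols) - 1:
                right = cols[i + 1]
                gap = right.x0 - col.x1
                if gap > 0:
                    # Left edge of the nearest word in the right neighbor.
                    nearest = min((w[0] for w in right.words), default=right.x0)
                    x1 = min(col.x1 + GAP_CLAIM_FRACTION * gap,
                             max(nearest, col.x1))
        out.append(Column(x0, x1, list(col.words)))
    return out


def usable_estimates(estimates):
    """Keep (angle_deg, confidence) pairs that meet the ensemble minimum."""
    return [e for e in estimates if e[1] >= ENSEMBLE_MIN_CONFIDENCE]


def should_apply_dewarp(angle_deg, confidence, quality_gate_passed):
    """Decide whether to apply a correction under the lowered thresholds."""
    if abs(angle_deg) < MIN_ANGLE_DEG:
        return False
    if confidence < FINAL_THRESHOLD:
        return False
    if abs(angle_deg) < SMALL_CORRECTION_DEG:
        return True  # small correction: quality gate is skipped
    return quality_gate_passed
```

In this sketch each narrow column expands against the original (unexpanded) boundaries of its neighbors, so two narrow columns sharing a gap claim at most 80% of it between them and cannot overlap.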

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-04 09:32:47 +01:00