chore: add per-cluster debug logging for column alignment detection
Some checks failed
CI / go-lint (push) Has been skipped
CI / python-lint (push) Has been skipped
CI / nodejs-lint (push) Has been skipped
CI / test-go-school (push) Successful in 26s
CI / test-go-edu-search (push) Successful in 26s
CI / test-python-klausur (push) Failing after 2m0s
CI / test-python-agent-core (push) Successful in 16s
CI / test-nodejs-website (push) Successful in 18s
Some checks failed
CI / go-lint (push) Has been skipped
CI / python-lint (push) Has been skipped
CI / nodejs-lint (push) Has been skipped
CI / test-go-school (push) Successful in 26s
CI / test-go-edu-search (push) Successful in 26s
CI / test-python-klausur (push) Failing after 2m0s
CI / test-python-agent-core (push) Successful in 16s
CI / test-nodejs-website (push) Successful in 18s
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
@@ -165,6 +165,12 @@ def _cluster_columns_by_alignment(
|
|||||||
]
|
]
|
||||||
significant = sorted(primary + secondary, key=lambda c: c["mean_x"])
|
significant = sorted(primary + secondary, key=lambda c: c["mean_x"])
|
||||||
|
|
||||||
|
for c in significant:
|
||||||
|
logger.info(
|
||||||
|
" significant cluster: x=%d (range %d-%d), %d words, %d rows (%.0f%%)",
|
||||||
|
c["mean_x"], c["min_edge"], c["max_edge"],
|
||||||
|
c["count"], c["distinct_rows"], c["row_coverage"] * 100,
|
||||||
|
)
|
||||||
logger.info(
|
logger.info(
|
||||||
"alignment columns: %d clusters, %d primary, %d secondary → %d significant",
|
"alignment columns: %d clusters, %d primary, %d secondary → %d significant",
|
||||||
len(clusters), len(primary), len(secondary), len(significant),
|
len(clusters), len(primary), len(secondary), len(significant),
|
||||||
|
|||||||
Reference in New Issue
Block a user