breakpilot-core

Author	SHA1	Message	Date
Benjamin Admin	96f94475f6	fix: downgrade to PaddleOCR 2.x (3.x uses too much RAM on CPU). PaddlePaddle 3.x + PP-OCRv5 requires >6GB RAM and has oneDNN compatibility issues on CPU. PaddleOCR 2.x with PP-OCRv4 works reliably with ~2-3GB RAM and has no MKLDNN issues. Changes: pin paddlepaddle<3.0.0 and paddleocr<3.0.0; simplify main.py to a single init strategy and the direct 2.x result format; re-enable warmup (fits in memory with 2.x). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 19:13:33 +01:00
Benjamin Admin	3fd3336f6c	fix: force-disable oneDNN via paddle.set_flags and enable_mkldnn=False. The previous FLAGS_use_mkldnn env var was ignored by PaddlePaddle 3.x. Now using the paddle.set_flags() API and the PaddleOCR enable_mkldnn param. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 19:01:46 +01:00
Benjamin Admin	eaba087d11	fix: disable oneDNN/MKLDNN and support PaddleOCR 3.x result format. Changes: set FLAGS_use_mkldnn=0 before the paddle import to avoid the ConvertPirAttribute2RuntimeAttribute error; support both PaddleOCR 2.x (list) and 3.x (dict) result formats; use use_textline_orientation (3.x) instead of use_angle_cls; remove the latin lang fallback (not supported in 3.x). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 18:52:31 +01:00
Benjamin Admin	ed2cc234b8	fix: add error handling and logging to OCR endpoint. Return a detailed error message instead of a generic 500, and handle empty OCR results gracefully. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 18:37:32 +01:00
Benjamin Admin	ffd3fd1d7c	fix: remove warmup OCR call (causes OOM on 6G container). The warmup OCR call during startup pushes memory over 6G and causes OOM kills + restart loops. The first real OCR request will be slow (JIT compilation), but the container stays stable. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 18:24:55 +01:00
Benjamin Admin	8979aa8e43	fix: add warmup OCR call to avoid timeout on first request. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 16:56:08 +01:00
Benjamin Admin	65177d3ff7	fix: robust PaddleOCR init with multiple fallback strategies. PaddleOCR 3.x removed the show_log param and lang='latin'. Try multiple init strategies in order until one succeeds. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 11:09:33 +01:00
Benjamin Admin	5ee3cc0104	fix: load PaddleOCR model in background thread. The import and model loading can take minutes and was blocking the startup event, causing health checks to time out. The model now loads in a background thread; the health endpoint returns 200 immediately with status 'loading' until the model is ready. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 10:21:59 +01:00
Benjamin Admin	b36712247b	fix: add detailed logging for PaddleOCR model loading debug. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 10:19:10 +01:00
Benjamin Admin	86b11c7e5f	fix: catch all exceptions in PaddleOCR version fallback. PaddleOCR 2.8.1 throws a generic Exception (not ValueError) when ocr_version='PP-OCRv5' is used. Broadened the except clause to catch any error and fall back to lang='latin' for older versions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 10:12:32 +01:00
Benjamin Admin	8003dcac39	fix: PaddleOCR 3.4.0 compatibility (use lang=en with PP-OCRv5). PaddleOCR 3.4.0 removed 'latin' language support, causing a ValueError at startup. Now uses lang='en' with ocr_version='PP-OCRv5' and falls back to lang='latin' for older PaddleOCR versions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 09:54:52 +01:00
Benjamin Admin	79891063dd	fix: pin PaddlePaddle 2.6.2 + PaddleOCR 2.8.1 (stable, no PIR bug). PaddlePaddle 3.x has a oneDNN/PIR executor bug. Back to 2.6.2 with the proven ocr() API instead of predict(). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 13:32:54 +01:00
Benjamin Admin	3133615044	fix: add libgomp1 (OpenMP) + remove unused lang parameter. PaddlePaddle needs libgomp.so.1 for inference. lang is ignored when model_name is set explicitly. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 13:19:47 +01:00
Benjamin Admin	2bc0f87325	fix: PaddleOCR model pre-load at startup + 5min healthcheck grace. The model is loaded at container start (not on the first request). Health check start_period raised to 300s for the initial download. /health returns "loading" until the model is ready. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 13:12:14 +01:00
Benjamin Admin	4ee38d6f0b	fix: remove show_log (unknown in PaddleOCR v3 API). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 12:52:52 +01:00
Benjamin Admin	992d4f2a6b	fix: PaddleOCR v3 API (explicit model name + predict() instead of ocr()). lang="latin" needs text_recognition_model_name in PP-OCRv5. The new API uses predict() instead of ocr(); result format adapted. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 12:47:07 +01:00
Benjamin Admin	7cdb53051f	feat: PaddleOCR Service (PP-OCRv5 Latin on x86_64). Microservice for PaddleOCR on Hetzner. FastAPI with /ocr and /health endpoints, API-key auth, 4GB memory limit, model cache volume. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 10:20:41 +01:00

17 Commits
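
Note on the background-loading pattern: commits 5ee3cc0104 and 2bc0f87325 describe loading the model off the startup path while the health endpoint reports 'loading'. The sketch below illustrates that idea using only the Python standard library. It is not the repository's main.py: the class name BackgroundModel, the loader callable, and the 'ok' and 'error' status values are assumptions made for illustration (only 'loading' appears in the log), and a real service would pass a function that constructs the OCR engine.

```python
import threading
import time
from typing import Any, Callable, Dict, Optional


class BackgroundModel:
    """Load a model in a daemon thread and report a health status meanwhile.

    Hypothetical helper: the name and the 'ok'/'error' statuses are
    illustrative assumptions, not taken from the repository.
    """

    def __init__(self, loader: Callable[[], Any]) -> None:
        self._loader = loader  # assumed: a function that builds the OCR engine
        self._lock = threading.Lock()
        self._model: Optional[Any] = None
        self._error: Optional[str] = None
        self._thread = threading.Thread(target=self._load, daemon=True)

    def start(self) -> None:
        """Kick off loading without blocking the caller (e.g. a startup hook)."""
        self._thread.start()

    def _load(self) -> None:
        try:
            model = self._loader()
            with self._lock:
                self._model = model
        except Exception as exc:
            # Keep the process alive and surface the failure via health().
            with self._lock:
                self._error = str(exc)

    def health(self) -> Dict[str, str]:
        """Return a small status payload that a /health handler could serve."""
        with self._lock:
            if self._error is not None:
                return {"status": "error", "detail": self._error}
            if self._model is None:
                return {"status": "loading"}
            return {"status": "ok"}

    def wait(self, timeout: Optional[float] = None) -> None:
        """Block until loading finishes or the timeout elapses."""
        self._thread.join(timeout)


if __name__ == "__main__":
    def slow_loader() -> object:
        time.sleep(0.2)  # stand-in for a slow import and model download
        return object()

    model = BackgroundModel(slow_loader)
    model.start()
    print(model.health())  # reported while the loader is still running
    model.wait()
    print(model.health())  # reported after loading has finished
```

Holding the lock only around the state fields keeps health() cheap, so a health probe is never delayed by the load itself. A load failure is reported as a status value and does not crash the process.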