This repository has been archived on 2026-02-15 . You can view files and clone it. You cannot open issues or pull requests or push a commit.
53219e3eafef64b9ce7bcad680824edd4b5e0f4f
New modules: - tesseract_vocab_extractor.py: Bounding-box OCR with multi-PSM pipeline - grid_detection_service.py: CV-based grid/table detection for worksheets - vocab_session_store.py: PostgreSQL persistence for vocab sessions - trocr_api.py: TrOCR handwriting recognition endpoint - dsfa_rag_api.py + dsfa_corpus_ingestion.py: DSFA RAG corpus search Changes: - Dockerfile: Install tesseract-ocr + deu/eng language packs - requirements.txt: Add PyMuPDF, pytesseract, Pillow - main.py: Register new routers, init DB pools + Qdrant collections Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Description
ARCHIVIERT - Migriert nach breakpilot-core, breakpilot-lehrer, breakpilot-compliance
Languages
TypeScript
47.5%
Python
34.1%
Go
12.5%
JavaScript
2.4%
HTML
1.3%
Other
1.9%