Files
breakpilot-lehrer/klausur-service/backend/requirements.txt
Benjamin Admin f860eb66e6
Some checks failed
CI / go-lint (push) Has been skipped
CI / python-lint (push) Has been skipped
CI / nodejs-lint (push) Has been skipped
CI / test-go-school (push) Successful in 33s
CI / test-go-edu-search (push) Successful in 29s
CI / test-python-klausur (push) Failing after 2m12s
CI / test-python-agent-core (push) Successful in 15s
CI / test-nodejs-website (push) Successful in 17s
Add German IPA support (wiki-pronunciation-dict + epitran)
Hybrid approach mirroring English IPA:
- Primary: wiki-pronunciation-dict (636k entries, CC-BY-SA, Wiktionary)
- Fallback: epitran rule-based G2P (MIT license)

IPA modes now use language-appropriate dictionaries:
- auto/en: English IPA (Britfone + eng_to_ipa)
- de: German IPA (wiki-pronunciation-dict + epitran)
- all: EN column gets English IPA, other columns get German IPA
- none: disabled

Frontend shows CC-BY-SA attribution when German IPA is active.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-25 22:18:20 +01:00

63 lines
1.4 KiB
Plaintext

fastapi>=0.109.0
uvicorn[standard]>=0.27.0
python-multipart>=0.0.6
pyjwt>=2.8.0
httpx>=0.26.0
python-dotenv>=1.0.0
# BYOEH Dependencies
qdrant-client>=1.7.0
cryptography>=41.0.0
PyPDF2>=3.0.0
PyMuPDF>=1.24.0
# PyTorch CPU-only (smaller, no CUDA needed for Docker on Mac)
--extra-index-url https://download.pytorch.org/whl/cpu
torch>=2.0.0
# Local Embeddings (no API key needed)
sentence-transformers>=2.2.0
# MinIO Object Storage
minio>=7.2.0
# OpenCV for handwriting detection (headless = no GUI, smaller for CI)
opencv-python-headless>=4.8.0
# Tesseract OCR Python binding (requires system tesseract-ocr package)
pytesseract>=0.3.10
Pillow>=10.0.0
# RapidOCR (PaddleOCR models on ONNX Runtime — works on ARM64 natively)
rapidocr
onnxruntime
# IPA pronunciation dictionary lookup (MIT license, bundled CMU dict ~134k words)
eng-to-ipa
# German IPA rule-based fallback for OOV words (MIT license)
epitran
# Spell-checker for rule-based OCR correction (MIT license)
pyspellchecker>=0.8.1
# Syllable hyphenation for dictionary pipe-divider insertion (MIT license)
pyphen>=0.16.0
# PostgreSQL (for metrics storage)
psycopg2-binary>=2.9.0
asyncpg>=0.29.0
# Email validation for Pydantic
email-validator>=2.0.0
# DOCX export for reconstruction editor (MIT license)
python-docx>=1.1.0
# ONNX model export and optimization (Apache-2.0)
optimum[onnxruntime]>=1.17.0
# Testing
pytest>=8.0.0
pytest-asyncio>=0.23.0