# document-crawler Python/FastAPI service for document ingestion and compliance gap analysis. Parses PDF, DOCX, XLSX, PPTX; runs gap analysis against compliance requirements; coordinates with `ai-compliance-sdk` via the LLM gateway; archives to `dsms-gateway`. **Port:** `8098` (container: `bp-compliance-document-crawler`) **Stack:** Python 3.11, FastAPI. ## Architecture Small service — already well under the LOC budget. Follow `../AGENTS.python.md` for any additions. ## Run locally ```bash cd document-crawler pip install -r requirements.txt uvicorn main:app --reload --port 8098 ``` ## Tests ```bash pytest tests/ -v ``` ## Public API surface `GET /health`, document upload/parse endpoints, gap-analysis endpoints. See the OpenAPI doc at `/docs` when running.