Switch to Ollama's bge-m3 model (1024-dim) for generating embeddings,
solving the dimension mismatch with Qdrant collections. The embedding-service
is still used for chunking, reranking, and PDF extraction.
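
For illustration, a minimal sketch of what an embedding call against Ollama might look like, with a guard for the 1024-dim size mentioned above. It assumes Ollama's /api/embed endpoint on the default local address; the helper names (build_embed_request, check_dimension, embed) are hypothetical and are not taken from this codebase.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # assumption: Ollama's default local address
EXPECTED_DIM = 1024                    # bge-m3 output size, as stated in this commit


def build_embed_request(text, model="bge-m3"):
    """Build (but do not send) a POST request for Ollama's /api/embed endpoint."""
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        OLLAMA_URL + "/api/embed",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def check_dimension(vector, expected=EXPECTED_DIM):
    """Fail early if the vector would not fit the Qdrant collection."""
    if len(vector) != expected:
        raise ValueError(f"expected {expected} dimensions, got {len(vector)}")
    return vector


def embed(text):
    """Send the request and return one validated embedding (needs a running Ollama)."""
    with urllib.request.urlopen(build_embed_request(text)) as resp:
        payload = json.load(resp)
    # /api/embed returns a list of vectors under "embeddings"
    return check_dimension(payload["embeddings"][0])
```

Checking the length before upserting surfaces a wrong model as a clear error on the client side rather than as a rejected write in Qdrant.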
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The embedding-service exposes endpoints at root level (/chunk, /embed,
/extract-pdf, /rerank), not under /api/v1/. Fix the RAG service's
embedding client to use the correct paths.
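
A rough sketch of the path handling this fix implies, assuming the client joins a configurable base URL with an endpoint name. The helper endpoint_url and the example host are hypothetical; only the four endpoint names come from this commit.

```python
# Endpoints the embedding-service exposes at root level (from this commit).
ENDPOINTS = ("chunk", "embed", "extract-pdf", "rerank")


def endpoint_url(base_url, name):
    """Return the root-level URL for an endpoint, with no /api/v1/ prefix."""
    if name not in ENDPOINTS:
        raise ValueError(f"unknown embedding-service endpoint: {name}")
    # Strip any trailing slash so the result never contains a double slash.
    return base_url.rstrip("/") + "/" + name
```

For example, endpoint_url("http://embedding-service:8000", "rerank") yields "http://embedding-service:8000/rerank"; the host and port here are placeholders.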
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>