Files
breakpilot-core/control-pipeline/api
Benjamin Admin 2188d6645e fix(llm-dedup): increase timeout to 120s, add /no_think, limit output to 200 tokens
qwen3.5 uses extended thinking by default which causes 95s+ responses
and 30s timeouts. Add /no_think to system prompt and num_predict=200
to keep responses short.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-22 20:27:58 +02:00
..