qwen3.5 uses extended thinking by default which causes 95s+ responses and 30s timeouts. Add /no_think to system prompt and num_predict=200 to keep responses short. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
qwen3.5 uses extended thinking by default which causes 95s+ responses and 30s timeouts. Add /no_think to system prompt and num_predict=200 to keep responses short. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>