e8ec50e0fc
YAML-based test package with 4 categories (6 each):
- Standard sector cases (Telco, SaaS, Energy, Automotive, Health, Law)
- Scope-beats-sector (Bank+Battery, AI recruiting, White-Label, Payments)
- False friends (Stripe != PSD2, Hotline != TKG, Repo-signals != regulation)
- Escalation (IoT-SIM, FinTech unclear, Treuhand/fiduciary, AI diagnosis)

Enforces 5 acceptance rules: no false certainty, scope > sector, repo signals insufficient, standard first, 40%+ negative tests.

Scoring framework: must_include + must_not_include + reasoning + escalation.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>