Makefile + pytest + GitHub Actions workflow for automated regression

- make install / make eval / make test
- pytest integration with demo_cases.yaml
- Golden outputs for 6 priority cases
- Report generation (JSON + Markdown)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>