>Scope
Run Microsoft's eval-recipes benchmarks to validate amplihack improvements against baseline agents.

**Version**: 0.1.0-preview | **Last Updated**: 2025-11-15 | **Framework Version**: 0.1.0-preview