Runloop Scenario API
Author and run individual evaluation Scenarios, the atomic test unit for AI coding agents. Each Scenario captures input context, environment setup, an agent invocation, and one or more Scenario Scorers that produce pass/fail signals plus structured scores. Scenarios can be used standalone or composed into Benchmarks.
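To make the shape of a Scenario concrete, here is a minimal conceptual sketch in plain Python. It is not the Runloop SDK or its request schema: every name in it (`Scenario`, `Scorer`, `ScorerResult`, `echo_agent`, the `threshold` field, the 0.0 to 1.0 score range) is a hypothetical stand-in chosen for illustration. It only models the four parts described above: input context, environment setup, an agent invocation, and scorers that yield a pass/fail signal plus a numeric score.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical types for illustration only; these are NOT Runloop API objects.


@dataclass
class ScorerResult:
    name: str
    score: float   # assumed to lie in 0.0..1.0
    passed: bool   # pass/fail signal derived from the score


@dataclass
class Scorer:
    name: str
    fn: Callable[[str], float]  # maps the agent's output to a score
    threshold: float = 1.0      # assumed cutoff: score >= threshold passes

    def evaluate(self, output: str) -> ScorerResult:
        score = self.fn(output)
        return ScorerResult(self.name, score, score >= self.threshold)


@dataclass
class Scenario:
    name: str
    input_context: str            # the task handed to the agent
    environment: Dict[str, str]   # stand-in for environment setup
    scorers: List[Scorer] = field(default_factory=list)

    def run(self, agent: Callable[[str, Dict[str, str]], str]) -> List[ScorerResult]:
        # Invoke the agent once, then apply every scorer to its output.
        output = agent(self.input_context, self.environment)
        return [scorer.evaluate(output) for scorer in self.scorers]


def echo_agent(context: str, env: Dict[str, str]) -> str:
    # Toy agent used only to exercise the sketch.
    return f"patched: {context}"


scenario = Scenario(
    name="fix-typo",
    input_context="fix the typo in README",
    environment={"image": "python:3.12"},
    scorers=[
        Scorer("mentions_readme", lambda out: 1.0 if "README" in out else 0.0),
        Scorer("length_ratio", lambda out: min(len(out) / 100, 1.0), threshold=0.2),
    ],
)

results = scenario.run(echo_agent)
for r in results:
    print(r.name, r.score, "PASS" if r.passed else "FAIL")
```

In this sketch a Benchmark would simply be a list of such Scenarios run in sequence; how the real service represents and composes them is not specified here.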