Technical specification for the official ARC Prize 2026 Kaggle evaluation contract: validation vs scoring rerun, gateway sidecar integration, agent registration, submission.parquet schema, template-agent requirements, and shared ASRA notebook tooling. Documents the root cause of generic Kaggle Error failures and the gateway pattern that produced the first successful submission.
Interim consolidated evaluation report for ASRA: three tracks (gateway plumbing, competition scores, repeated-run learning), Phase 1–2 Kaggle results, Phase 2 Original ARC benchmark summary, agent version ladder, and roadmap to v1 after Stage 1 gateway migration completes.
Evaluation protocol for measuring genuine learning across repeated ARC-AGI-3 episodes: full-game and single-level setups, primary metrics (action count, dead-end rate, visit redundancy), cross-run persistence requirements, and mapping to ASRA Phase 1–3 transition logging, semantics inference, and episodic memory.