AgenticAI
What We Learned by Watching One Model Watch Another
A frontier AI model read nine labeled debugging traces — three from itself, three from each of two peer models, all on the same bug. In a single API call it wrote 101 words describing what the peers did that it didn't. Those 101 words, pasted verbatim into its