AI coding benchmark MirrorCode published its full results June 26, showing Claude Opus 4.7 autonomously rebuilt a 60,000-line interpreter and scored 56% overall — completing tasks that take human ...
Anthropic pulled two new AI models offline after the Trump administration said they were too dangerous for foreigners to use.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results