Gemini 2.5 Deep Think scores competitive coding gold in ‘profound leap’ for abstract problem-solving
After a mathematics win in July, Gemini 2.5 Deep Think has now achieved gold-medal-level performance in competitive coding. The International Collegiate Programming Contest (ICPC) is the “oldest, ...
OpenAI and Google LLC today disclosed that their latest reasoning models achieved gold-level performance in a recent coding competition. The ICPC, as the event is called, is the world’s most ...
OpenAI and Google DeepMind demonstrated that their foundation models could outperform human coders — and win — showing that large language models (LLMs) can solve complex, previously unsolved ...
Researchers from Stanford, Princeton, and Cornell have developed a new benchmark to more accurately evaluate the coding abilities of large language models (LLMs). Called CodeClash, the new benchmark ...