LLMs Score 0% in Elite Coding Problems It finds that frontier models like GPT-4 struggle—scoring just 53% on medium-level problems and a shocking 0% on hard ones.
AI Agents Flunk CRM Tests, Mishandle Confidential Data The findings raise red flags for enterprises betting on AI agents to drive efficiencies.
AI Supercomputers by 2030 Could Cost $200 bn, Use 2 mn Chips, and Demand Power of 9 Nuclear Reactors Power constraints, not compute or chips, will likely become the primary bottleneck in AI advancement
Apple Finds AI Is Great at Pretending to Think—But Not Much Else The study challenges the prevailing belief that such models truly “think” like humans
Cisco Unveils Quantum Networking Chip Prototype The chip leverages existing networking technology to connect smaller quantum machines into larger, more powerful systems
Detecting Hallucinations in LLMs is Impossible, New Research Says If AI system is trained only on correct data, automated hallucination detection becomes fundamentally impossible