Wayfinding AI: Transforming Health Conversations Through Context-Seeking Online health information seeking has become ubiquitous, with 58.5% of US adults using the internet for health and medical questions . However, this widespread access comes with significant chall... AI Context-Seeking Conversational AI Google Research Health LLM Medical AI User Experience
AlphaEvolve: AI-Powered Mathematical Discovery at Scale Google DeepMind's AlphaEvolve is moving the bar yet again in how artificial intelligence tackles mathematical discovery. Published (preprint) in November 2025 by researchers Bogdan Georgiev, e t al at... AI AlphaProof automated reasoning combinatorics DeepMind evolutionary computation FunSearch large language models machine learning mathematical discovery mathematics optimization Terence Tao
SWE-Bench Pro Sets A Higher Bar For AI Coding Agents As AI coding agents approach human-level performance on existing benchmarks, the research community faces a critical challenge: how do we continue measuring progress when current evaluation suites are... AI benchmarks coding agents software engineering
SciVer Puts Multimodal Claim Verification To The Test Scientific claim verification and reproducibility have emerged as a critical challenges in the era of information abundance and multimodal AI systems. Unlike traditional fact-checking that relies prim... AI benchmark claim verification multimodal scientific reasoning
Scaling Laws Unveiled: The Mathematical Blueprint Behind Smarter, Cheaper AI Models Training a state-of-the-art language model can cost tens of millions of dollars and months of computation time, not to mention the expertise needed for development. Yet until now, researchers have bee... AI Chinchilla deep learning GPT-3 language models machine learning neural networks optimization principal component analysis scaling laws statistical methodology