How DiscoRL Is Changing the Rules: AI That Discovers Its Own Learning Algorithms What if artificial intelligence could not only learn from experience but also invent the very rules that govern its learning, outpacing even the best human-crafted algorithms? Google DeepMind has take... AI research algorithm discovery automation DeepMind generalization meta-learning neural networks reinforcement learning
OpenEnv: Fueling the Future of Agentic AI with Open, Standardized Environments AI agents are getting smarter, but their ability to interact with the world safely and effectively hinges on more than just powerful models. They require environments purpose-built for safety, flexib... agentic systems AI agents environments Hugging Face Meta open source reinforcement learning standardization
Agent Lightning: Decoupled RL Training for Any AI Agent Agent Lightning is a Microsoft Research project that turns existing agents into trainable systems with minimal code changes. Instead of rewriting your agent to fit a trainer loop, you attach a lightwe... AI agents AutoGen DPO LangGraph OpenAI Agents reinforcement learning RLHF VERL vLLM
Code World Model: A 32B Agentic Coding LLM Grounded In Execution Traces This article analyzes a Meta FAIR technical report introducing the Code World Model (CWM), a 32-billion-parameter decoder-only transformer trained to model program execution and agentic software engin... agents code generation execution traces LLM reinforcement learning software engineering
SmallThinker: Bringing Powerful Language Models to Local Devices Researchers from Shanghai Jiao Tong University’s Institute of Parallel and Distributed Systems, the School of Artificial Intelligence, and Zenergize AI introduced SmallThinker: a family of large lang... AI Models AI training reinforcement learning
TextArena Uses Competitive Gameplay to Advance AI As language models quickly catch up with and surpass traditional benchmarks, the need for more effective measurement tools becomes urgent. TextArena steps in as an innovative, open-source platf... agentic AI AI benchmarking LLM evaluation open source reinforcement learning soft skills text-based games TrueSkill
New Qwen3-Coder Thrives in Agentic Coding and Developer Workflows Qwen3-Coder, the newest release from the Qwen team, is redefining what’s possible for agentic code models. Its flagship variant, Qwen3-Coder-480B-A35B-Instruct, leverages an impressive 480-billion par... AI coding APIs developer tools machine learning open source reinforcement learning software engineering
Ether0 Is Transforming Chemistry with AI-Powered Scientific Reasoning ether0, FutureHouse's new open-source, 24-billion-parameter model, hints at a future where scientific breakthroughs are achieved faster thanks to AI models that excel at complex reasoning in fields li... AI chemistry AI safety drug discovery FutureHouse molecular design open source AI reinforcement learning scientific reasoning