DeepSWE-Preview Sets a New Standard for Open-Source Coding Agents with Reinforcement Learning Imagine a coding agent that not only keeps pace with its open-source contemporaries but actually outshines them, all powered by reinforcement learning (RL). DeepSWE-Preview, a collaboration be... coding agents emergent behavior LLM open source reinforcement learning rLLM software engineering test-time scaling
PRefLexOR: Teaching AI to Reason Like Scientists Imagine an AI that doesn't just provide fast answers but engages in deep, reflective reasoning—constantly refining its understanding the way a dedicated scientist would. PRefLexOR, a novel framework f... AI reasoning knowledge graphs language models preference optimization recursive modeling reinforcement learning scientific discovery
Ether0 Is Transforming Chemistry with AI-Powered Scientific Reasoning ether0, FutureHouse's new open-source, 24-billion-parameter model, hints at a future where scientific breakthroughs are achieved faster thanks to AI models that excel at complex reasoning in fields li... AI chemistry AI safety drug discovery FutureHouse molecular design open source AI reinforcement learning scientific reasoning