PASS Puts Probabilities on Agentic Workflows for Safer, Adaptive Chest X-ray AI
Chest X-rays are fast, cheap, and ubiquitous, but reading them well demands careful multi-structure reasoning. The PASS paper introduces a multimodal agentic system that treats chest X-ray (CXR) analy...
Tags: agentic systems, CXR, medical AI, multimodal, radiology, reinforcement learning

SmallThinker: Bringing Powerful Language Models to Local Devices
Researchers from Shanghai Jiao Tong University’s Institute of Parallel and Distributed Systems, the School of Artificial Intelligence, and Zenergize AI introduced SmallThinker: a family of large lang...
Tags: AI Models, AI training, reinforcement learning

TextArena Uses Competitive Gameplay to Advance AI
As language models quickly catch up with and surpass traditional benchmarks, the need for more effective measurement tools becomes urgent. TextArena steps in as an innovative, open-source platf...
Tags: agentic AI, AI benchmarking, LLM evaluation, open source, reinforcement learning, soft skills, text-based games, TrueSkill

MiroMind-M1: Redefining Open-Source Mathematical Reasoning for AI
Open-source AI is entering a new phase, with MiroMind-M1 leading the charge in mathematical reasoning. This project goes beyond simply releasing models by offering full transparency: every model, data...
Tags: AI transparency, CAMPO, chain-of-thought, large language models, mathematical reasoning, open-source AI, reinforcement learning, token efficiency