How Bloom Is Transforming Automated Behavioral Evaluations for Frontier AI Models Evaluating cutting-edge AI models poses a significant challenge for developers and safety researchers. Manual behavioral assessments are time-consuming and struggle to keep up with rapid model advance... agentic frameworks AI evaluation AI safety Anthropic automation behavioral testing model alignment open-source
Automating AI Alignment: How Anthropic’s Bloom Reimagines Behavioral Evaluation Evaluating the behavior of advanced AI models is a growing challenge as systems become more capable and complex. Manual assessment methods can’t keep up with rapid model evolution, risking outdated be... AI alignment AI safety Anthropic automation behavioral evaluation Bloom model benchmarking open source
Unleashing On-Device Agentic Power: How Fara-7B Transforms Human-Computer Interaction Microsoft Research’s Fara-7B is a small, open-weight agentic model that interacts with your device in a human-like way. It looks to fulfil the promise of having a digital assistant that doesn’t just un... agentic AI AI safety benchmarking on-device AI open source small language models synthetic data web automation
OpenAI's gpt-oss-safeguard: A New Era for Policy-Driven AI Safety OpenAI has introduced gpt-oss-safeguard, a groundbreaking family of open-source reasoning models designed to transform safety classification in artificial intelligence. Unlike rigid, traditional clas... AI safety community collaboration content moderation developer tools machine learning open-source policy reasoning
Gemini Robotics On-Device: Bringing Advanced AI Directly to Robots Imagine a world where robots react instantly, adapt to changing tasks, and operate independently of the cloud. Google DeepMind is turning this vision into reality with Gemini Robotics On-Devic... AI safety developer tools Gemini Robotics machine learning on-device AI robotic dexterity robotics
Ether0 Is Transforming Chemistry with AI-Powered Scientific Reasoning ether0, FutureHouse's new open-source, 24-billion-parameter model, hints at a future where scientific breakthroughs are achieved faster thanks to AI models that excel at complex reasoning in fields li... AI chemistry AI safety drug discovery FutureHouse molecular design open source AI reinforcement learning scientific reasoning