Blog Posts | Joshua Berkowitz

6 Articles

github × AI safety ×

How Bloom Is Transforming Automated Behavioral Evaluations for Frontier AI Models

Evaluating cutting-edge AI models poses a significant challenge for developers and safety researchers. Manual behavioral assessments are time-consuming and struggle to keep up with rapid model advance...

agentic frameworks AI evaluation AI safety Anthropic automation behavioral testing model alignment open-source

Dec 30, 2025

0 4004

News

Automating AI Alignment: How Anthropic’s Bloom Reimagines Behavioral Evaluation

Evaluating the behavior of advanced AI models is a growing challenge as systems become more capable and complex. Manual assessment methods can’t keep up with rapid model evolution, risking outdated be...

AI alignment AI safety Anthropic automation behavioral evaluation Bloom model benchmarking open source

Dec 26, 2025

0 2937

News

Unleashing On-Device Agentic Power: How Fara-7B Transforms Human-Computer Interaction

Microsoft Research’s Fara-7B is a small, open-weight agentic model that interacts with your device in a human-like way. It looks to fulfil the promise ofhaving a digital assistant that doesn’t just un...

agentic AI AI safety benchmarking on-device AI open source small language models synthetic data web automation

Nov 25, 2025

0 10252

News

OpenAI's gpt-oss-safeguard: A New Era for Policy-Driven AI Safety

OpenAI has introduced gpt-oss-safeguard , a groundbreaking family of open-source reasoning models designed to transform safety classification in artificial intelligence. Unlike rigid, traditional clas...

AI safety community collaboration content moderation developer tools machine learning open-source policy reasoning

Nov 5, 2025

0 13937

News

Gemini Robotics On-Device: Bringing Advanced AI Directly to Robots

Imagine a world where robots react instantly, adapt to changing tasks, and operate independently of the cloud. Google's DeepMind is turning this vision into reality with Gemini Robotics On-Devic...

AI safety developer tools Gemini Robotics machine learning on-device AI robotic dexterity robotics

Jul 7, 2025

0 5533

News

Ether0 Is Transforming Chemistry with AI-Powered Scientific Reasoning

ether0, FutureHouse's new open-source, 24-billion-parameter model, hints at a future where scientific breakthroughs are achieved faster thanks to AI models that excel at complex reasoning in fields li...

AI chemistry AI safety drug discovery FutureHouse molecular design open source AI reinforcement learning scientific reasoning

Jun 15, 2025

0 8085

News

Our latest content

Check out what's new !

See all

Ads

Prompt Maker Image Generator

Struggling with the perfect AI image prompt? My free app helps you generate brilliant ideas and instantly creates an image to match. Go from concept to creation in two clicks!

Most Popular Articles

Check out what the hot topics are!

See all

Every shirt tells a story—and every story

#ClothingForACause