Blog Posts | Joshua Berkowitz

2 Articles

RLHF ×

Agent Lightning: Decoupled RL Training for Any AI Agent

Agent Lightning is a Microsoft Research project that turns existing agents into trainable systems with minimal code changes. Instead of rewriting your agent to fit a trainer loop, you attach a lightwe...

AI agents AutoGen DPO LangGraph OpenAI Agents reinforcement learning RLHF VERL vLLM

Oct 8, 2025

0 58608

Github Repos

Rubrics as Rewards: A New Paradigm for Training Reliable AI

AI models face significant challenges when applied to nuanced, high-stakes fields like medicine and science. Standard training techniques, such as Reinforcement Learning from Human Feedback (RLHF), of...

AI safety AI training expert guidance language models model evaluation RLHF rubrics

Sep 23, 2025

0 6952

News

Our latest content

Check out what's new !

See all

Ads

Prompt Maker Image Generator

Struggling with the perfect AI image prompt? My free app helps you generate brilliant ideas and instantly creates an image to match. Go from concept to creation in two clicks!

Try It

Most Popular Articles

Check out what the hot topics are!

See all

Follow us

Our latest content

Prompt Maker Image Generator

Most Popular Articles

Every shirt tells a story—and every story

#ClothingForACause