Blog Posts | Joshua Berkowitz

3 Articles

github × benchmarks ×

SWE-Bench Pro Sets A Higher Bar For AI Coding Agents

As AI coding agents approach human-level performance on existing benchmarks, the research community faces a critical challenge: how do we continue measuring progress when current evaluation suites are...

AI benchmarks coding agents software engineering

Sep 23, 2025

0 116226

Papers

Smarter Nucleic Acid Design: How NucleoBench and AdaBeam Are Unlocking the Future of Nucleic Acid Engineering

Designing DNA and RNA with precision is crucial for advances in modern therapeutics, but the vastness of biological sequence space makes this an immense computational challenge. Traditional search met...

AI algorithms benchmarks bioinformatics nucleic acids open source sequence design

Sep 18, 2025

0 4785

News

Why AI Isn’t Ready to Take Over All of Software Engineering - Yet

Many of us software dev are starting to envision a future where AI handles the tedious aspects of software engineering; tidying up legacy code, migrating complex systems, and squashing bugs, while hum...

AI challenges autonomous systems benchmarks code generation human-AI collaboration large codebases software engineering

Aug 3, 2025

0 5357

News

Our latest content

Check out what's new !

See all

Ads

Prompt Maker Image Generator

Struggling with the perfect AI image prompt? My free app helps you generate brilliant ideas and instantly creates an image to match. Go from concept to creation in two clicks!

Most Popular Articles

Check out what the hot topics are!

See all

Every shirt tells a story—and every story

#ClothingForACause