Speculative Cascades: The Hybrid Solution Driving Smarter, Faster LLM Inference
As user expectations and AI adoption soar, delivering fast, cost-effective, and high-quality results from LLMs has become a pressing goal for developers and organizations alike. Speculative cascades a...
Tags: AI efficiency, AI optimization, cascades, language models, LLM inference, machine learning, speculative decoding
Speculative Cascades: Unlocking Smarter, Faster LLM Inference
Large language models (LLMs) are transforming digital experiences, but their impressive capabilities often come at the cost of slow and expensive inference. As businesses and users expect faster, more...
Tags: AI efficiency, cascades, cost-quality tradeoff, hybrid models, language models, LLM inference, speculative decoding
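The speculative-cascade idea teased above combines two techniques: a cascade (a cheap model answers when it is confident, deferring hard cases to a large model) and speculative drafting (the small model proposes tokens that the system accepts or rejects). Below is a minimal toy sketch of that hybrid decision rule, assuming a confidence-based deferral threshold; the model functions and threshold here are hypothetical stand-ins, not the actual algorithm from either article.

```python
import random

random.seed(0)  # deterministic toy run

def small_model(prefix: str) -> tuple[str, float]:
    """Toy drafter: proposes a next token plus a confidence score.
    (Hypothetical stand-in for a small LLM's top token + probability.)"""
    return random.choice("abc"), random.random()

def large_model(prefix: str) -> str:
    """Toy expert: the expensive model we defer to. (Hypothetical stand-in.)"""
    return "a"

def speculative_cascade_step(prefix: str, defer_threshold: float = 0.5) -> tuple[str, str]:
    """One decoding step: accept the drafter's token when its confidence
    clears the deferral threshold, otherwise fall back to the large model."""
    token, conf = small_model(prefix)
    if conf >= defer_threshold:
        return token, "draft"          # cheap path: drafter's token accepted
    return large_model(prefix), "large"  # expensive path: defer to the expert

def generate(n: int = 10) -> tuple[str, dict[str, int]]:
    """Generate n tokens, counting how often each path was taken."""
    out, calls = "", {"draft": 0, "large": 0}
    for _ in range(n):
        token, path = speculative_cascade_step(out)
        out += token
        calls[path] += 1
    return out, calls
```

The cost-quality tradeoff both teasers describe lives in `defer_threshold`: raising it routes more steps to the large model (higher quality, higher cost), lowering it keeps more steps on the cheap drafter.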
Visual Studio Copilot Is Making Edits Intelligent and Reliable
The latest improvements in Visual Studio Copilot are bringing smarter and more reliable AI-driven code edits directly into development projects. Let's take a look at how the open-source IDE is powerin...
Tags: AI editing, AI models, code quality, Copilot, developer tools, speculative decoding, Visual Studio
NVIDIA Blackwell and Llama 4 Maverick: Ushering in a New Era of AI Inference Speed
An NVIDIA AI system accomplished a record-breaking 1,000+ tokens per second, per user, from a 400-billion-parameter language model, all on a single machine. NVIDIA's Blackwell architecture, paired with...
Tags: AI inference, Blackwell, GPU acceleration, Llama 4, NVIDIA, speculative decoding, TensorRT-LLM