VS Code Gets a Major AI Code Search Upgrade: GitHub Copilot’s New Embedding Model Explained

Instant, Accurate Code Search Inside Your Editor

Get All The Latest to Your Inbox!

Advertise Here!

Gain premium exposure to our growing audience of professionals. Learn More

Inquire Now

Modern development demands quick access to the right code. With GitHub Copilot's upgraded embedding model in Visual Studio Code, developers now enjoy smarter, context-aware code search that pinpoints relevant snippets instantly.

Why Embeddings Revolutionize Code Search

Traditional code search relies on keywords, often missing the developer's true intent. The new embedding model changes that by representing code and queries as vectors (embeddings), allowing Copilot to match based on meaning, not just words. This means you can find the right documentation, bug reference, or function even if you don’t use the exact terminology.

What’s New: Significant Model Improvements
37.6% higher retrieval quality: More relevant results, with code acceptance ratios more than doubling for C# and Java in VS Code.

Twice the throughput: Faster embedding creation speeds up searches, helping developers stay in their flow.

8x smaller index size: Searches are more efficient and scalable on both local and remote environments.

These advances mean Copilot chat, in-editor searches, and agentic replies are more accurate and less demanding on system resources.

Sharper Context Recognition

The model distinguishes between similar code snippets with impressive accuracy. For example, it can surface a findOne method when that’s exactly what you need, avoiding confusion with a similar find function. This level of precision is invaluable when:

Searching for a specific test in a sprawling codebase
Tracking down helper methods hidden in multiple files
Debugging by finding where certain errors are handled

How the Model Was Built

GitHub trained this model using contrastive learning and techniques like InfoNCE loss and Matryoshka Representation Learning. By focusing on “hard negatives” (snippets that are nearly right but not quite) the model learns to return the most accurate results. Large language models helped identify these tricky examples from public GitHub repositories and Microsoft’s own codebases.

The training set was diverse, covering Python, Java, C++, JavaScript/TypeScript, and C#, ensuring strong performance across major languages. Multiple benchmarks tested the model’s ability to connect natural language queries with code, recognize code similarity, and retrieve relevant solutions.

This release marks a strong step forward, but GitHub isn’t stopping here. Plans include expanding the training data to more languages and repositories, improving hard negative mining, and scaling up model size and accuracy. The ultimate goal is to make Copilot an even smarter, more helpful coding assistant for every developer.

Conclusion

With its new embedding model, GitHub Copilot enables more meaningful code search, faster results, and a streamlined development process in VS Code. It’s a leap forward in AI-powered coding, making it easier than ever to find and use the code you need.

Source: GitHub Blog

in News

# AI code search developer productivity embeddings GitHub Copilot machine learning software engineering VS Code

Source: https://github.blog/news-insights/product-news/copilot-new-embedding-model-vs-code/

Joshua Berkowitz October 11, 2025

Views 19701

Share this post

blogs

Our latest content

Check out what's new !

See all

Ads

Prompt Maker Image Generator

Struggling with the perfect AI image prompt? My free app helps you generate brilliant ideas and instantly creates an image to match. Go from concept to creation in two clicks!

Try It

Most Popular Articles

Check out what the hot topics are!