Unleashing On-Device Agentic Power: How Fara-7B Transforms Human-Computer Interaction

Meet the Next Generation of Computer-Use Agents

Get All The Latest to Your Inbox!

Advertise Here!

Gain premium exposure to our growing audience of professionals. Learn More

Inquire Now

Microsoft Research’s Fara-7B is a small, open-weight agentic model that interacts with your device in a human-like way. It looks to fulfil the promise of having a digital assistant that doesn’t just understand your commands but actively gets things done on your computer - automating web searches, filling out forms, booking tickets, or navigating complex sites.

How Fara-7B Stands Out in the AI Landscape

Fara-7B is a compact language model with 7 billion parameters, meticulously crafted for computer-use agent (CUA) scenarios. Unlike traditional chatbots, Fara-7B performs tasks directly on your device by interpreting visual inputs, such as browser screenshots, to decide where to click, type, or scroll. No reliance on accessibility trees or external parsing means faster, more private, and efficient operation.

On-device execution: Your data stays on your machine, enhancing privacy and reducing latency.
Human-like operation: The model makes decisions based solely on the visible screen, just like a real user.
Open-weight access: Researchers and developers can freely experiment via Microsoft Foundry and Hugging Face.

Innovative Synthetic Data Training

Training a capable computer-use agent demands vast and varied interaction data. Microsoft tackles this with an innovative synthetic data pipeline, leveraging the Magentic-One multi-agent system.

Tasks are proposed, solved, and verified using automated agents, creating a dataset of 145,000 unique task trajectories and over 1 million steps, spanning real-world scenarios from shopping to travel.

Video 1: A demo of a shopping scenario with Fara-7B through Magentic-UI. Fara-7B is asked to purchase an X-Box Spongebob controller. Fara-7B goes on to complete this task, but while doing so, also stops at every Critical Point to get input and approval from the user before proceeding. Credit: Microsoft

Key Elements of the Training Approach

Task variety: Sourced from a broad index of public websites across domains.
Multi-step complexity: Tasks require planning and adaptive reasoning, not just single actions.
Stringent verification: Each task’s completion is confirmed with both automatic checks and visual validation.

Performance: Efficiency Meets Accuracy

Despite its modest size, Fara-7B delivers state-of-the-art performance on leading web automation benchmarks such as WebVoyager, Online-Mind2Web, DeepShop, and the newly launched WebTailBench.

It consistently completes tasks with fewer steps and higher accuracy than many larger models, making it both cost-effective and practical for daily use. Human evaluations highlight its real-world applicability, showing a 62% success rate on WebVoyager tasks which is a notable achievement for a model of its scale.

Streamlined task completion: Takes about 16 steps per task, outperforming peers that require up to 41 steps.
Superior benchmarking: Outshines competitors in success rates across diverse tasks.
Resource efficiency: Lower token usage means reduced computational costs.

Prioritizing Safety and Transparency

Fara-7B integrates strong safety and transparency protocols. Every action is logged for auditability, and the agent requests user approval before sensitive steps, protecting privacy and giving users ultimate control. The model is trained to refuse potentially harmful or risky actions, achieving an impressive 82% refusal rate in challenging safety tests. Microsoft encourages running Fara-7B in sandboxed environments and maintaining vigilant oversight as the field evolves.

Privacy-first design: Operates solely on visible screen data with no external data access.
User consent checkpoints: Pauses for approval at critical junctures.
Responsible AI standards: Safety training aligns with Microsoft’s rigorous guidelines.

How to Get Started and What’s Next

Fara-7B is openly available for research and development via Microsoft Foundry and Hugging Face under the MIT license. The model works seamlessly with the Magentic-UI for contained evaluations and is optimized for Copilot+ PCs running Windows 11, making it easy to experiment with real-world tasks. Looking forward, Microsoft aims to enhance Fara-7B with improved multimodal models and reinforcement learning, inviting the AI community to contribute and accelerate progress.

Takeaway

Fara-7B signals a major leap in agentic AI by delivering practical, private, and efficient automation for everyday computing. Its on-device capabilities promise a future where digital assistants are not just smarter, but safer and more aligned with user needs. As the technology and community mature, Fara-7B paves the way for the next era of personalized, trustworthy AI agents.

Source: Microsoft Research Blog

in News

# agentic AI AI safety benchmarking on-device AI open source small language models synthetic data web automation

Source: https://www.microsoft.com/en-us/research/blog/fara-7b-an-efficient-agentic-model-for-computer-use/

Joshua Berkowitz November 25, 2025

Views 7172

Share this post

blogs

Our latest content

Check out what's new !

See all

Ads

Prompt Maker Image Generator

Struggling with the perfect AI image prompt? My free app helps you generate brilliant ideas and instantly creates an image to match. Go from concept to creation in two clicks!

Try It

Most Popular Articles

Check out what the hot topics are!