Skip to Content

Deep Learning is Revolutionizing Computational Chemistry at Microsoft

Predicting Molecules Without the Lab: The Power of AI

Get All The Latest Research & News!

Thanks for registering!

Thanks to recent breakthroughs at Microsoft Research, scientists may be able to forecast the properties of new molecules with laboratory-level accuracy, all from behind a computer screen.  By marrying deep learning with massive, high-quality datasets, researchers are redefining how Density Functional Theory (DFT) predicts molecular behavior and material properties.

DFT: The Cornerstone and Its Challenge

DFT remains indispensable in computational chemistry for simulating matter at the atomic scale. It circumvents the complexity of solving the full electron problem by relying on the exchange-correlation (XC) functional, a mathematical shortcut that lacks a perfect formula. 

Scientists have historically depended on hundreds of hand-crafted approximations, each a compromise between accuracy and computational practicality.


The Quest for Chemical Accuracy

Despite DFT’s usefulness, its predictions typically fall short of “chemical accuracy”, the gold standard for matching experimental results. 

This shortfall has kept computational chemistry in a supporting role, helping interpret but rarely predict experimental outcomes. The field’s long-standing goal has been to discover a universal XC functional that can consistently deliver reliable predictions without excessive computational costs.


Deep Learning Changes the Game

Microsoft’s team realized that deep learning could bypass the design limitations of traditional XC functionals. By training neural networks directly on rich datasets, they can let AI learn the intricate mapping from electron density to XC energy. The primary hurdle? Amassing enough precise data to satisfy these data-driven models.

Data, Innovation, and Model Design

To build a formidable training foundation, the researchers collaborated with academic partners to calculate atomization energies for a vast array of small molecules, using advanced quantum chemistry techniques. 

This produced a dataset that dwarfs previous efforts, now partially open to the community. For modeling, they crafted a unique neural architecture capable of learning from complex electron density data, ensuring scalability and accuracy.

Meet Skala: A Leap Forward

The result is Skala, a machine-learned XC functional that achieves near-chemical accuracy, a first for the field. Skala matches or outperforms top-tier hybrid functionals on benchmark tests, but at a much lower computational cost. This advance signals a major turning point, as deep learning finally surmounts a long-standing barrier in DFT.

  • Unprecedented data: Training on a large, diverse, and accurate chemical dataset
  • Innovative modeling: New neural architectures that learn directly from electron density
  • Benchmark achievements: Skala delivers the accuracy needed for predictive chemistry
  • Community benefit: Open release of datasets and the Skala functional to stimulate further research

Skala is a new density functional for the exchange-correlation energy that employs meta-GGA ingredients plus D3 dispersion and machine-learned nonlocal features of the electron density. Some exact constraints were imposed, and some others “emerge” from the fitting to about 150,000 accurate energy differences for sp molecules and atoms. Skala achieves high, hybrid-like accuracy on a large and diverse data set of properties of main group molecules, which has no overlap with its training set. The computational cost of Skala is higher than that of the r2SCAN meta-GGA for small molecules, but about the same for systems with 1,000 or more occupied orbitals. Its cost seems to be only 10% of the cost of standard hybrids and 1% of the cost of local hybrids. Developed by a Microsoft team of density functional theorists and deep-learning experts, Skala could be the first machine-learned density functional to compete with existing functionals for wide use in computational chemistry, and a sign of things to come in that and related fields. Skala learned from big data and was taught by insightful human scientists.”

— John P. Perdew, Professor of Physics, School of Science and Engineering, Tulane University

Broader Impact and Industry Momentum

With Skala’s strong debut in main group chemistry, Microsoft Research is ramping up efforts to broaden their datasets and address more chemical systems. Industry stakeholders in pharmaceuticals and materials science see this as transformative, promising faster, more accurate computational screening and discovery.

Open Science and Collaboration

The team is launching an early access program, inviting companies and labs to apply these advances in real-world scenarios. By sharing their models and data openly, Microsoft aims to catalyze global collaboration, accelerating progress in computational chemistry, biochemistry, and beyond.

Deep learning is set to deliver on computational chemistry’s greatest promise: accurate, efficient, and general models for predicting the behavior of molecules and materials. As these tools become more accessible, expect a seismic shift in how scientific discovery unfolds, moving ever more from the lab bench to the computer.

Source: Microsoft Research Blog


Deep Learning is Revolutionizing Computational Chemistry at Microsoft
Joshua Berkowitz June 29, 2025
Share this post