Dec 6, 2024
11:15am - 11:30am
Hynes, Level 2, Room 210
Bowen Deng1, Yunyeong Choi1, Peichen Zhong1, Janosh Riebesell2, Shashwat Anand3, Zhuohan Li3, KyuJung Jun1, Kristin Persson1,3, Gerbrand Ceder1,3
1University of California, Berkeley; 2University of Cambridge; 3Lawrence Berkeley National Laboratory
Artificial intelligence is increasingly shifting the paradigm of materials discovery. One major contribution has come from machine learning interatomic potentials (MLIPs), which make it possible to scale atomic-level quantum chemical accuracy to large simulations. Recent advances have produced universal MLIPs (uMLIPs) pre-trained on diverse materials datasets, offering both ready-to-use universal force fields and robust foundations for downstream machine learning refinement. However, how well uMLIPs extrapolate to out-of-distribution (OOD) complex atomic environments remains unclear.

In this talk, we discuss the limitations and potential improvements of current foundational uMLIPs, including M3GNet, CHGNet, and MACE-MP-0, through a series of OOD benchmark tests covering surfaces, defects, phonons, ion migration barriers, and more. We uncover a systematic potential energy surface (PES) softening effect, characterized by underprediction of energies and forces across all benchmark tests and all current uMLIPs. We demonstrate that this PES softening can be effectively rectified by fine-tuning with a single additional data point. Our findings suggest that a considerable fraction of uMLIP errors are highly systematic and can therefore be efficiently corrected. These results provide a theoretical foundation for the widely observed data-efficient performance boosts achieved by fine-tuning uMLIPs and highlight the advantage of next-generation atomic modeling with large, comprehensive foundational AI models.
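The softening diagnosis described above amounts to a systematic rescaling of predicted forces relative to reference (DFT) values. As a minimal illustrative sketch, not the authors' actual benchmark code, one can quantify softening as the least-squares slope of predicted versus reference force components: a slope below 1 indicates systematic underprediction. The function name and the synthetic 20%-softened data below are hypothetical, for illustration only.

```python
import numpy as np

def softening_scale(f_pred, f_ref):
    """Least-squares slope of predicted vs. reference force components.

    A slope < 1 indicates a softened potential energy surface,
    i.e. systematic underprediction of force magnitudes.
    """
    f_pred = np.asarray(f_pred, dtype=float).ravel()
    f_ref = np.asarray(f_ref, dtype=float).ravel()
    # Slope of the zero-intercept fit f_pred ≈ k * f_ref
    return float(np.dot(f_pred, f_ref) / np.dot(f_ref, f_ref))

# Synthetic example: a model that uniformly underpredicts forces by 20%
rng = np.random.default_rng(0)
f_dft = rng.normal(size=(100, 3))   # reference forces (e.g. eV/Å)
f_mlip = 0.8 * f_dft                # softened predictions
print(round(softening_scale(f_mlip, f_dft), 3))  # 0.8
```

In this picture, fine-tuning on even one additional data point can correct the error because a single scale factor, rather than many independent parameters, dominates the discrepancy.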