Can Chemical Property Prediction Models Extrapolate? Understanding How to Move Towards Generalizable Chemical Foundation Models

When and Where

Dec 5, 2024
9:00am - 9:15am

Hynes, Level 2, Room 210

Presenter(s)

Evan Antoniuk

Shehtab Zaman

Busra Demirci

Peggy Li

Yu-Ting Hsu

James Diffenderfer

Ken Chiu

Anna Hiszpanski

Nikoli Dryden

Tal Ben-Nun

Bhavya Kailkhura

Brian Van Essen

Co-Author(s)

Evan Antoniuk¹,Shehtab Zaman²,Busra Demirci²,Peggy Li¹,Yu-Ting Hsu¹,James Diffenderfer¹,Ken Chiu²,Anna Hiszpanski¹,Nikoli Dryden¹,Tal Ben-Nun¹,Bhavya Kailkhura¹,Brian Van Essen¹

Lawrence Livermore National Laboratory¹,Binghamton University, The State University of New York²

Abstract

Evan Antoniuk¹,Shehtab Zaman²,Busra Demirci²,Peggy Li¹,Yu-Ting Hsu¹,James Diffenderfer¹,Ken Chiu²,Anna Hiszpanski¹,Nikoli Dryden¹,Tal Ben-Nun¹,Bhavya Kailkhura¹,Brian Van Essen¹

Lawrence Livermore National Laboratory¹,Binghamton University, The State University of New York²

Alongside the push towards artificial general intelligence, there has been a surge of interest in the chemical community to develop large-scale chemical foundation models. This goal is motivated by the idea that a single foundation model trained on vast quantities of chemical data can be broadly applied to multiple applications without needing retraining. Although several chemical foundation models have already displayed their proficiency at predicting molecular properties, their capability to generalize across chemical space has yet to be systematically evaluated.
In this talk, I will discuss our work on understanding the capabilities of current state-of-the-art models (both foundation models and otherwise) to extrapolate to new chemistries. From performing large-scale benchmarking of over seven models across ten unique datasets and millions of molecular samples, we find that no existing model has the ability to consistently extrapolate to new chemistry. As a result, we perform various ablation experiments to understand how to provide better generalization capabilities to models. Specifically, we probe how pre-training objectives, multi-task learning, model architecture and molecule representation affect the resulting extrapolation performance. The insights gained from these experiments provide actionable guidance on how the materials community can proceed towards achieving generalizable materials foundation models. In addition, the extrapolation tasks established in our benchmarking provide the materials community with a uniform benchmark for tracking progress in the extrapolation abilities of chemical models.

This work was produced under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.

Symposium Organizers

Kjell Jorner, ETH Zurich

Jian Lin, University of Missouri-Columbia

Daniel Tabor, Texas A&M University

Dmitry Zubarev, IBM

Session Chairs

Kjell Jorner

Dmitry Zubarev

Symposium Supporters

2024 MRS Fall Meeting & Exhibit