MRS Meetings and Events

 

DS01.13.02 2022 MRS Spring Meeting

Unified Language of Synthesis Actions for Representation of Synthesis Protocols—Making Steps Toward Autonomous Materials Synthesis

When and Where

May 13, 2022
8:30am - 8:45am

Hawai'i Convention Center, Level 3, Lili'U Theater, 310

Presenter

Co-Author(s)

Zheren Wang1,2,Kevin Cruse1,2,Yuxing Fei1,2,Ann Chia1,Yan Zeng2,Haoyan Huo1,2,Tanjin He1,2,Bowen Deng1,2,Olga Kononova1,Gerbrand Ceder1,2

University of California, Berkeley1,Lawrence Berkeley National Laboratory2

Abstract

Zheren Wang1,2,Kevin Cruse1,2,Yuxing Fei1,2,Ann Chia1,Yan Zeng2,Haoyan Huo1,2,Tanjin He1,2,Bowen Deng1,2,Olga Kononova1,Gerbrand Ceder1,2

University of California, Berkeley1,Lawrence Berkeley National Laboratory2
Over the past decade, we have witnessed growing success of data-driven and artificial intelligence (AI)-based methodologies promoting breakthroughs in predicting materials structure, properties, and functionality. Nonetheless, adapting AI power to predict and control materials synthesis is still challenging and requires substantial effort in gathering high-quality large-scale datasets. A possible approach would be to extract synthesis information from scientific publications. However, this route is still challenging, especially for extracting synthesis actions, because of the lack of a comprehensive labeled dataset using a solid, robust, and well-established ontology for describing synthesis procedures. Taking a step further, the growing interest in automated AI-guided materials synthesis demands a robust and unified language for describing synthesis protocols in order to make them applicable to the autonomous robotic platform.<br/>In this work, we propose a <i>unified language of synthesis actions </i>(ULSA) to describe solid-state, sol-gel, precipitation, and solvo-/hydrothermal synthesis procedures. We also present a labeled dataset of 3,040 synthesis sentences created using the proposed ULSA schema. To verify the applicability of the ULSA and the dataset, we trained a neural network-based model that identifies a sequence of synthesis actions in a paragraph, maps them into the ULSA, and builds a graph of the synthesis procedure. Analysis of the graphs from thousands of paragraphs has shown that this ULSA vocabulary is enough to obtain high-accuracy extraction of synthesis actions as well as to pick the important features of each of the aforementioned synthesis types. We anticipate that these results will be widely used by the researchers interested in scientific text mining and will help to achieve a breakthrough in predictive and AI-guided autonomous materials synthesis.

Keywords

chemical synthesis

Symposium Organizers

Mathieu Bauchy, University of California, Los Angeles
Mathew Cherukara, Argonne National Laboratory
Grace Gu, University of California, Berkeley
Badri Narayanan, University of Louisville

Publishing Alliance

MRS publishes with Springer Nature