Hawaiʻi Convention Center, Level 3, Room 313B
The ability to leverage machine learning (ML) is rapidly becoming a part of the material scientist’s toolkit, whether for aiding ab initio computational work, screening datasets of candidate materials or interfacing with in-lab experimenters and equipment. This tutorial aims to address the problems that frequently arise after a materials scientist builds their first few successful ML models: How can multiple versions of models, and their predictions, be effectively tracked? Are there ways to automatically test models before running large batch predictions?
This tutorial will use case studies from recent work in combinatorial science and materials screening to present an interactive Python tutorial for participants. Familiarity with basic machine learning principles along with Git version control are recommended for participation in this tutorial.
Participants will learn:
Prerequisites: Familiarity with Python and version control (e.g., Github). Prior experience building machine learning models is a necessary prerequisite. A laptop with Python and Github access to follow along in real-time is strongly recommended.
Tracking your ML Models and Their Predictions
Edward Kim, Xero; Jason Hattrick-Simpers, University of Toronto; Arun Mannodi Kanakkithodi, Purdue University
This session will include a case study from research in autonomous materials science and an interactive code walk through on preserving the lineage of and linkages between ML models and their predictions. The session will also include Q&A on some recommended tools, best practices and applications across materials domains.
Machine Learning with Automated Pipelines to Reduce Risk
Edward Kim, Xero; Jason Hattrick-Simpers, University of Toronto; Arun Mannodi Kanakkithodi, Purdue University
This section will include a case study from research in materials screening and a guided code walk through on automated testing of ML models and leveraging pipeline tools to manage ML workflows. The session will also include Q&A on some recommended tools, best practice, and applications across materials domains.