608138 On the Data-Driven Discovery and Calibration of Closures

Thursday, November 19, 2020
Food, Pharmaceutical & Bioengineering Division (15) (PreRecorded+)
Seungjoon Lee1,2, Constantinos Siettos3, Tom S. Bertalan2, Daniel B. Amchin4, Tapomoy Bhattacharjee4, Sujit S. Datta4 and Ioannis G. Kevrekidis2,5,6, (1)Department of Applied Data Science, San Jose State University, San Jose, CA, (2)Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, (3)Dipartimento di Matematica e Applicazioni,, Università degli Studi di Napoli "Federico II, Naples, Italy, (4)Chemical and Biological Engineering, Princeton University, Princeton, NJ, (5)Urology, Johns Hopkins University School of Medicine, Baltimore, MD, (6)Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD

We explore the data-driven learning of closures in problems involving bacterial chemotactic motion. The data come from agent-based stochastic computations as well as from experiments, and are validated through computational data from established continuum models. Machine learning has long been used to learn unknown PDEs from spatiotemporal data [1] and recently there has been a renewed interest, especially for the case of multiscale simulation data [2]. Starting from neural network architectures that learn a PDE from spatiotemporal data as a "black box", we proceed to learn only parts of this evolution law as a "gray box", and, in particular, the unknown closure terms.

Using agent-based simulation data and experimental data from E. Coli bacterial chemotaxis, we explore the data-driven discovery of the chemotactic terms; find different equivalent parametrizations of them, and also show how to calibrate approximate (qualitatively correct but quantitatively inaccurate) closures to "the truth" in a data-driven manner.

We also show how the procedure can be linked with equation-free multiscale computational techniques (like patch dynamics) to help collect the necessary macroscale data in the most parsimonious way possible. Possibly the most important finding is that several different "equivalent on the data" PDEs as well as several different "equivalent on the data" closures can be identified that are consistent with the observations. We will also discuss the explainability (in terms of humanly understandable terms) of the variables and parametrizations of the discovered closures. The work forms a bridge between analytical/mechanistic understanding, and data-driven "black box" learning of physical process dynamics, allowing for a synergy between the two options.

[1] Gonzalez-Garcia, R., Rico-Martinez, R. and Kevrekidis, I.G., (1998). Identification of distributed parameter systems: A neural net based approach. Computers & chemical engineering, 22, pp.S965-S968

[2] Lee, S., Kooshkbaghi, M., Spiliotis, K., Siettos, C.I. and Kevrekidis, I.G. (2020), Coarse-scale PDEs from fine-scale observations via machine learning, Chaos: An Interdisciplinary Journal of Nonlinear Science, 30(1), p.013141.


Extended Abstract: File Not Uploaded
See more of this Session: Systems Biology of Microbes
See more of this Group/Topical: Food, Pharmaceutical & Bioengineering Division