Computational graph completion
- Creators
- Owhadi, Houman
Abstract
We introduce a framework for generating, organizing, and reasoning with computational knowledge. It is motivated by the observation that most problems in Computational Sciences and Engineering (CSE) can be described as that of completing (from data) a computational graph (or hypergraph) representing dependencies between functions and variables. In that setting nodes represent variables and edges (or hyperedges) represent functions (or functionals). Functions and variables may be known, unknown, or random. Data come in the form of observations of distinct values of a finite number of subsets of the variables of the graph (satisfying its functional dependencies). The underlying problem combines a regression problem (approximating unknown functions) with a matrix completion problem (recovering unobserved variables in the data). Replacing unknown functions by Gaussian processes and conditioning on observed data provides a simple but efficient approach to completing such graphs. Since the proposed framework is highly expressive, it has a vast potential application scope. Since the completion process can be automatized, as one solves √√2+√3 on a pocket calculator without thinking about it, one could, with the proposed framework, solve a complex CSE problem by drawing a diagram. Compared to traditional regression/kriging, the proposed framework can be used to recover unknown functions with much scarcer data by exploiting interdependencies between multiple functions and variables. The computational graph completion (CGC) problem addressed by the proposed framework could therefore also be interpreted as a generalization of that of solving linear systems of equations to that of approximating unknown variables and functions with noisy, incomplete, and nonlinear dependencies. Numerous examples illustrate the flexibility, scope, efficacy, and robustness of the CGC framework and show how it can be used as a pathway to identifying simple solutions to classical CSE problems. These examples include the seamless CGC representation of known methods (for solving/learning PDEs, surrogate/multiscale modeling, mode decomposition, deep learning) and the discovery of new ones (digital twin modeling, dimension reduction).
Additional Information
© The Author(s), under exclusive licence to Springer Nature Switzerland AG 2022. Received 20 October 2021; Accepted 13 March 2022; Published 18 April 2022. The author gratefully acknowledges partial support by the Air Force Office of Scientific Research under MURI award number FA9550-20-1-0358 (Machine Learning and Physics-Based Modeling and Simulation). Thanks to Amy Braverman, Jouni Susiluoto, and Otto Lamminpaeae for stimulating discussions. Thanks to an anonymous referee and to Jean-Luc Cambier for helpful comments and feedback. Data Availability: Data sharing is not applicable to this article as no new data were created or analyzed in this study.Attached Files
Accepted Version - 2110.10323.pdf
Files
Name | Size | Download all |
---|---|---|
md5:888eb83445c836e76da0a79dbbc2197f
|
2.1 MB | Preview Download |
Additional details
- Eprint ID
- 114687
- Resolver ID
- CaltechAUTHORS:20220511-647768900
- Air Force Office of Scientific Research (AFOSR)
- FA9550-20-1-0358
- Created
-
2022-05-11Created from EPrint's datestamp field
- Updated
-
2022-05-17Created from EPrint's last_modified field