Shaping the Water-Harvesting Behavior of Metal–Organic Frameworks Aided by Fine-Tuned GPT Models
Abstract
We construct a data set of metal–organic framework (MOF) linkers and employ a fine-tuned GPT assistant to propose MOF linker designs by mutating and modifying the existing linker structures. This strategy allows the GPT model to learn the intricate language of chemistry in molecular representations, thereby achieving an enhanced accuracy in generating linker structures compared with its base models. Aiming to highlight the significance of linker design strategies in advancing the discovery of water-harvesting MOFs, we conducted a systematic MOF variant expansion upon state-of-the-art MOF-303 utilizing a multidimensional approach that integrates linker extension with multivariate tuning strategies. We synthesized a series of isoreticular aluminum MOFs, termed Long-Arm MOFs (LAMOF-1 to LAMOF-10), featuring linkers that bear various combinations of heteroatoms in their five-membered ring moiety, replacing pyrazole with either thiophene, furan, or thiazole rings or a combination of two. Beyond their consistent and robust architecture, as demonstrated by permanent porosity and thermal stability, the LAMOF series offers a generalizable synthesis strategy. Importantly, these 10 LAMOFs establish new benchmarks for water uptake (up to 0.64 g g⁻¹) and operational humidity ranges (between 13 and 53%), thereby expanding the diversity of water-harvesting MOFs.
Copyright and License
© 2023 American Chemical Society.
Acknowledgement
This material is based upon work supported by the Defense Advanced Research Projects Agency (DARPA) under contract HR0011-21-C-0020. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of DARPA. The computational work is partially supported by the Department of Energy (DOE), Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences, and Biosciences, under award DE-SC0023454. In addition, the National Science Foundation (NSF), Division of Chemistry, Chemical Structure, Dynamics, and Mechanisms A (CSDM–A), provided support for the computational resources, award number: CHE-2223442. The authors also extend their gratitude to the Research Computing Center at the University of Chicago for providing computational resources. Additionally, this research utilized the facilities of the Advanced Light Source, a DOE Office of Science User Facility, under contract no. DE-AC02-05CH11231. The study made use of instruments located in the College of Chemistry Nuclear Magnetic Resonance (NMR) Facility, partially supported by NIH S10OD024998. The authors are grateful to Dr. Seth Cohen (DARPA) and Dr. David Moore (General Electric) for their helpful comments and suggestions on this work. Moreover, Z.Z. expresses gratitude to Drs. Nikita Hanikel and Daria Kurandina, Ms. Oufan Zhang, and Mr. Boyu Qie for their valuable discussions. Z.Z. also acknowledges financial support from a Kavli ENSI Graduate Student Fellowship.
Contributions
The manuscript was written through contributions of all authors. All authors have given approval to the final version of the manuscript.
Conflict of Interest
The authors declare the following competing financial interest(s): Omar M. Yaghi is co-founder of ATOCO Inc., aiming at commercializing related technologies.
Data Availability
CCDC 2302011 (LAMOF-2) contains the supplementary crystallographic data for this paper.
Files
Name | Size | Download all |
---|---|---|
md5:350e42ebb75b88945c2e36a659869918
|
785.4 kB | Preview Download |
md5:ff2fa79711ba6dfe481386dd344c94ee
|
10.9 MB | Preview Download |
Additional details
- ISSN
- 1520-5126
- Defense Advanced Research Projects Agency
- HR0011-21-C-0020
- United States Department of Energy
- DE-SC0023454
- National Science Foundation
- CHE-2223442
- United States Department of Energy
- DE-AC02-05CH11231
- National Institutes of Health
- S10OD024998
- The Kavli Foundation