Published March 2025 | Version Supplemental material
Journal Article Open

A novel framework for river organic carbon retrieval through satellite data and machine learning

  • 1. ROR icon Peking University
  • 2. ROR icon California Institute of Technology
  • 3. ROR icon Florida State University
  • 4. ROR icon Institute of Geographic Sciences and Natural Resources Research

Abstract

Rivers transport large amounts of carbon, serving as a critical link between terrestrial, coastal, and atmospheric biogeochemical cycles. However, our observations and understanding of long-term river carbon dynamics in large-scale remain limited. Integrating machine learning with remote sensing offers an effective approach for quantifying organic carbon (OC) from space. Here, we develop the Aquatic-Organic Carbon (Aqua-OC), a dynamic machine learning retrieval framework designed to estimate reach-scale river OC using nearly half a century of analysis-ready Landsat archives. We first integrate a globally representative river OC dataset, comprising 299,330 measurements of dissolved organic carbon (DOC) and 101,878 measurements of particulate organic carbon (POC). This dataset is then used to evaluate the performance of four machine learning methods, i.e., random forest (RF), extreme gradient boosting (XGBoost), Support vector regression (SVR), and deep neural network (DNN), using an optical water type classification strategy. We further leverage multimodal input features to enhance the Aqua-OC framework and OC retrieval accuracy by considering various factors related to OC sources and environmental conditions. The results demonstrate that the Aqua-OC can effectively estimate DOC (R2 = 0.68, RMSE = 2.88 mg/L, Bias = 2.63 %, Error = 12.52 %) and POC (R2 = 0.76, RMSE = 1.76 mg/L, Bias = 6.31 %, Error = 21.36 %). Additionally, the Mississippi River Basin case study demonstrates Aqua-OC’s capability to map nearly four decades of reach-scale OC changes at a basin scale. This study provides a generalized method for satellite-based river OC retrieval at fine spatial and long-term temporal scales, thus offering an effective tool to quantify the rivers’ role in the global carbon cycle.

Copyright and License

© 2025 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS). Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.

Acknowledgement

This study is supported by the National Natural Science Foundation of China (Grant No. 52479055), National Key R&D Program of China (Grant No. 2023YFC3209900), the Fundamental Research Funds for the Central Universities, Peking University (7100604495), Alibaba DAMO Academy Young Fellow Award, AI for Science (AI4S)-Preferred Program of Peking University and the Excellent Young Scientists Fund of the National Natural Science Foundation of China. We thank Jan Karlsson, Tom Battin and Marisa Repasch for the earlier fruitful discussions. We appreciate the careful and thoughtful comments of two anonymous reviewers, which helped improve this paper substantially.

Supplemental Material

Supplementary Data 1 (DOCX)

Files

Files (4.0 MB)

Name Size Download all
md5:3de6f05b2198b70e9e5c54bda1f3ed9d
4.0 MB Download

Additional details

Related works

Funding

National Natural Science Foundation of China
52479055
Ministry of Science and Technology of the People's Republic of China
2023YFC3209900
Central South University
Peking University
7100604495

Dates

Accepted
2025-01-24
Available
2025-02-07
Available online
Available
2025-02-07
Version of record

Caltech Custom Metadata

Caltech groups
Division of Geological and Planetary Sciences (GPS)
Publication Status
Published