A Caltech Library Service

Ten Simple Rules for the Care and Feeding of Scientific Data

Goodman, Alyssa and Pepe, Alberto and Blocker, Alexander W. and Borgman, Christine L. and Cranmer, Kyle and Crosas, Merce and Di Stefano, Rosanne and Gil, Yolanda and Groth, Paul and Hedstrom, Margaret and Hogg, David W. and Kashyap, Vinay and Mahabal, Ashish and Siemiginowska, Aneta and Slavkovic, Aleksandra (2014) Ten Simple Rules for the Care and Feeding of Scientific Data. PLoS Computational Biology, 10 (4). Art. No. e1003542. ISSN 1553-734X. PMCID PMC3998871.

PDF - Published Version
Creative Commons Attribution.


Use this Persistent URL to link to this item:


In the early 1600s, Galileo Galilei turned a telescope toward Jupiter. In his log book each night, he drew to-scale schematic diagrams of Jupiter and some oddly moving points of light near it. Galileo labeled each drawing with the date. Eventually he used his observations to conclude that the Earth orbits the Sun, just as the four Galilean moons orbit Jupiter. History shows Galileo to be much more than an astronomical hero, though. His clear and careful record keeping and publication style not only let Galileo understand the solar system, they continue to let anyone understand how Galileo did it. Galileo's notes directly integrated his data (drawings of Jupiter and its moons), key metadata (timing of each observation, weather, and telescope properties), and text (descriptions of methods, analysis, and conclusions). Critically, when Galileo included the information from those notes in Sidereus Nuncius, this integration of text, data, and metadata was preserved, as shown in Figure 1. Galileo's work advanced the “Scientific Revolution,” and his approach to observation and analysis contributed significantly to the shaping of today's modern “scientific method”.

Item Type:Article
Related URLs:
URLURL TypeDescription
Hogg, David W.0000-0003-2866-9403
Mahabal, Ashish0000-0003-2242-0244
Additional Information:© 2014 Goodman et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published: April 24, 2014. This article is an outcome of an Exploratory Seminar called ‘‘What to Keep and How to Analyze It: Data Curation and Data Analysis with Multiple Phases’’ ( organized by Xiao-Li Meng and Alyssa Goodman, held on May 9–10, 2013 at the Radcliffe Institute for Advanced Study, Harvard University, Cambridge, Massachusetts. This article was written collaboratively, online, in the open, using Authorea. Every iteration of the writing procedure is logged and available in the online version of this article at Funding: The authors received no specific funding for writing this manuscript. Competing Interests: The authors have declared that no competing interests exist.
Issue or Number:4
PubMed Central ID:PMC3998871
Record Number:CaltechAUTHORS:20140626-084859742
Persistent URL:
Official Citation:Goodman A, Pepe A, Blocker AW, Borgman CL, Cranmer K, et al. (2014) Ten Simple Rules for the Care and Feeding of Scientific Data. PLoS Comput Biol 10(4): e1003542. doi:10.1371/journal.pcbi.1003542
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:46521
Deposited By: Jason Perez
Deposited On:26 Jun 2014 19:22
Last Modified:09 Mar 2020 13:18

Repository Staff Only: item control page