Dissecting heterogeneous cell populations across drug and disease conditions with PopAlign
Abstract
Single-cell measurement techniques can now probe gene expression in heterogeneous cell populations from the human body across a range of environmental and physiological conditions. However, new mathematical and computational methods are required to represent and analyze gene expression changes that occur in complex mixtures of single cells as they respond to signals, drugs, or disease states. Here, we introduce a mathematical modeling platform, PopAlign, that automatically identifies subpopulations of cells within a heterogeneous mixture, and tracks gene expression and cell abundance changes across subpopulations by constructing and comparing probabilistic models. We apply PopAlign to analyze the impact of 42 different immunomodulatory compounds on a heterogeneous population of donor-derived human immune cells as well as patient-specific disease signatures in multiple myeloma. PopAlign scales to comparisons involving tens to hundreds of samples, enabling large-scale studies of natural and engineered cell populations as they respond to drugs, signals or physiological change.
Additional Information
© 2020 National Academy of Sciences. Published under the PNAS license. Edited by Jonathan S. Weissman, University of California, San Francisco, CA, and approved September 25, 2020 (received for review March 31, 2020). PNAS first published October 30, 2020. We thank Justin Bois, Eric Chow, Allan-Hermann Pool, Jase Gehring, Tami Khazaei, and members of the M.T. laboratory for helpful feedback and discussions; Chris McGinnis and David Patterson for experimental guidance; and Inna-Marie Strazhnik for figure editing and illustrations. This work was performed at the Beckman Institute Single-Cell Profiling and Engineering Center. M.T. was supported by the Shurl and Kay Curci Foundation and the Heritage Medical Research Institute. Data Availability: Single-cell gene-expression data have been deposited in Figshare (https://doi.org/10.6084/m9.figshare.11837097) (48). The software package, implemented in Python 3, can be found at GitHub, https://github.com/thomsonlab/popalign. Author contributions: S.C., P.R., F.P., and M.T. designed research; S.C., P.R., J.H.P., T.T., E.C., J.R.H., and M.T. performed research; S.C., P.R., and M.T. contributed new reagents/analytic tools; S.C., P.R., and M.T. analyzed data; S.C. and M.T. wrote the paper; and F.P. provided clinical guidance and context for data interpretation. Competing interest statement: S.C., M.T., and P.R. have filed a US and Patent Cooperation Treaty patent for the PopAlign computational framework. This article is a PNAS Direct Submission. This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2005990117/-/DCSupplemental.Attached Files
Published - 28784.full.pdf
Submitted - 421354v9.full.pdf
Supplemental Material - 421354-1.pdf
Supplemental Material - pnas.2005990117.sapp.pdf
Files
Additional details
- Alternative title
- Dissecting heterogeneous cell-populations across signaling and disease conditions with PopAlign
- PMCID
- PMC7682438
- Eprint ID
- 90008
- Resolver ID
- CaltechAUTHORS:20180927-114224716
- Shurl and Kay Curci Foundation
- 12540322
- Heritage Medical Research Institute
- HMRI-15-09-01
- Created
-
2018-09-27Created from EPrint's datestamp field
- Updated
-
2023-07-18Created from EPrint's last_modified field
- Caltech groups
- Heritage Medical Research Institute, Division of Biology and Biological Engineering