Published June 2020 | Version Submitted + Published + Supplemental Material
Journal Article Open

Interpretable factor models of single-cell RNA-seq via variational autoencoders

Abstract

Motivation: Single-cell RNA-seq makes possible the investigation of variability in gene expression among cells, and dependence of variation on cell type. Statistical inference methods for such analyses must be scalable, and ideally interpretable. Results: We present an approach based on a modification of a recently published highly scalable variational autoencoder framework that provides interpretability without sacrificing much accuracy. We demonstrate that our approach enables identification of gene programs in massive datasets. Our strategy, namely the learning of factor models with the auto-encoding variational Bayes framework, is not domain specific and may be useful for other applications. Availability and implementation: The factor model is available in the scVI package hosted at https://github.com/YosefLab/scVI/.

Additional Information

© 2020 The Author(s). Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. Received: 13 September 2019; Revision received: 03 February 2020; Accepted: 20 February 2020; Published: 16 March 2020. We thank Eduardo da Veiga Beltrame and Romain Lopez for helpful feedback on the manuscript. Sina Booeshaghi provided useful comments on the LDVAE software. Additionally, we thank the users of scVI who provided helpful discussion about the implementation on Github. Funding: This work was supported by the National Institutes of Health [U19MH114830 to V.S. and L.P.]; and Error! Hyperlink reference not valid. [CZF2019-002454 to A.G. and N.Y.]. Conflict of Interest: none declared.

Attached Files

Published - btaa169.pdf

Submitted - 737601.full.pdf

Supplemental Material - btaa169_supplementary_data.zip

Files

737601.full.pdf

Files (3.2 MB)

Name Size Download all
md5:9787943b2df06a8f29e290131183738e
580.5 kB Preview Download
md5:3bd63f48f50256bda477a6648561ca93
339.2 kB Preview Download
md5:d000e1db22335f0c82f22a309fd6c2f0
2.3 MB Preview Download

Additional details

Identifiers

PMCID
PMC7267837
Eprint ID
97957
Resolver ID
CaltechAUTHORS:20190816-135915873

Related works

Funding

NIH
U19MH114830
Chan Zuckerberg Foundation
CZF2019-002454

Dates

Created
2019-08-16
Created from EPrint's datestamp field
Updated
2022-02-15
Created from EPrint's last_modified field

Caltech Custom Metadata

Caltech groups
Division of Biology and Biological Engineering (BBE)