Brown, Brielin C. and Bray, Nicolas L. and Pachter, Lior (2018) Expression reflects population structure. PLoS Genetics, 14 (12). Art. No. e1007841. ISSN 1553-7390. PMCID PMC6317812. https://resolver.caltech.edu/CaltechAUTHORS:20181008-162020262
Abstract
Population structure in genotype data has been extensively studied, and is revealed by looking at the principal components of the genotype matrix. However, no similar analysis of population structure in gene expression data has been conducted, in part because a naïve principal components analysis of the gene expression matrix does not cluster by population. We identify a linear projection that reveals population structure in gene expression data. Our approach relies on the coupling of the principal components of genotype to the principal components of gene expression via canonical correlation analysis. Our method is able to determine the significance of the variance in the canonical correlation projection explained by each gene. We identify 3,571 significant genes, only 837 of which had been previously reported to have an associated eQTL in the GEUVADIS results. We show that our projections are not primarily driven by differences in allele frequency at known cis-eQTLs and that similar projections can be recovered using only several hundred randomly selected genes and SNPs. Finally, we present preliminary work on the consequences for eQTL analysis. We observe that using our projection co-ordinates as covariates results in the discovery of slightly fewer genes with eQTLs, but that these genes replicate in GTEx matched tissue at a slightly higher rate.
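The coupling described in the abstract (principal components of genotype linked to principal components of expression through canonical correlation analysis) can be illustrated with a minimal NumPy sketch. This is not the authors' PCCA software. The function names `top_pcs` and `pca_cca`, the choice of five components per matrix, and the synthetic two-population data are all assumptions made for illustration. Because the principal component score vectors returned by an SVD are orthonormal, the CCA step here reduces to an SVD of their cross-product.

```python
import numpy as np


def top_pcs(matrix, k):
    """Return the top-k unit-norm principal component scores (samples x k)."""
    # Center each column (SNP or gene) so the scores are mean-zero.
    centered = matrix - matrix.mean(axis=0, keepdims=True)
    u, _, _ = np.linalg.svd(centered, full_matrices=False)
    return u[:, :k]


def pca_cca(genotypes, expression, k_geno=5, k_expr=5):
    """Hypothetical helper: CCA between genotype PCs and expression PCs.

    Both PC score matrices have orthonormal columns, so the canonical
    correlations are the singular values of their cross-product.
    """
    ug = top_pcs(genotypes, k_geno)
    ue = top_pcs(expression, k_expr)
    a, corr, bt = np.linalg.svd(ug.T @ ue, full_matrices=False)
    geno_proj = ug @ a      # canonical variates on the genotype side
    expr_proj = ue @ bt.T   # canonical variates on the expression side
    return corr, geno_proj, expr_proj


# Synthetic data (an assumption for illustration, not GEUVADIS data):
# two populations with shifted allele frequencies and a modest
# population effect on a subset of genes, plus an unrelated batch effect.
rng = np.random.default_rng(0)
n, n_snps, n_genes = 200, 300, 150
pop = np.repeat([0, 1], n // 2)

freq0 = rng.uniform(0.2, 0.8, n_snps)
freq1 = np.clip(freq0 + rng.normal(0.0, 0.1, n_snps), 0.05, 0.95)
freqs = np.where(pop[:, None] == 0, freq0, freq1)
genotypes = rng.binomial(2, freqs).astype(float)

expression = rng.normal(size=(n, n_genes))
expression[:, :30] += 0.5 * pop[:, None]           # population signal
batch = rng.integers(0, 2, n)
expression += 2.0 * batch[:, None] * rng.normal(size=n_genes)  # batch noise

corr, geno_proj, expr_proj = pca_cca(genotypes, expression)
print("canonical correlations:", np.round(corr, 3))
```

In this sketch, plotting the first columns of `expr_proj` colored by `pop` would be the analogue of viewing expression in the canonical correlation projection rather than in its raw principal components.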
| Item Type: | Article |
|---|---|
| Additional Information: | © 2018 Brown et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Received: July 30, 2018; Accepted: November 20, 2018; Published: December 19, 2018. The authors would like to thank Shannon McCurdy for invaluable feedback on this manuscript. LP and NB were funded by National Institutes of Health grant R01HG008164. LP was also funded by National Institutes of Health grant DK094699. BB was funded by the National Science Foundation Graduate Research Fellowship Program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Data Availability: GEUVADIS project RNA-seq reads are available at the European Nucleotide Archive (accession number ENA: ERP001942). 1000 genomes genotypes are available from cog-genomics (https://www.cog-genomics.org/plink/1.9/resources#1kg). Analysis software are available on github (https://github.com/pachterlab/PCCA/). Gencode v27 transcripts are available at ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_27/gencode.v27.pc_transcripts.fa.gz. Gencode v27 GTF is available at ftp://ftp.sanger.ac.uk/pub/gencode/Gencode_human/release_27/gencode.v27.annotation.gtf.gz. The authors have declared that no competing interests exist. |
| Issue or Number: | 12 |
| PubMed Central ID: | PMC6317812 |
| Record Number: | CaltechAUTHORS:20181008-162020262 |
| Persistent URL: | https://resolver.caltech.edu/CaltechAUTHORS:20181008-162020262 |
| Official Citation: | Brown BC, Bray NL, Pachter L (2018) Expression reflects population structure. PLoS Genet 14(12): e1007841. https://doi.org/10.1371/journal.pgen.1007841 |
| Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. |
| ID Code: | 90174 |
| Collection: | CaltechAUTHORS |
| Deposited By: | George Porter |
| Deposited On: | 09 Oct 2018 14:44 |
| Last Modified: | 03 Oct 2019 20:22 |