CaltechAUTHORS
  A Caltech Library Service

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

Trapnell, Cole and Roberts, Adam and Goff, Loyal and Pertea, Geo and Kim, Daehwan and Kelley, David R. and Pimentel, Harold and Salzberg, Steven L. and Rinn, John L. and Pachter, Lior (2012) Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nature Protocols, 7 (3). pp. 562-578. ISSN 1754-2189. PMCID PMC3334321. https://resolver.caltech.edu/CaltechAUTHORS:20170303-165006599

[img] PDF - Accepted Version
See Usage Policy.

2MB

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20170303-165006599

Abstract

Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ~1 h of hands-on time.


Item Type:Article
Related URLs:
URLURL TypeDescription
http://dx.doi.org/10.1038/nprot.2012.016DOIArticle
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3334321/PubMed CentralArticle
http://rdcu.be/pMCUPublisherFree ReadCube access
http://dx.doi.org/10.1038/nprot1014-2513aDOICorrigendum
http://rdcu.be/pMCSPublisherFree ReadCube access -- Corrigendum
http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE32038OrganizationGene Expression Omnibus
ORCID:
AuthorORCID
Pachter, Lior0000-0002-9164-6231
Additional Information:© 2012 Macmillan Publishers Limited. Published online 1 March 2012; corrected after print 7 August 2014; doi:10.1038/nprot.2012.016. We are grateful to D. Hendrickson, M. Cabili and B. Langmead for helpful technical discussions. The TopHat and Cufflinks projects are supported by US National Institutes of Health grants R01-HG006102 (to S.L.S.) and R01-HG006129-01 (to L.P.). C.T. is a Damon Runyon Cancer Foundation Fellow. L.G. is a National Science Foundation Postdoctoral Fellow. A.R. is a National Science Foundation Graduate Research Fellow. J.L.R. is a Damon Runyon-Rachleff, Searle, and Smith Family Scholar, and is supported by Director's New Innovator Awards (1DP2OD00667-01). This work was funded in part by the Center of Excellence in Genome Science from the US National Human Genome Research Institute (J.L.R.). J.L.R. is an investigator of the Merkin Foundation for Stem Cell Research at the Broad Institute. Author Contributions: C.T. is the lead developer for the TopHat and Cufflinks projects. L.G. designed and wrote CummeRbund. D.K., H.P. and G.P. are developers of TopHat. A.R. and G.P. are developers of Cufflinks and its accompanying utilities. C.T. developed the protocol, generated the example experiment and performed the analysis. L.P., S.L.S. and C.T. conceived the TopHat and Cufflinks software projects. C.T., D.R.K. and J.L.R. wrote the manuscript. The authors declare no competing financial interests.
Funders:
Funding AgencyGrant Number
NIHR01-HG006102
NIHR01-HG006129-01
Damon Runyon Cancer Research FoundationUNSPECIFIED
NSF Postdoctoral FellowshipUNSPECIFIED
NSF Graduate Research FellowshipUNSPECIFIED
Searle Scholars ProgramUNSPECIFIED
Smith Family FoundationUNSPECIFIED
NIH1DP2OD00667-01
Merkin Foundation for Stem Cell ResearchUNSPECIFIED
Issue or Number:3
PubMed Central ID:PMC3334321
Record Number:CaltechAUTHORS:20170303-165006599
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20170303-165006599
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:74752
Collection:CaltechAUTHORS
Deposited By: George Porter
Deposited On:06 Mar 2017 17:01
Last Modified:10 Jun 2020 23:32

Repository Staff Only: item control page