CaltechAUTHORS
  A Caltech Library Service

A direct comparison of genome alignment and transcriptome pseudoalignment

Yi, Lynn and Liu, Lauren and Melsted, Páll and Pachter, Lior (2018) A direct comparison of genome alignment and transcriptome pseudoalignment. . (Unpublished) https://resolver.caltech.edu/CaltechAUTHORS:20181029-160959571

[img] PDF - Submitted Version
Creative Commons Attribution Non-commercial No Derivatives.

1700Kb

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20181029-160959571

Abstract

Motivation: Genome alignment of reads is the first step of most genome analysis workflows. In the case of RNA-Seq, transcriptome pseudoalignment of reads is a fast alternative to genome alignment, but the different coordinate systems of the genome and transcriptome have made it difficult to perform direct comparisons between the approaches. Results: We have developed tools for converting genome alignments to transcriptome pseudoalignments, and conversely, for projecting transcriptome pseudoalignments to genome alignments. Using these tools, we performed a direct comparison of genome alignment with transcriptome pseudoalignment. We find that both approaches produce similar quantifications. This means that for many applications genome alignment and transcriptome pseudoalignment are interchangeable. Availability and Implementation: bam2tcc is a C++14 software for converting alignments in SAM/BAM format to transcript compatibility counts (TCCs) and is available at https://github.com/pachterlab/bam2tcc. kallisto genomebam is a user option of kallisto that outputs a sorted BAM file in genome coordinates as part of transcriptome pseudoalignment. The feature has been released with kallisto v0.44.0, and is available at https://pachterlab.github.io/kallisto/.


Item Type:Report or Paper (Discussion Paper)
Related URLs:
URLURL TypeDescription
https://doi.org/10.1101/444620DOIDiscussion Paper
ORCID:
AuthorORCID
Yi, Lynn0000-0003-4575-0158
Pachter, Lior0000-0002-9164-6231
Additional Information:The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license. bioRxiv preprint first posted online Oct. 16, 2018. Availability and Implementation: bam2tcc is available at https://github.com/pachterlab/bam2tcc. kallisto v0.44.0 containing the novel genomebam feature is available at https://pachterlab.github.io/kallisto/. The scripts and code used to regenerate our analysis are available at https://github.com/pachterlab/YLMP_2018. We thank Valentine Svensson for helpful feedback during the implementation of bam2tcc. Funding: LY was funded by the UCLA/Caltech MSTP, NIH T32 GM007616, NIH U19MH114830, the Lee Ramo Endowment, the Treadway Endowment, and the Hearst Endowment. LP was partly funded by U19MH114830.
Funders:
Funding AgencyGrant Number
UCLA-Caltech Medical Scientist Training ProgramUNSPECIFIED
NIH Predoctoral FellowshipT32 GM007616
NIHU19MH114830
Lee Ramo EndowmentUNSPECIFIED
Treadway EndowmentUNSPECIFIED
Hearst EndowmentUNSPECIFIED
NIHU19MH114830
Record Number:CaltechAUTHORS:20181029-160959571
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20181029-160959571
Official Citation:A direct comparison of genome alignment and transcriptome pseudoalignment. Lynn Yi, Lauren Liu, Pall Melsted, Lior Pachter. bioRxiv 444620; doi: https://doi.org/10.1101/444620
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:90482
Collection:CaltechAUTHORS
Deposited By: Tony Diaz
Deposited On:30 Oct 2018 01:50
Last Modified:03 Oct 2019 20:25

Repository Staff Only: item control page