Jinich, Adrian and Zaveri, Anisha and DeJesus, Michael A. and Flores-Bautista, Emanuel and Smith, Clare M. and Sassetti, Christopher M. and Rock, Jeremy M. and Ehrt, Sabine and Schnappinger, Dirk and Ioerger, Thomas R. and Rhee, Kyu (2021) Mycobacterium tuberculosis transposon sequencing database (MtbTnDB): a large-scale guide to genetic conditional essentiality. . (Unpublished) https://resolver.caltech.edu/CaltechAUTHORS:20210308-122017457
![]() |
PDF
- Submitted Version
Creative Commons Attribution Non-commercial No Derivatives. 2MB |
Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20210308-122017457
Abstract
Characterization of gene essentiality across different conditions is a useful approach for predicting gene function. Transposon sequencing (TnSeq) is a powerful means of generating genome-wide profiles of essentiality and has been used extensively in Mycobacterium tuberculosis (Mtb) genetic research. Over the past two decades, dozens of TnSeq screens have been published, yielding valuable insights into the biology of Mtb in vitro, inside macrophages, and in model host organisms. However, these Mtb TnSeq profiles are distributed across dozens of research papers within supplementary materials, which makes querying them cumbersome and assembling a complete and consistent synthesis of existing data challenging. Here, we address this problem by building a central repository of publicly available TnSeq screens performed in M. tuberculosis, which we call the Mtb transposon sequencing database (MtbTnDB). The MtbTnDB encompasses 64 published and unpublished TnSeq screens, and is standardized, open-access, and allows users easy access to data, visualizations, and functional predictions through an interactive web-app (www.mtbtndb.app). We also present evidence that (i) genes in the same genomic neighborhood tend to have similar TnSeq profiles, and (ii) clusters of genes with similar TnSeq profiles tend to be enriched for genes belonging to the same functional categories. Finally, we test and evaluate machine learning models trained on TnSeq profiles to guide functional annotation of orphan genes in Mtb. In addition to facilitating the exploration of conditional genetic essentiality in this important human pathogen via a centralized TnSeq data repository, the MtbTnDB will enable hypothesis generation and the extraction of meaningful patterns by facilitating the comparison of datasets across conditions. This will provide a basis for insights into the functional organization of Mtb genes as well as gene function prediction.
Item Type: | Report or Paper (Discussion Paper) | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Related URLs: |
| |||||||||
ORCID: |
| |||||||||
Additional Information: | The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license. This version posted March 6, 2021. The authors have declared no competing interest. | |||||||||
DOI: | 10.1101/2021.03.05.434127 | |||||||||
Record Number: | CaltechAUTHORS:20210308-122017457 | |||||||||
Persistent URL: | https://resolver.caltech.edu/CaltechAUTHORS:20210308-122017457 | |||||||||
Official Citation: | The Mycobacterium tuberculosis transposon sequencing database (MtbTnDB): a large-scale guide to genetic conditional essentiality. Adrian Jinich, Anisha Zaveri, Michael A. DeJesus, Emanuel Flores-Bautista, Clare M. Smith, Christopher M. Sassetti, Jeremy M. Rock, Sabine Ehrt, Dirk Schnappinger, Thomas R. Ioerger, Kyu Rhee. bioRxiv 2021.03.05.434127; doi: https://doi.org/10.1101/2021.03.05.434127 | |||||||||
Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. | |||||||||
ID Code: | 108342 | |||||||||
Collection: | CaltechAUTHORS | |||||||||
Deposited By: | Tony Diaz | |||||||||
Deposited On: | 08 Mar 2021 20:42 | |||||||||
Last Modified: | 16 Nov 2021 19:11 |
Repository Staff Only: item control page