Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published June 14, 2024 | Published
Journal Article Open

Flexible parsing, interpretation, and editing of technical sequences with splitcode

  • 1. ROR icon California Institute of Technology

Abstract

Motivation

Next-generation sequencing libraries are constructed with numerous synthetic constructs such as sequencing adapters, barcodes, and unique molecular identifiers. Such sequences can be essential for interpreting results of sequencing assays, and when they contain information pertinent to an experiment, they must be processed and analyzed.

Results

We present a tool called splitcode, that enables flexible and efficient parsing, interpreting, and editing of sequencing reads. This versatile tool facilitates simple, reproducible preprocessing of reads from libraries constructed for a large array of single-cell and bulk sequencing assays.

Availability

The splitcode program is free, open source, and available for download at http://github.com/pachterlab/splitcode.

Supplementary information

Supplementary data are available at Bioinformatics online.

Copyright and License

Funding

D.K.S. was funded by the UCLA-Caltech Medical Scientist Training Program (NIH NIGMS training grant T32 GM008042). L.P. was supported in part by the National Institutes of Health (NIH) grants U19MH114830 and 5UM1HG012077-02.

Conflict of Interest

none declared

Files

btae331.pdf
Files (683.8 kB)
Name Size Download all
md5:2d6229e2ccd8d35b7adf26a1c4010650
451.7 kB Preview Download
md5:90d85617946b2a23b8230ee1d33276f9
232.1 kB Preview Download

Additional details

Created:
June 18, 2024
Modified:
June 24, 2024