A Caltech Library Service

Accurate Identification of Novel Human Genes Through Simultaneous Gene Prediction in Human, Mouse, and Rat

Dewey, Colin and Wu, Jia Qian and Cawley, Simon and Alexandersson, Marina and Gibbs, Richard and Pachter, Lior (2004) Accurate Identification of Novel Human Genes Through Simultaneous Gene Prediction in Human, Mouse, and Rat. Genome Research, 14 (4). pp. 661-664. ISSN 1088-9051. PMCID PMC383310. doi:10.1101/gr.1939804.

[img] PDF - Published Version
Creative Commons Attribution Non-commercial.


Use this Persistent URL to link to this item:


We describe a new method for simultaneously identifying novel homologous genes with identical structure in the human, mouse, and rat genomes by combining pairwise predictions made with the SLAM gene-finding program. Using this method, we found 3698 gene triples in the human, mouse, and rat genomes which are predicted with exactly the same gene structure. We show, both computationally and experimentally, that the introns of these triples are predicted accurately as compared with the introns of other ab initio gene prediction sets. Computationally, we compared the introns of these gene triples, as well as those from other ab initio gene finders, with known intron annotations. We show that a unique property of SLAM, namely that it predicts gene structures simultaneously in two organisms, is key to producing sets of predictions that are highly accurate in intron structure when combined with other programs. Experimentally, we performed reverse transcription-polymerase chain reaction (RT-PCR) in both the human and rat to test the exon pairs flanking introns from a subset of the gene triples for which the human gene had not been previously identified. By performing RT-PCR on orthologous introns in both the human and rat genomes, we additionally explore the validity of using RT-PCR as a method for confirming gene predictions.

Item Type:Article
Related URLs:
URLURL TypeDescription CentralArticle
Pachter, Lior0000-0002-9164-6231
Additional Information:© 2004 Cold Spring Harbor Laboratory Press. The Authors acknowledge that six months after the full-issue publication date, the Article will be distributed under a Creative Commons CC-BY-NC License (Attribution-NonCommercial 4.0 International License, Accepted January 26, 2004. Received November 5, 2003. L.P. and C.D. were partially supported by NIH grant R01 HG2362-2. The whole-genome SLAM runs were performed on the Affymetrix computing cluster. R.G. and J.Q.W. were partially supported by grants from the NHGRI/NHLBI (1 U54 HG02345) and NCI/SAIC (20XS182A). The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.
Funding AgencyGrant Number
NIHR01 HG2362-2
NIH1 U54 HG02345
National Human Genome Research InstituteUNSPECIFIED
National Heart, Lung and Blood InstituteUNSPECIFIED
National Cancer Institute20XS182A
Issue or Number:4
PubMed Central ID:PMC383310
Record Number:CaltechAUTHORS:20170308-144150791
Persistent URL:
Official Citation:Accurate Identification of Novel Human Genes Through Simultaneous Gene Prediction in Human, Mouse, and Rat Colin Dewey, Jia Qian Wu, Simon Cawley, Marina Alexandersson, Richard Gibbs, and Lior Pachter Genome Res. April 2004 14: 661-664; doi:10.1101/gr.1939804
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:74924
Deposited By: George Porter
Deposited On:09 Mar 2017 15:36
Last Modified:15 Nov 2021 16:29

Repository Staff Only: item control page