Initial sequencing and comparative analysis of the mouse genome
Abstract
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
Additional Information
© 2002 Macmillan Publishers Limited. Received 18 September 2002; Accepted 31 October 2002. We thank J. Takahashi and M. Johnston for comments on the manuscript; the Mouse Liaison Group for strategic advice; L. Gaffney, D. Leja and K.-S. Toh for graphical help; B. Graham and G. Roberts for administrative work on sequencing of individual mouse BACs; and P. Kassos and M. McMurtry for secretarial assistance. We thank D. Hill and L. Corbani of the Mouse Genome Informatics Group for their contributions to the GO analysis for mouse and human, and the members of the Bork group at EMBL for discussions. Funding was provided by the National Institutes of Health (National Human Genome Research Institute, National Cancer Institute, National Institute of Dental and Craniofacial Research, National Institute of Diabetes and Digestive and Kidney Diseases, National Institute of General Medical Sciences, National Eye Institute, National Institute of Environmental Health Sciences, National Institute of Aging, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institute on Deafness and Other Communication Disorders, National Institute of Mental Health, National Institute on Drug Abuse, National Center for Research Resources, the National Heart Lung and Blood Institute and The Fogarty International Center); the Wellcome Trust; the Howard Hughes Medical Institute; the United States Department of Energy; the National Science Foundation; the Medical Research Council; NSERC; BMBF (German Ministry for Research and Education); the European Molecular Biology Laboratory; Plan Nacional de I + D and Instituto Carlos III; Swiss National Science Foundation, NCCR Frontiers in Genetics, the Swiss Cancer League and the 'Childcare' and 'J. Lejeune' Foundations; and the Ministry of Education, Culture, Sports, Science and Technology of Japan. The initial threefold sequence coverage was partly supported by the Mouse Sequencing Consortium (GlaxoSmithKline, Merck and Affymetrix) through the Foundation for the National Institutes of Health. We acknowledge A. Holden for coordinating the Mouse Sequencing Consortium. We thank the Sanger Institute systems group for maintenance and provision of the computer resource. The MGSC also used Hewlett-Packard Company's BioCluster, a configuration of 27 HP AlphaServer ES40 systems with 100 CPUs and 1 terabyte of storage. The BioCluster is housed in Hewlett-Packard's IQ Solutions Center, and was accessed remotely. The computing resource greatly accelerated the analysis. Authors' contributions: The following authors contributed to project leadership: R. H. Waterston, K. Lindblad-Toh, E. Birney, J. Rogers, M. R. Brent, F. S. Collins, R. Guigó, R. C. Hardison, D. Haussler, D. B. Jaffe, W. J. Kent, W. Miller, C. P. Ponting, A. Smit, M. C. Zody and E. S. Lander. Availability of sequence and assembly data: Unprocessed sequence reads are available from the NCBI trace archive (ftp://ftp.ncbi.nih.gov/pub/TraceDB/mus_musculus/). Raw assembly data (before removal of contaminants, anchoring to chromosomes, and addition of finished sequence) are available from the Whitehead Institute for Biomedical Research (WIBR) (ftp://wolfram.wi.mit.edu/pub/mouse_contigs/Mar10_02/). The released assembly MGSCv3 is available from Ensembl (http://www.ensembl.org/Mus_musculus/), NCBI (ftp://ftp.ncbi.nih.gov/genomes/M_musculus/MGSCv3_Release1/), UCSC (http://genome.ucsc.edu/downloads.html) and WIBR (ftp://wolfram.wi.mit.edu/pub/mouse_contigs/MGSC_V3/). (See Supplementary Information for detailed Methods.) The author declares no competing financial interests.Attached Files
Supplemental Material - nature01262-s1.doc
Supplemental Material - nature01262-s10.doc
Supplemental Material - nature01262-s11.jpg
Supplemental Material - nature01262-s12.jpg
Supplemental Material - nature01262-s13.jpg
Supplemental Material - nature01262-s14.jpg
Supplemental Material - nature01262-s15.jpg
Supplemental Material - nature01262-s16.jpg
Supplemental Material - nature01262-s2.doc
Supplemental Material - nature01262-s3.doc
Supplemental Material - nature01262-s4.doc
Supplemental Material - nature01262-s5.doc
Supplemental Material - nature01262-s6.doc
Supplemental Material - nature01262-s7.doc
Supplemental Material - nature01262-s8.doc
Supplemental Material - nature01262-s9.doc
Files
Name | Size | Download all |
---|---|---|
md5:741a8fc26a6b2fbb1c5ffd3c7ada17e4
|
19.5 kB | Download |
md5:afb69d161e6f0fe556e0ddab2089a002
|
19.5 kB | Download |
md5:4afb897b930a2369c8aed1678b8ef5ee
|
43.9 kB | Preview Download |
md5:23d03e1bacecdb4653b84709a4c30ecc
|
38.9 kB | Download |
md5:3e7fc52c00d3a84721c80ca64348849f
|
122.2 kB | Preview Download |
md5:7fb614c642dddbff337494aa42bda6c2
|
35.3 kB | Download |
md5:f213c9b9147f49349ed39fba884d0c8f
|
24.1 kB | Download |
md5:f07d963f40f27aa971c43d139eda7442
|
28.2 kB | Download |
md5:98b548e0bf8e30f3797ff92661e7fdc5
|
61.9 kB | Preview Download |
md5:d281b0e280946f05916ae2efcd9598a7
|
24.6 kB | Download |
md5:8c0b9d7063216b37eaf83050c833d729
|
20.0 kB | Download |
md5:f09b742b4b8826c8f1cb16d305fbb03f
|
239.9 kB | Preview Download |
md5:d8927491a1854f98c34abc7d9235afa4
|
107.5 kB | Download |
md5:7904908fad351c192431adda778b950d
|
80.3 kB | Preview Download |
md5:c7446becc576a546e1166fa13822f0c0
|
126.9 kB | Preview Download |
md5:7d5acde9d882764e17c43acc19804aa1
|
25.6 kB | Download |
Additional details
- Eprint ID
- 74965
- DOI
- 10.1038/nature01262
- Resolver ID
- CaltechAUTHORS:20170309-090859678
- NIH
- Wellcome Trust
- Howard Hughes Medical Institute (HHMI)
- Department of Energy (DOE)
- NSF
- Medical Research Council (UK)
- Natural Sciences and Engineering Research Council of Canada (NSERC)
- Bundesministerium für Bildung und Forschung (BMBF)
- European Molecular Biology Laboratory (EMBL)
- Plan Nacional de I + D (Spain)
- Instituto Carlos III
- Swiss National Science Foundation (SNSF)
- Swiss Cancer League
- ChildCare Foundation
- Jerome Lejeune Foundation
- Ministry of Education, Culture, Sports, Science and Technology (MEXT)
- Mouse Sequencing Consortium
- Created
-
2017-03-09Created from EPrint's datestamp field
- Updated
-
2021-11-15Created from EPrint's last_modified field