A Caltech Library Service

A statistical model for microarrays, optimal estimation algorithms, and limits of performance

Vikalo, Haris and Hassibi, Babak and Hassibi, Arjang (2006) A statistical model for microarrays, optimal estimation algorithms, and limits of performance. IEEE Transactions on Signal Processing, 54 (6, pt.). pp. 2444-2455. ISSN 1053-587X.

See Usage Policy.


Use this Persistent URL to link to this item:


DNA microarray technology relies on the hybridization process, which is stochastic in nature. Currently, probabilistic cross hybridization of nonspecific targets, as well as the shot noise (Poisson noise) originating from specific targets binding, are among the main obstacles for achieving high accuracy in DNA microarray analysis. In this paper, statistical techniques are used to model the hybridization and cross-hybridization processes and, based on the model, optimal algorithms are employed to detect the targets and to estimate their quantities. To verify the theory, two sets of microarray experiments are conducted: one with oligonucleotide targets and the other with complementary DNA (cDNA) targets in the presence of biological background. Both experiments indicate that, by appropriately modeling the cross-hybridization interference, significant improvement in the accuracy over conventional methods such as direct readout can be obtained. This substantiates the fact that the accuracy of microarrays can become exclusively noise limited, rather than interference (i.e., cross-hybridization) limited. The techniques presented in this paper potentially increase considerably the signal-to-noise ratio (SNR), dynamic range, and resolution of DNA and protein microarrays as well as other affinity-based biosensors. A preliminary study of the Cramer-Rao bound for estimating the target concentrations suggests that, in some regimes, cross hybridization may even be beneficial-a result with potential ramifications for probe design, which is currently focused on minimizing cross hybridization. Finally, in its current form, the proposed method is best suited to low-density arrays arising in diagnostics, single nucleotide polymorphism (SNP) detection, toxicology, etc. How to scale it to high-density arrays (with many thousands of spots) is an interesting challenge.

Item Type:Article
Related URLs:
URLURL TypeDescription
Additional Information:© Copyright 2006 IEEE. Reprinted with permission. Manuscript received May 1, 2005; revised January 7, 2006. [Posted online: 2006-06-05] This work was supported in part by an award from the David and Lucille Packard Foundation for Science and Engineering, a Grubstake Award from the California Institute of Technology, and by the Millard and Muriel Jacobs Genetics and Genomics Laboratory at California Institute of Technology. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Jaakko Astola.
Subject Keywords:Cross hybridization, DNA microarrays, maximum a posteriori, maximum likelihood, minimum-mean-square-error (MMSE) estimation, Poisson noise, quantum-limited SNR, shot noise, statistical modeling
Issue or Number:6, pt.
Record Number:CaltechAUTHORS:VIKieeetsp06
Persistent URL:
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:4772
Deposited By: Archive Administrator
Deposited On:06 Sep 2006
Last Modified:02 Oct 2019 23:16

Repository Staff Only: item control page