
Rule-based Cross-matching of Very Large Catalogs

Ogle, Patrick M. and Mazzarella, Joe and Ebert, Rick and Fadda, Dario and Lo, Tak and Terek, Scott and Schmitz, Marion (2015) Rule-based Cross-matching of Very Large Catalogs. In: Astronomical Data Analysis Software and Systems XXIV. Astronomical Society of the Pacific conference series. No. 495. Astronomical Society of the Pacific, San Francisco, CA, pp. 25-35. ISBN 9781583818749.



The NASA/IPAC Extragalactic Database (NED) has deployed a new rule-based cross-matching algorithm called Match Expert (MatchEx), capable of cross-matching very large catalogs (VLCs) with more than 10 million objects. MatchEx goes beyond traditional position-based cross-matching algorithms by using other available data together with expert logic to determine which candidate match is the best. Furthermore, the local background density of sources is used to determine and minimize the false-positive match rate and to estimate match completeness. The logical outcome and statistical probability of each match decision are stored in the database and may be used to tune the algorithm and adjust match parameter thresholds. For our first production run, we cross-matched the GALEX All Sky Survey Catalog (GASC), containing nearly 40 million NUV-detected sources, against a directory of 180 million objects in NED. Candidate matches were identified for each GASC source within a 7.5 arcsec radius. These candidates were filtered on position-based matching probability and on other criteria, including object type and object name. We estimate a match completeness of 97.6% and a match accuracy of 99.75%. Over the next year, we will be cross-matching over 2 billion catalog sources to NED, including the Spitzer Source List, the 2MASS point-source catalog, AllWISE, and SDSS DR10. We expect to add new capabilities to filter candidate matches based on photometry, redshifts, and refined object classifications. We will also extend MatchEx to handle more heterogeneous datasets federated from smaller catalogs through NED's literature pipeline.
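
The abstract does not give the MatchEx formulas, so the following is only a minimal sketch of the general idea of radius-limited candidate selection combined with a background-density estimate of the false-positive rate. It assumes a uniform Poisson background, so that the chance of an unrelated source falling within separation r is 1 - exp(-pi r^2 rho). The function names, the dictionary layout of the sources, and the `max_chance` threshold are hypothetical and are not taken from the paper; only the 7.5 arcsec search radius comes from the abstract.

```python
import math

ARCSEC_PER_DEG = 3600.0


def angular_separation_arcsec(ra1, dec1, ra2, dec2):
    """Great-circle separation in arcseconds (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(math.sqrt(min(1.0, h)))) * ARCSEC_PER_DEG


def chance_match_probability(sep_arcsec, density_per_sq_arcsec):
    """Probability that at least one unrelated source lies within sep_arcsec.

    Assumption (not from the paper): background sources are a uniform
    Poisson process with the given local surface density.
    """
    return 1.0 - math.exp(-math.pi * sep_arcsec ** 2 * density_per_sq_arcsec)


def best_match(source, candidates, density_per_sq_arcsec,
               radius_arcsec=7.5, max_chance=0.1):
    """Return (candidate, separation, chance probability) or None.

    Keeps candidates inside the search radius whose chance-match
    probability is at most max_chance (a hypothetical threshold), then
    picks the nearest one. Rules on object type or name are omitted.
    """
    best = None
    for cand in candidates:
        sep = angular_separation_arcsec(source["ra"], source["dec"],
                                        cand["ra"], cand["dec"])
        if sep > radius_arcsec:
            continue  # outside the candidate search radius
        p = chance_match_probability(sep, density_per_sq_arcsec)
        if p > max_chance:
            continue  # too likely to be a chance alignment
        if best is None or sep < best[1]:
            best = (cand, sep, p)
    return best


if __name__ == "__main__":
    src = {"ra": 10.0, "dec": 0.0}
    cands = [
        {"name": "A", "ra": 10.0, "dec": 2.0 / 3600.0},
        {"name": "B", "ra": 10.0, "dec": 6.0 / 3600.0},
        {"name": "C", "ra": 10.0, "dec": 10.0 / 3600.0},
    ]
    print(best_match(src, cands, density_per_sq_arcsec=1e-3))
```

In this toy run, candidate C lies outside the search radius and candidate B is rejected because its chance-match probability exceeds the illustrative threshold, leaving A as the accepted match. A production system would replace the linear scan with a spatial index and would add the non-positional rules the abstract mentions.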

Item Type: Book Section
ORCID:
Ogle, Patrick M.: 0000-0002-3471-981X
Mazzarella, Joe: 0000-0002-8204-8619
Alternate Title: Rule-based Cross-matching of Very Large Catalogs in NED
Additional Information: © 2015 Astronomical Society of the Pacific. The NASA/IPAC Extragalactic Database (NED) is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration.
Group: Infrared Processing and Analysis Center (IPAC)
Record Number: CaltechAUTHORS:20160324-081318970
Usage Policy: No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code: 65643
Deposited By: Tony Diaz
Deposited On: 30 Mar 2016 23:48
Last Modified: 05 Oct 2017 21:34
