Published October 10, 2025 | Version Published
Journal Article Open

QZO: A Catalog of 5 Million Quasars from the Zwicky Transient Facility

  • 1. ROR icon California Institute of Technology
  • 2. ROR icon Jet Propulsion Lab
  • 3. ROR icon University of Washington
  • 4. ROR icon Infrared Processing and Analysis Center
  • 5. ROR icon Drexel University

Abstract

Machine learning methods are well established in the classification of quasars (QSOs). However, the advent of light-curve observations adds a great amount of complexity to the problem. Our goal is to use the Zwicky Transient Facility (ZTF) to create a catalog of QSOs. We process the ZTF DR20 light curves with a transformer artificial neural network and combine different surveys with extreme gradient boosting. Based on ZTF g-band and Wide-field Infrared Survey Explorer (WISE) observations, we find 4,849,574 objects classified as QSOs with confidence higher than 90% (QZO). We robustly classify objects fainter than the 5σ signal-to-noise ratio (SNR) limit at g = 20.8 by requiring g < nobs/80 + 20.375. For 33% of QZO objects, with available WISE data, we publish redshifts with estimated error Δz/(1 + z) = 0.14. We find that ZTF classification is superior to the Pan-STARRS static bands, and on par with WISE and Gaia measurements, but the light curves provide the most important features for QSO classification in the ZTF data set. Using ZTF g-band data with at least 100 observational epochs per light curve, we obtain a 97% F1 score for QSOs. We find that with 3 day median cadence, a survey time span of at least 900 days is required to achieve a 90% QSO F1 score. However, one can obtain the same score with a survey time span of 1800 days and the median cadence prolonged to 12 days.

Copyright and License

© 2025. The Author(s). Published by the American Astronomical Society. Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Acknowledgement

S.J.N. and M.J.G. are supported by the US National Science Foundation (NSF) through grant AST-2108402. Based on observations obtained with the Samuel Oschin Telescope 48 inch and the 60 inch Telescopes at the Palomar Observatory as part of ZTF. ZTF is supported by the National Science Foundation under grant Nos. AST-1440341 and AST-2034437, and a collaboration including current partners Caltech, IPAC, the Oskar Klein Center at Stockholm University, the University of Maryland, University of California, Berkeley, the University of Wisconsin at Milwaukee, University of Warwick, Ruhr University Bochum, Cornell University, Northwestern University, and Drexel University. Operations are conducted by COO, IPAC, and University of Wisconsin (UW). WISE, which is a joint project of the University of California, Los Angeles, and the Jet Propulsion Laboratory/California Institute of Technology, and NEOWISE, which is a project of the Jet Propulsion Laboratory/California Institute of Technology. WISE and NEOWISE are funded by the National Aeronautics and Space Administration. European Space Agency (ESA) mission Gaia,10 processed by the Gaia Data Processing and Analysis Consortium (DPAC).11 Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. The PS1 surveys and the PS1 public science archive have been made possible through contributions by the Institute for Astronomy, the University of Hawaii, the Pan-STARRS Project Office, the Max-Planck Society and its participating institutes, the Max Planck Institute for Astronomy, Heidelberg and the Max Planck Institute for Extraterrestrial Physics, Garching, The Johns Hopkins University, Durham University, the University of Edinburgh, the Queen’s University Belfast, the Harvard-Smithsonian Center for Astrophysics, the Las Cumbres Observatory Global Telescope Network Incorporated, the National Central University of Taiwan, the Space Telescope Science Institute, the National Aeronautics and Space Administration under grant No. NNX08AR22G issued through the Planetary Science Division of the NASA Science Mission Directorate, the National Science Foundation grant No. AST-1238877, the University of Maryland, Eotvos Lorand University (ELTE), the Los Alamos National Laboratory, and the Gordon and Betty Moore Foundation. Based on observations obtained at the Hale Telescope, Palomar Observatory, as part of a collaborative agreement between the Caltech Optical Observatories and the Jet Propulsion Laboratory. The Gordon and Betty Moore Foundation, through both the Data-Driven Investigator Program and a dedicated grant, provided critical funding for SkyPortal. PS data can be found in MAST (STScI 2022), AllWISE (E. L. Wright et al. 2019), and Gaia EDR3 (Gaia Collaboration 2020) data can be found in IPAC.

Data Availability

We release the catalog and models in Zenodo at DOI: 10.5281/zenodo.16410988. The code and notebooks are available in GitHub12 and in Zenodo (S. Nakoneczny 2025).

Facilities

PO:1.2m - Palomar Observatory's 1.2 meter Samuel Oschin Telescope, Sloan - Sloan Digital Sky Survey Telescope, PS1 - Panoramic Survey Telescope and Rapid Response System Telescope #1 (Pan-STARRS), WISE - Wide-field Infrared Survey Explorer, Gaia - .

Software References

Python 3 (G. Van Rossum & F. L. Drake 2009), TensorFlow (M. Abadi et al. 2015), Keras (F. Chollet et al. 2015), Astromer (C. Donoso-Oliva et al. 2023), XGBoost (T. Chen & C. Guestrin 2016), Scikit-learn (F. Pedregosa et al. 2011), Astropy (Astropy Collaboration et al. 201320182022), SkyPortal (S. J. van der Walt et al. 2019; M. W. Coughlin et al. 2023), NumPy (C. R. Harris et al. 2020), SciPy (P. Virtanen et al. 2020), IPython (F. Pérez & B. E. Granger 2007), Pandas (W. McKinney 2010), Matplotlib (J. D. Hunter 2007), seaborn (M. L. Waskom 2021), tqdm (C. da Costa-Luis 2019).

Files

Nakoneczny_2025_ApJ_992_153.pdf

Files (3.8 MB)

Name Size Download all
md5:a93affc36a9971b8080be72ccd04cc0f
3.8 MB Preview Download

Additional details

Related works

Is new version of
Discussion Paper: arXiv:2502.13054 (arXiv)
Is supplemented by
Dataset: 10.5281/zenodo.16410988 (DOI)

Funding

National Science Foundation
AST-2108402
National Science Foundation
AST-1440341
National Science Foundation
AST-2034437
National Aeronautics and Space Administration
NNX08AR22G
National Science Foundation
AST-1238877
University of Maryland, Baltimore
Eötvös Loránd University
Los Alamos National Laboratory
Gordon and Betty Moore Foundation

Dates

Accepted
2025-07-25
Available
2025-10-10
Published online

Caltech Custom Metadata

Caltech groups
Astronomy Department, Center for Data-Driven Discovery (CDDD), Infrared Processing and Analysis Center (IPAC), Zwicky Transient Facility, Division of Physics, Mathematics and Astronomy (PMA)
Publication Status
Published