Selection of Dwarf Galaxies Hosting Active Galactic Nuclei: A Measure of Bias and Contamination Using Unsupervised Machine Learning Techniques
Creators
Abstract
Identifying active galactic nuclei (AGNs) in dwarf galaxies is critical for understanding black hole formation but remains observationally challenging due to their low luminosities, metallicities, and star formation–driven emission that can obscure AGN signatures. Machine learning techniques, particularly unsupervised methods, offer new ways to address these challenges by uncovering patterns in complex, multiwavelength data. In this study, we apply Self-Organizing Maps (SOMs) to explore the spectral energy distribution (SED) manifold of dwarf galaxies and evaluate AGN selection biases across various diagnostics. We train a 51 × 51 SOM on 30,344 dwarf galaxies (z < 0.055, M* < 109.5M⊙) from the NSA catalog using nine-band photometry spanning near-UV to mid-infrared. A set of 438 previously identified dwarf AGNs, selected via mid-infrared color, optical emission lines, X-ray, optical variability, and broad-line features, was mapped onto the SOM. AGNs identified by different methods occupy distinct and partially overlapping regions in SED space, reflecting biases related to host galaxy properties. Wide-field Infrared Survey Explorer (WISE)-selected AGNs are strongly concentrated in lower-mass regions and form two distinct clumps: one associated with bluer, starburst-like systems and the other with redder galaxies showing spectral features more typical of AGNs. This separation may help disentangle true AGN hosts from starburst contaminants in WISE-selected samples. Additionally, AGNs selected via various diagnostics tend to avoid regions of strong star formation, while a subset of lower-mass AGNs occupy SOM regions indicative of high AGN luminosity relative to their stellar content. Our results demonstrate the utility of manifold learning in refining AGN selection in the low-mass regime.
Copyright and License
© 2025. The Author(s). Published by the American Astronomical Society. Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
Acknowledgement
We thank Michael McDonald and Nahum Arav for helpful discussions, and Chengzhang Jiang for sharing ideas relevant to this work. We also acknowledge our collaborators for their input in shaping the direction of the project. Partial support for A.A. and G.C. was provided by NASA through a grant for program JWST-GO-03663 from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-03127.
Files
Sanjaripour_2025_ApJ_992_138.pdf
Files
(4.2 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:cd8f6d200f486483b6c90b837047c9fa
|
4.2 MB | Preview Download |
Additional details
Related works
- Is new version of
- Discussion Paper: arXiv:2505.16509 (arXiv)
Funding
- National Aeronautics and Space Administration
- JWST-GO-03663
- Space Telescope Science Institute
- National Aeronautics and Space Administration
- NAS 5-03127
Dates
- Accepted
-
2025-09-02