CaltechAUTHORS
  A Caltech Library Service

Optimizing spectroscopic follow-up strategies for supernova photometric classification with active learning

Ishida, E. E. O. and Beck, R. and González-Gaitán, S. and de Souza, R. S. and Krone-Martins, A. and Barrett, J. W. and Kennamer, N. and Vilalta, R. and Burgess, J. M. and Quint, B. and Vitorelli, A. Z. and Mahabal, A. and Gangler, E. (2019) Optimizing spectroscopic follow-up strategies for supernova photometric classification with active learning. Monthly Notices of the Royal Astronomical Society, 483 (1). pp. 2-18. ISSN 0035-8711. https://resolver.caltech.edu/CaltechAUTHORS:20190411-143047160

[img] PDF - Published Version
See Usage Policy.

4Mb
[img] PDF - Submitted Version
See Usage Policy.

3230Kb

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20190411-143047160

Abstract

We report a framework for spectroscopic follow-up design for optimizing supernova photometric classification. The strategy accounts for the unavoidable mismatch between spectroscopic and photometric samples, and can be used even in the beginning of a new survey – without any initial training set. The framework falls under the umbrella of active learning (AL), a class of algorithms that aims to minimize labelling costs by identifying a few, carefully chosen, objects that have high potential in improving the classifier predictions. As a proof of concept, we use the simulated data released after the SuperNova Photometric Classification Challenge (SNPCC) and a random forest classifier. Our results show that, using only 12 per cent the number of training objects in the SNPCC spectroscopic sample, this approach is able to double purity results. Moreover, in order to take into account multiple spectroscopic observations in the same night, we propose a semisupervised batch-mode AL algorithm that selects a set of N = 5 most informative objects at each night. In comparison with the initial state using the traditional approach, our method achieves 2.3 times higher purity and comparable figure of merit results after only 180 d of observation, or 800 queries (73 per cent of the SNPCC spectroscopic sample size). Such results were obtained using the same amount of spectroscopic time necessary to observe the original SNPCC spectroscopic sample, showing that this type of strategy is feasible with current available spectroscopic resources. The code used in this work is available in the COINtoolbox.


Item Type:Article
Related URLs:
URLURL TypeDescription
https://doi.org/10.1093/mnras/sty3015DOIArticle
https://arxiv.org/abs/1804.03765arXivDiscussion Paper
https://github.com/COINtoolbox/ActSNClassRelated ItemCode
ORCID:
AuthorORCID
Ishida, E. E. O.0000-0002-0406-076X
González-Gaitán, S.0000-0001-9541-0317
Vilalta, R.0000-0001-8165-8805
Burgess, J. M.0000-0003-3345-9515
Quint, B.0000-0002-1557-3560
Mahabal, A.0000-0003-2242-0244
Additional Information:© 2018 The Author(s) Published by Oxford University Press on behalf of the Royal Astronomical Society. This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model). Accepted 2018 November 5. Received 2018 November 4; in original form 2018 April 19. Published: 06 November 2018. This work was created during the 4th COIN Residence Program (CRP#4), held in Clermont-Ferrand, France on August 2017, with support from Université Clermont-Auvergne and La Région Auvergne-Rhône-Alpes. This project is financially supported by CNRS as part of its MOMENTUM programme over the 2018–2020 period. EEOI thanks Michele Sasdelli for comments on the draft and Isobel Hook for useful discussions. AKM acknowledges the support from the Portuguese Fundação para a Ciência e a Tecnologia (FCT) through grants SFRH/BPD/74697/2010, from the Portuguese Strategic Programme UID/FIS/00099/2013 for CENTRA, the ESA contract AO/1-7836/14/NL/HB and Caltech Division of Physics, Mathematics and Astronomy for hosting a research leave during 2017-2018, when this paper was prepared. RSS thanks the support from NASA under the Astrophysics Theory Program Grant 14-ATP14-0007. RB acknowledges support from the National Science Foundation (NSF) award 1616974 and the NKFI NN 114560 grant of Hungary. BQ acknowledges financial support from CNPq-Brazil under the process number 205459/2014-5. AZV acknowledges financial support from CNPq. AM thanks partial support from NSF through grants AST-0909182, AST-1313422, AST-1413600, and AST-1518308. This work has made use of the computing facilities of the Laboratory of Astroinformatics (IAG/USP, NAT/Unicsul), whose purchase was made possible by the Brazilian agency FAPESP (grant 2009/54006-4) and the INCT-A. This work was partly supported by the Center for Advanced Computing and Data Systems (CACDS) and by the Texas Institute for Measurement, Evaluation, and Statistics (TIMES) at the University of Houston. This project has been supported by a Marie Sklodowska-Curie Innovative Training Network Fellowship of the European Commission’s Horizon 2020 Programme under contract number 675440 AMVA4NewPhysics. The Cosmostatistics Initiative (COIN) is a non-profit organization whose aim is to nourish the synergy between astrophysics, cosmology, statistics, and machine learning communities. This work benefited from the following collaborative platforms: Overleaf, Github, and Slack.
Funders:
Funding AgencyGrant Number
Centre National de la Recherche Scientifique (CNRS)UNSPECIFIED
Fundação para a Ciência e a Tecnologia (FCT)SFRH/BPD/74697/2010
Fundação para a Ciência e a Tecnologia (FCT)UID/FIS/00099/2013
European Space Agency (ESA)AO/1-7836/14/NL/HB
Caltech Division of Physics, Mathematics and AstronomyUNSPECIFIED
NASA14-ATP14-0007
NSFAST-1616974
National Research, Development and Innovation Office (Hungary)NN 114560
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)205459/2014-5
NSFAST-0909182
NSFAST-1313422
NSFAST-1413600
NSFAST-1518308
Fundação de Amparo à Pesquisa do Estado de Sao Paulo (FAPESP)2009/54006-4
Instituto Nacional de Ciência e Tecnologia (INCT)UNSPECIFIED
Center for Advanced Computing and Data Systems (CACDS)UNSPECIFIED
Texas Institute for Measurement, Evaluation, and Statistics (TIMES)UNSPECIFIED
University of HoustonUNSPECIFIED
European Research Council (ERC)675440
Subject Keywords:methods: data analysis –methods: observational – supernovae: general
Issue or Number:1
Record Number:CaltechAUTHORS:20190411-143047160
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20190411-143047160
Official Citation:E E O Ishida, R Beck, S González-Gaitán, R S de Souza, A Krone-Martins, J W Barrett, N Kennamer, R Vilalta, J M Burgess, B Quint, A Z Vitorelli, A Mahabal, E Gangler, COIN collaboration, Optimizing spectroscopic follow-up strategies for supernova photometric classification with active learning, Monthly Notices of the Royal Astronomical Society, Volume 483, Issue 1, February 2019, Pages 2–18, https://doi.org/10.1093/mnras/sty3015
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:94668
Collection:CaltechAUTHORS
Deposited By: Tony Diaz
Deposited On:11 Apr 2019 22:18
Last Modified:09 Mar 2020 13:18

Repository Staff Only: item control page