Version November 4, 2024
Preprint typeset using L
A
T
E
X style openjournal v. 09/06/15
A GENERATIVE MODEL FOR
GAIA
ASTROMETRIC ORBIT CATALOGS: SELECTION FUNCTIONS FOR
BINARY STARS, GIANT PLANETS, AND COMPACT OBJECT COMPANIONS
Kareem El-Badry
1
,
2
, Casey Lam
3
, Berry Holl
4
,
5
, Jean-Louis Halbwachs
6
, Hans-Walter Rix
2
, Tsevi
Mazeh
7
, and Sahar Shahaf
8
1
Department of Astronomy, California Institute of Technology, 1200 E. California Blvd., Pasadena, CA 91125, USA
2
Max-Planck Institute for Astronomy, K ̈onigstuhl 17, D-69117 Heidelberg, Germany
3
Observatories of the Carnegie Institution for Science, 813 Santa Barbara St., Pasadena, CA 91101, USA
4
Department of Astronomy, University of Geneva, Chemin Pegasi 51, 1290 Versoix, Switzerland
5
Department of Astronomy, University of Geneva, Chemin d’Ecogia 16, 1290 Versoix, Switzerland
6
Universit ́e de Strasbourg, CNRS, Observatoire astronomique de Strasbourg, UMR 7550, 11 rue de l’Universit ́e, Strasbourg, France
7
School of Physics and Astronomy, Tel Aviv University, Tel Aviv, 6997801, Israel and
8
Department of Particle Physics and Astrophysics, Weizmann Institute of Science, Rehovot 7610001, Israel
Version November 4, 2024
ABSTRACT
Astrometry from
Gaia
DR3 has produced a sample of
∼
170,000 Keplerian orbital solutions, with
many more anticipated in the next few years. These data have enormous potential to constrain the
population of binary stars, giant planets, and compact objects in the Solar neighborhood. But in order
to use the published orbit catalogs for statistical inference, it is necessary to understand their selection
function: what is the probability that a binary with a given set of properties ends up in a catalog?
We show that such a selection function for the
Gaia
DR3 astrometric binary catalog can be forward-
modeled from the
Gaia
scanning law, including individual 1D astrometric measurements, the fitting of
a cascade of astrometric models, and quality cuts applied in post-processing. We populate a synthetic
Milky Way model with binary stars and generate a mock catalog of astrometric orbits. The mock
catalog is quite similar to the DR3 astrometric binary sample, suggesting that our selection function is
a sensible approximation of reality. Our fitting also produces a sample of spurious astrometric orbits
similar to those found in DR3; these are mainly the result of scan angle-dependent astrometric biases
in marginally resolved wide binaries. We show that
Gaia’s
sensitivity to astrometric binaries falls off
rapidly at high eccentricities, but only weakly at high inclinations. We predict that DR4 will yield
∼
1
million astrometric orbits, mostly for bright (
G
≲
15) systems with long periods (
P
orb
≳
1000 d). We
provide code to simulate and fit realistic
Gaia
epoch astrometry for any data release and determine
whether any hypothetical binary would receive a cataloged orbital solution.
Subject headings:
astrometry – catalogues – methods: statistical – binaries: general
1.
INTRODUCTION
The third data release of the
Gaia
mission (DR3; Gaia
Collaboration et al. 2016, 2023a) included astrometric or-
bital solutions for
∼
168
,
000 binary systems, represent-
ing a vast improvement in sample size and completeness
over all previous work (Gaia Collaboration et al. 2023b).
These data have already enabled discovery of several as-
trophysically interesting objects, including a sample of
compact objects in au-scale orbits with stellar compan-
ions (e.g. Shahaf et al. 2023) and a sample of giant plan-
ets orbiting nearby stars (Holl et al. 2023a). For a review
of the
Gaia
binary star sample, see El-Badry (2024).
Epoch-level astrometric data was not published in
DR3, and several quality cuts were imposed on the pub-
lished orbital solutions. These cuts depend on a variety of
quantities, such as the signal-to-noise ratio of the photo-
center semi-major axis, the orbital period, and the paral-
lax and eccentricity uncertainties. These cuts – combined
with the fact that
Gaia’s
sensitivity to astrometric orbits
Corresponding author: kelbadry@caltech.edu
depends on quantities such as the eccentricity and orien-
tation in ways that are difficult to predict analytically –
make it a nontrivial problem to use the
Gaia
astromet-
ric binary sample for population interference, and few
attempts at modeling the sample have been made so far.
In this paper, we use a model of the solar neighbor-
hood’s binary population to build a forward-model for
the
Gaia
DR3 astrometric binary catalog. This entails
modeling the
Gaia
observations at the level of individ-
ual epochs, predicting the observation times, scan an-
gles, and simulated one-dimensional astrometry for each
simulated source from the
Gaia
scanning law. We in-
clude common false positives and reproduce the cascade
of single-star, accelerating, and full orbital astrometric
models employed in producing the observed catalog. Be-
cause our modeling results in a mock catalog of orbital
solutions that resembles the one actually published in
DR3, we believe that it captures the most important se-
lection effects and false positives of the real catalog. It
can thus be used both to interpret the DR3 binary sam-
ple and to forecast what will be discoverable in future
data releases.
arXiv:2411.00088v1 [astro-ph.SR] 31 Oct 2024
2
The remainder of this paper is organized as follows. In
Section 2, we summarize the basics of
Gaia
observations
and the astrometric signal caused by a binary. Section 3
describes the Galactic model, simulated binary popula-
tion, and the assumptions we make to predict 1D epoch
astrometry. We discuss the astrometric model cascade
in Section 4 and present the resulting mock catalog in
Section 5. Finally, Section 6 presents a Monte Carlo se-
lection function that can be used to model and fit epoch
astrometry for arbitrary binaries. Section 7 summarizes
our main results.
This paper aims to develop a relatively detailed model
of
Gaia
orbit catalogs by forward-modeling epoch as-
trometry and the astrometric model cascade. A com-
panion paper, Lam et al. (2024), develops a model that
is more approximate but significantly less computation-
ally expensive.
2.
HOW GAIA OBSERVES
How
Gaia
observes has been summarized extensively
in other work, and we refer to Gaia Collaboration et al.
(2016) for a detailed description. We summarize the most
important aspects here.
Gaia
is equipped with two telescopes whose fields of
view are separated by 106.5 degrees. The satellite rotates
with a period of 6 hours, such that the two telescopes
sweep out an annulus with a width of about 0.7 deg on
the sky. The rotation axis precesses with a period of
63 days and a fixed tilt angle of 45 degrees with respect
to the Sun direction, causing this annulus to rotate on
the sky. At the same time, the spacecraft orbits the
Sun with a period of a year. The combination of these
three rotations results in full-sky coverage, though with
the observing cadence and distribution of scan angles
varying significantly across the sky (for details, see Holl
et al. 2023b). For stars brighter than
G
= 15, the median
number of visibility periods used in DR3 is 20, with a
(16-84)% range of 16-27. Here a visibility period refers
to a group of observations separated by other groups of
observations by at least 4 days. Data contributing to
DR3 solutions were obtained over a period of about 1000
d.
As a source moves across the astrometric field of view
(FOV), it is independently observed by 8-9 different
CCDs. Each CCD observation results in an independent
measurement of the source’s position in the along-scan
(AL) direction relative to a reference position assigned
to the source at a reference time. Crucially,
only
a 1D
measurement in the AL direction is made with high pre-
cision and used in the astrometric solution. At
G
≲
14
– which is the regime relevant to most astrometric bi-
naries published so far – the per-CCD observations cur-
rently reach an AL precision of order 0.12 mas. All the
CCD transits in a given FOV transit are spread over
a period of less than a minute, which is much shorter
than the orbital periods of astrophysically plausible bi-
naries to which
Gaia
is sensitive. The 8-9 CCD transits
can thus be regarded as essentially independent measure-
ments of the same quantity, and can be averaged to ob-
tain smaller uncertainties in the CCD-averaged measure-
ments. Our modeling in this work suggest that these
measurements are indeed nearly independent and not
systematics-dominated, such that the per-FOV transit
uncertainties are
∼
3 times smaller than the per-CCD
uncertainties (Appendix C).
Gaia
astrometry is built upon a global astrometric so-
lution, whose calculation requires the simultaneous opti-
mization of millions of attitude and calibration parame-
ters, and astrometric parameters for
∼
100 million stars
(Lindegren et al. 2012). Most of these parameters are of
little interest for the analysis of one source. It is pos-
sible to transform the astrometric measurements for a
single source to “local plane coordinates” in a tangent
plane to the unit sphere in the vicinity of each source,
as described by Lindegren & Bastian (2022). The as-
trometric measurements required to describe the motion
of a source can then be represented with just a small
file containing the AL displacements, their uncertainties,
and associated metadata (scan angles, parallax factors,
transit times, etc.) for each source. Thus far, such epoch
astrometry has been published for only one source: the
binary Gaia BH3 (Gaia Collaboration et al. 2024b).
3.
GALACTIC MODEL AND BINARY
POPULATION
We now describe the Galactic model and binary pop-
ulation from which we forward-model
Gaia
observations
and construct a mock astrometric orbit catalog. This will
allow us to validate our selection function by comparing
the mock catalog to the DR3 data.
3.1.
Galactic model
Our modeling uses
Galaxia
(Sharma et al. 2011),
which generates synthetic resolved-star surveys from the
Besan ̧con model of the Milky Way (Robin et al. 2003).
We use a modified version of the code described by Lam
et al. (2020). We only attempt to model sources within
2 kpc of the Sun, because 99% of all astrometric orbits
published in DR3 are found within 2 kpc.
Galaxia
predicts a total of 1.1 billion sources (all rep-
resenting single stars) within 2 kpc of the Sun. Com-
paring to the
Gaia
DR3
gaia
source
catalog, we find
that
Galaxia
predicts 1.4 times more sources than are
observed (with
parallax
over
error > 10
) within both
50 and 100 pc. Since
Gaia
is expected to be nearly com-
plete within these volumes for sources above the hydro-
gen burning limit (Gaia Collaboration et al. 2021), we
conclude that
Galaxia
predicts 40% too many sources
in the Solar neighborhood. We thus discard a random
subset of predicted sources, chosen independent of dis-
tance and apparent magnitude, to reduce source counts
by a factor of 1.4
3.2.
Binary population
Galaxia
does not include binary stars. We thus use
COSMIC
(Breivik et al. 2020) to generate a zero-age binary
population according to the model from Moe & Di Ste-
fano (2017). We assume a Kroupa (2001) primary mass
function
1
and draw mass ratios, periods, and eccentrici-
ties from the covariant, mass-dependent distributions in-
ferred by Moe & Di Stefano (2017). The assumed binary
fraction is
≈
41% for solar-type primaries,
2
increasing
1
Note that this is different from the more top-heavy mass func-
tion assumed in COSMIC by default.
2
This is lower than
f
mult
≈
0
.
5, the mean number of com-
panions per solar-type primary, because higher-order multiples
contribute more than one companion.
Only companions with
0
.
2
<
log (
P
orb
/
d)
<
8 are considered.
3
to 57% at
M
1
= 3
M
⊙
and 80% at
M
1
= 6
M
⊙
. Below
0
.
8
M
⊙
, the binary fraction is assumed to decrease lin-
early with log
M
1
, from 40% at 0
.
8
M
⊙
to 0 at 0
.
08
M
⊙
.
Almost all of the binaries predicted to be detectable in
DR3 have primary masses between 0.7 and 1
.
3
M
⊙
, so
it is the binary population in this mass range – which
is constrained primarily by the Raghavan et al. (2010)
survey of nearby G stars – that is most important for
our modeling.
COSMIC
predicts populations of single and binary stars.
We match both binaries and singles to the
Galaxia
-
predicted sources, generating sources until the total
number of stellar systems (singles plus binaries, with
each binary representing a single system) in the zero-
age
COSMIC
-predicted population exceeds the number of
Galaxia
sources within 2 kpc by 10%. This accounts for
the fact that the
Galaxia
population only contains stars
that are still alive; for a Kroupa IMF and constant star
formation history over 12 Gyr, about
≈
10% of all stars
that have been born will have already died.
Massive stars are preferentially found in the Galac-
tic disk and thus are subject to higher extinction than
typical lower-mass stars. To account for this, we place
each simulated binary at the position of the
Galaxia
star whose mass is closest to that of the primary. We
assign inclinations by drawing from a sin(
i
) distribution.
We draw longitudes of the ascending node, Ω, and ar-
guments of periastron,
ω
, from
U
(0
,
2
π
). We define the
reference phase as
φ
0
=
2
πT
p
P
orb
, where
T
p
is the epoch of
periastron, and draw
φ
0
from
U
(0
,
1).
We assign
G
−
band absolute magnitudes to the pri-
mary and secondary components using the
isochrones
package (Morton 2015), using
MIST
models (Choi et al.
2016) and assuming a uniform age distribution between
0 and 12 Gyr. We remove binaries with initial masses
>
8
M
⊙
whose primaries have died, assuming that most
would-be black hole or neutron star binaries are de-
stroyed or dramatically tightened by common envelope
evolution and supernova kicks. We similarly assume that
most binaries containing white dwarfs (WDs; i.e., pri-
maries with initial masses
<
8
M
⊙
that have terminated
their evolution) shrink to short periods and are not de-
tectable astrometrically. However, motivated by the re-
sults of Shahaf et al. (2024) and Yamaguchi et al. (2024),
we assume that 10% of WD + luminous star binaries with
initial separations of (2
−
6) au form wide post-common
envelope binaries with separations that are 50% of their
initial separations. This prescription is quite simplified
and is not expected to capture all aspects of the observed
population. The 10% fraction is chosen to approximately
match the number of binaries with large mass functions
in the observed catalog (Section 3.4). We assign WD
masses using the initial-final mass relation of Weidemann
(2000). We remove binaries in which either star currently
fills its Roche lobe or would have filled it previously (e.g.,
red clump stars that would have filled their Roche lobes
at the tip of the first giant branch). Because most bi-
naries that are astrometrically detectable contain main-
sequence stars in au-scale orbits, the treatment of evolu-
tionary effects has minor effects on the overall properties
of the observable population.
We calculate extinctions to each binary using the
combined19
dust map in the
mwdust
package (Bovy et al.
4
6
8
10
12
14
16
18
20
G
[mag]
10
1
10
0
σ
η
[mas]
per CCD
per FOV transit
Fig. 1.—
Assumed AL displacement uncertainties. Black line
shows the uncertainty per CCD transit, adopted from Holl et al.
(2023a). Tan line shows our assumed uncertainty per FOV transit
and is smaller than the black line by a factor of
√
8, representing the
result of averaging measurements from an average of 8 uncorrelated
CCD measurements per FOV transit. The uncertainties are small-
est at
G
= 8
−
14 and are dominated by calibration systematics
in this magnitude range, but we find (Appendix C) that measure-
ments from individual CCDs can still be usefully combined to yield
FOV transit-averaged uncertainties a factor of
≈
√
8 smaller. The
uncertainties at
G
≳
14 are photon-limited and thus rise rapidly
toward faint magnitudes. Calibration problems worsen at
G
≲
8.
2016), which combines the maps from Drimmel et al.
(2003), Marshall et al. (2006), and Green et al. (2019).
We assume
A
G
= 2
.
8
E
(
B
−
V
), where
E
(
B
−
V
) is on
the Schlegel et al. (1998) scale. We then calculate each
binary’s total
G
−
band apparent magnitude, angular sep-
aration,
ρ
, and magnitude difference, ∆
G
=
G
2
−
G
1
.
Binaries that are resolved or marginally resolved are re-
moved from the sample. As we show in Appendix B,
the transition between resolved and unresolved binaries
depends on both
ρ
and ∆
G
, and in DR3 can be approx-
imated as ∆
G
[mag] =
1
25