Supplemental Materials
February 26, 2022
1 Bayesian Differential Ranking
Conceptually, the goal of a differential analysis is to make a statement about change in abundance for a
given feature
i
between conditions A and B by evaluating the following null hypothesis:
A
i
B
i
= 1
However, most omic datasets do not provide a direct observation of the absolute quantities of
A
i
and
B
i
,
or the total microbial loads
N
A
i
and
N
B
i
, but rather only an observation of their proportions
p
A
i
and
p
B
i
,
respectively, within each dataset, which are determined by a bias term,
N
A
N
B
. This bias term, given by
A
i
B
i
=
p
A
i
N
A
p
B
i
N
B
=
p
A
i
p
B
i
N
A
N
B
results in high false discovery rates (FDRs) that cannot be adjusted for in models analyzing compositional
omics datasets because the overall contribution of
N
A
and
N
B
to change can not be unequivocally quantified
(Vandeputte et al. 2017; Hawinkel et al. 2019). To avoid the total biomass bias without having to resort
to performing traditional FDR corrections, we adopted a ranking approach that allowed us to sort omic
features by their log-fold change values independently of how large their change was in absolute terms
(Morton, Marotz, et al. 2019). Since the biomass bias impacts every species within a dataset equally, the
ranking approach ignores this bias, making the approach scale-invariant (Equation 1).
rank