Professional Documents
Culture Documents
ABSTRACT
It is well established that the chemical structure of the Milky Way exhibits a bimodality with
respect to the α-enhancement of stars at a given [Fe/H]. This has been studied largely based on a
bulk α abundance, computed as a summary of several individual α-elements. Inspired by the expected
subtle differences in their nucleosynthetic origins, here we probe the higher level of granularity encoded
in the inter-family [Mg/Si] abundance ratio. Using a large sample of stars with APOGEE abundance
measurements, we first demonstrate that there is additional information in this ratio beyond what
is already apparent in [α/Fe] and [Fe/H] alone. We then consider Gaia astrometry and stellar age
estimates to empirically characterize the relationships between [Mg/Si] and various stellar properties.
We find small but significant trends between this ratio and α-enhancement, age, [Fe/H], location in
the Galaxy, and orbital actions. To connect these observed [Mg/Si] variations to a physical origin,
we attempt to predict the Mg and Si abundances of stars with the galactic chemical evolution model
Chempy. We find that we are unable to reproduce abundances for the stars that we fit, which highlights
tensions between the yield tables, the chemical evolution model, and the data. We conclude that a more
data-driven approach to nucleosynthetic yield tables and chemical evolution modeling is necessary to
maximize insights from large spectroscopic surveys.
Keywords: Galaxy: abundances – disk – evolution – formation – stars: abundances – methods: data
analysis
Beyond the solar neighborhood, large surveys such as terms of their orbital actions (Jφ , JR , Jz ) at all stellar
APOGEE and Gaia have enabled the empirical character- ages.
ization of the bimodal α sequence throughout the Milky While differences between the high and low-α se-
Way disk (Bovy et al. 2012b; Nidever et al. 2014; Hayden quence have been characterized empirically, the origin
et al. 2015; Mackereth et al. 2017). For instance, using of the bimodality remains an open question. Nonethe-
SDSS/SEGUE spectra Bovy et al. (2012b) examine how less, recent simulation work modeling Milky Way-like
the α abundances of stars vary with location from disk galaxies has made substantial progress (Grand et al.
midplane (|z| ∼ 0.3 - 3 kpc), as well as with Galactocen- 2018; Mackereth et al. 2018; Clarke et al. 2019). For
tric radius (R = 5 - 12 kpc). They find that compared example, analyzing the large-volume EAGLE cosmolog-
to stars with low-α abundances, the population of stars ical simulation, Mackereth et al. (2018) find that a bi-
with the highest [α/Fe] enrichment are more vertically modal α sequence occurs in galaxies that experience a
extended, yet radially concentrated. Using a larger sam- early phase of anomalously rapid mass accretion. Con-
ple of stars from APOGEE, Hayden et al. (2015) confirm sequently, they report that only ∼5% of Milky Way-like
this result and further characterize how the ratio of stars galaxies in the EAGLE volume exhibit a bimodal α se-
with low versus high-α abundances varies throughout quence similar to our Galaxy’s, implying that this chem-
the disk. They find that the inner disk (R . 9 kpc) is ical structure is rare. More recently, Clarke et al. (2019)
comprised of both low and high-α sequence stars, how- propose that clumpy star-formation is responsible for
ever low-α sequence stars are primarily confined close the observed bimodal α sequence. Using GASOLINE to
to the disk midplane (|z| . 1 kpc) and high-α sequence perform high-resolution simulations, Clarke et al. (2019)
stars are primarily located at |z| ∼ 0.5 - 2 kpc. Contrary reproduce the Milky Way’s chemical bimodality in the
to the inner disk, at all distances from the midplane the [O/Fe]-[Fe/H] plane. Investigating the birth sites of
outer disk (R & 9 kpc) is markedly devoid of stars with stars in the two sequences, they find that stars in the
high-α abundances. This observed chemical structure high-α sequence are formed in clumps that start off with
of the Milky Way has been cited as evidence for “inside- low-α abundances, but rapidly self-enrich due to high
out” (Larson 1976) and “upside-down” (Bournaud et al. star-formation rates (SFR). Meanwhile, low-α sequence
2009) formation scenarios of the disk components of our stars are the product of a more extended star-formation
Galaxy, where radially, the central disk was formed be- mode that occurs with a substantially lower SFR. Since
fore the outer disk, and vertically, the thick disk was the incidence of clumps are common in high redshift
formed before the thin disk. galaxies, Clarke et al. (2019) conclude that chemical bi-
Although the α-enriched component of the Milky Way modality should be prevalent among Milky Way-massed
has come to be associated with a “hotter disk” and the galaxies.
solar-α component often associated with a “cooler disk” Typically, the α abundances investigated in observa-
(Bensby et al. 2003; Navarro et al. 2011; Bovy et al. tional and theoretical studies are computed as an aver-
2012b), the origin of these two populations, and whether age of many individual α-elements. However, there is
they are unique, is still debated (Haywood et al. 2016; additional information in the relative abundance of dif-
Toyouchi & Chiba 2016; Bovy et al. 2012a). Note that ferent α-elements themselves. We know from stellar nu-
these hotter and cooler populations do not necessarily cleosynthesis theory that the different α-elements vary
follow the same morphology as have previously been in the details of their production mechanisms. Given
identified as the “thick” and “thin” disks (Gilmore & the precision of recent spectroscopic surveys like APOGEE,
Reid 1983). However, there has been growing evidence GALAH, and LAMOST, and the many abundance measure-
that in addition to their chemical differences, the two ments now available for hundreds of thousands of stars,
sequences are also dynamically distinct. For example, we can begin to go beyond considering a mean α abun-
using a sample of stars from APOGEE and Gaia, Mack- dance and examine the information encoded by individ-
ereth et al. (2019) find that at fixed age and [Fe/H], the ual α-elements. Full exploitation of the information con-
low and high-α sequence display different age-velocity tained in these multi-element abundance vectors, and
dispersion relationships with respect to both radial and how this vector varies with dynamical properties, is still
vertical velocity dispersion. As discussed in Mackereth underway. Recently, Weinberg et al. (2018) has mapped
et al. (2019), these kinematic differences between the multiple APOGEE abundances from R = 3 - 15 kpc and
low- and high-α sequence are suggestive of disparate |z| = 0 - 2 kpc. This includes the comparison of var-
heating mechanisms contributing to the formation of the ious elements to Mg, including α-elements (like S, Si,
two populations. Similarly, Gandhi & Ness (2019) re- O, and Ca), light odd-Z elements (like Al, P, K, and
port differences between the low and high-α sequence in Z), as well as iron-peak elements (like Cr, Mn, Fe, V,
[Mg/Si] variations 3
Co, and Ni). Through this broad exploration, they find how the ratio of Mg to Si varies with age, as well as
small variations among the different abundance ratios dynamics, and find that the low and high-α sequences
throughout the disk. By considering these subtle dif- exhibit markedly different behavior. To put these re-
ferences between inter-family element combinations in sults in the context of the Galaxy’s chemical evolution,
depth, more details of the Galaxy’s formation history, we further attempt chemical evolution modeling to ex-
and the physics of nucleosynthesis, can be gleaned. tract both ISM and stellar population parameters at the
For example, a detailed examination of an inter-family time of star-formation. While in this study we concen-
abundance ratio has been carried out for the Sagittarius trate only on the ratio of Mg to Si, our methods are
dwarf galaxy, where stars are observed to be deficient in generalizable. An in-depth analysis of many inter-family
magnesium (Mg) compared to silicon (Si) (McWilliam abundance ratios, and even a full matrix of element ra-
et al. 2013; Hasselquist et al. 2017; Carlin et al. 2018). tios, appears to be a promising avenue for extracting the
The discrepancy between the enrichment of Mg and Si most information from large spectroscopic surveys.
has been interpreted as suggesting the Sagittarius galaxy This paper is organized as follows. First, in Section 2
was formed with a “top-light” stellar initial mass func- we use yield tables to explore theoretical Mg and Si con-
tion (IMF), meaning an IMF with fewer high-mass stars tributions from different nucleosynthetic sources. Then
compared to the canonical IMF. This is argued to be in Section 3 we introduce the data and methods of the
a consequence of varying yield dependencies on stellar paper including: the APOGEE sub-sample, clustering of
mass between α-elements like Mg and O and α-elements the low and high-α sequences, and empirical motivation
like Si and Ca. However, this argument is complicated for examining the ratio of Mg to Si. In Section 4 we
by the fact that some α-elements (like Si) are also pro- present the main empirical results of the paper, show-
duced by SN Ia (Tinsley 1979). ing how [Mg/Si] varies: between the low and high-α se-
The interpretation of Sagittarius [Mg/Si] observations quences, with age and metallicity, throughout the Galac-
are supported by theoretically motivated expected dif- tic disk, and with orbital parameters. In the second half
ferences in how various α-elements are produced. As of the paper we focus on interpreting these empirical re-
discussed in Hasselquist et al. (2017), hydrostatic α- sults within the context of Galactic chemical evolution.
elements, such as Mg and O, are produced in massive To do this, in Section 5 we use Chempy to fit chemi-
stars during the hydrostatic burning phase, and these cal evolution models to a sample of APOGEE stars and
elements get ejected into the ISM during CC-SN explo- examine variations with the IMF slope, number of SN
sions. Contrary, explosive α-elements, such as Si and Ca, Ia, and ISM parameters. In this section we discuss the
are produced in massive stars during the explosive nu- limitations we encounter when attempting to fit these
cleosynthesis leading up to the CC-SN explosion. While models to a diverse set of stars. Finally, in Section 6 we
both hydrostatic and explosive α-elements are produced distill the main takeaways of both the empirical results
in massive stars and released via CC-SN, explosive α- and attempted chemical evolution modeling, and discuss
elements are produced in shells that lie closer to the potential paths forward.
cores of massive stars, whereas hydrostatic α-elements
are produced in the outermost shells. This makes the 2. EXPECTED MG AND SI YIELDS
yields of hydrostatic α-elements more dependent on the
Before empirically characterizing the ratio of magne-
mass of the star, while explosive α-element yields are rel-
sium to silicon throughout the Milky Way, we further
atively independent of stellar mass (Woosley & Weaver
motivate examining this abundance ratio by exploring
1995). In this regard, different α-elements can be used
theoretical Mg and Si yields as expected from yield ta-
to probe the population of massive stars at a given epoch
bles. We do this using the Galactic chemical evolution
of star-formation.
code Chempy (Rybizki et al. 2017), which is discussed in
Inspired by the Sagittarius results, as well as the in-
more detail in Section 5. In short, using Chempy we ini-
creasing precision and wealth of multi-abundance mea-
tialize a simple stellar population (SSP) with a Chabrier
surements of stars located throughout the Milky Way’s
IMF (Chabrier 2003) and evolve the SSP from 0 to 13.5
disk, in this paper we perform a focused investigation of
Gigayears in 1350 linear-spaced steps, keeping track of
the magnesium to silicon abundance ratio. By consider-
enrichment from CC-SN, SN Ia, and asymptotic giant
ing this specific inter-family element ratio, we are able to
branch (AGB) stars. The yields from CC-SN and AGB
attain a more resolved understanding of how α-elements
stars both depend on the mass of dying stars at each
vary in the Galaxy. We are also able to isolate particular
time step, whereas yields from SN Ia, parameterized as
enrichment events associated only with the production
a power-law delay time distribution (DTD) (Maoz et al.
of these inter-family elements. We do this by examining
2010), are independent of stellar mass. The yields we
4 Blancato et al.
Mg net yield (M )
3
10
Mg net yield (M )
is the newly synthesized material from these nucleosy- 10 3
thetic channels.
We consider two initial SSP metallicities. This in-
cludes an SSP with a solar-like metallicity (Z = .01), as
well as a metal-poor SSP with Z = .0001. We also com-
total Z = .0001
pute SSPs assuming two sets of yields tables. The first total Z = .0001
is the Chempy default yield tables which includes CC- CC-SN Z = .01
CC-SN Z = .01
SN yields from Nomoto et al. (2013), SN Ia yields from SN Ia
4 SN Ia
Seitenzahl et al. (2013), and AGB yields from Karakas 10
10 4
(2010). The alternative Chempy yield set instead uses
CC-SN yields from Chieffi & Limongi (2004), SN Ia
Si net yield (M )
3
yields from Thielemann et al. (2003), and AGB yields 10
Si net yield (M )
10 3
from Ventura et al. (2013). In the following figures we
only show the Mg and Si yields from the default yield
set at Z = .0001 and Z = .01. The alternative yield set
exhibits similar trends as the default yield set, however
for the alternative yield set there is less of a difference
in the yields produced at the two metallicities.
First, the upper two panels of Figure 1 show the cumu-
4
lative Mg and Si yields as a function of time. The total 10
10 4
net yields summing the contributions from the three nu-
cleosynthetic channels, as well as the individual yields
from CC-SN and SN Ia, is shown. The AGB yields are
negligible compared to the SN yields and scale of the 100
100
Mg/Si
abundances are -0.13 dex (-1.17 < [Fe/H] < 0.61 dex) in stars with distances accurate to 5-10%, and 18,357 of
[Fe/H], 0.034 dex (-0.28 < [α/Fe] < 0.42 dex) in [α/Fe], these stars have age estimates from Ness et al. (2016).
0.050 dex (-0.48 < [Mg/Fe] < 0.61 dex) in [Mg/Fe], and The median value (min, max) of the chemical abun-
0.027 dex (-0.61 < [Si/Fe] < 0.59 dex) in [Si/Fe]. The dances for the RC sample are -0.13 dex (-0.90 < [Fe/H]
median (min, max) of the stellar atmosphere parameters < 0.51 dex) in [Fe/H], 0.029 dex (-0.14 < [α/Fe] < 0.40
are 2.46 (1.05 < log g < 3.72) in log g and 4780 K (3980 dex) in [α/Fe], 0.038 dex (-0.31 < [Mg/Fe] < 0.53 dex)
< Teff < 5800 K) in Teff . The median Galactocentric ra- in [Mg/Fe], and 0.032 dex (-0.41 < [Si/Fe] < 0.46 dex)
dius and distance from the disk midplane of the sample in [Si/Fe]. The median (min, max) of the stellar atmo-
are R = 9.47 kpc and |z| = 0.37 kpc, and the 3σ spatial sphere parameters are 2.45 (1.80 < log g < 3.13) in log g
extent spanned is 0.14 < R < 44.6 kpc and 0.00 < |z| and 4880 K (4190 < Teff < 5440 K) in Teff . As expected,
< 10.7 kpc. Lastly, the median (3σ range) of the actions the red clump stars a narrower range in log g and Teff
are 2050 kpc km s−1 (-390 < Jφ < 3570 kpc km s−1 ) in than the main sample described in Section 3.1.
Jφ , 1.51 (-0.81 < log(JR ) < 2.76) in log(JR ), and 0.94
3.2.2. HARPS solar twins
(-1.67 < log(Jz ) < 2.50) in log(Jz ).
Solar twin stars are main-sequence G dwarfs which
3.2. Additional datasets are spectroscopically similar to the Sun. A typical solar
We assume that the measured [Mg/Fe] and [Si/Fe] twin has an effective temperature within 100 K, surface
abundances of stars reflect their abundances at the time gravity log g within 0.1 dex, and bulk metallicity or
of birth. While stochastic effects, like binary interac- iron abundance [Fe/H] within 0.1 dex of the solar values
tions and planetary engulfment, might change abun- (e.g. Ramı́rez et al. 2014; Nissen 2015). As a result, the
dances over time, we expect these affects to be negligi- spectra of solar twins may be compared differentially
ble. Additionally, given the narrow temperature range to the solar spectrum with minimal reliance on stellar
of our star sample, we also assume that dust does not atmospheric models, yielding extremely high precision
differentially impact the measured abundances. (0.01 dex level) abundance measurements (Bedell et al.
To test that our empirical results are robust, we cor- 2014). Similarly precise spectroscopic parameters and
roborate our findings with two other samples: a sample therefore isochronal ages are also possible for these stars
of red clump stars (which are constrained in Teff and log (Ramı́rez et al. 2014; Spina et al. 2018).
g) and a sample of main sequence stars with chemical We use a set of 79 solar twin stars observed at high
abundance measurements from HARPS. In the follow- signal-to-noise and high resolution with the HARPS
ing sections we describe and motivate the use of each of spectrograph. These stars have highly precise ages de-
these datasets. rived from isochrone fits to their spectroscopic parame-
ters with an estimated uncertainty of 0.4 Gyr in Spina
3.2.1. APOGEE red clump stars
et al. (2018). Using the same spectra and parameters,
Red clump stars are low-mass, core helium-burning abundances for several α-elements including Mg and Si
stars. As discussed in Girardi (2016), the constancy were derived in Bedell et al. (2018) through a differential
of the core mass for these ∼1.5 M stars at the start equivalent width technique.
of the core helium-burning phase is what leads these The stars in this dataset are limited in both number
stars to “clump” to the same luminosity in the color- and scope: all are located within ∼ 100 pc of the Sun
magnitude diagram. For similar reasons, red clump stars and, by definition, all have approximately solar metal-
also span narrower ranges in stellar parameters such as licity. Despite these limitations, the precision of age
Teff and log g compared to red giant branch (RGB) stars. and abundance measurements in this sample makes it a
This property makes red clump stars a good check of valuable source of information. This sample also serves
our results, and if what we find is dependent on stellar as a test for the generalizability of our results to main-
atmosphere parameters. sequence stars.
The sample of red clump stars we use is from the
APOGEE Red-Clump (RC) Catalog, which is derived from 3.3. Soft clustering of low and high [α/Fe] stars
the main APOGEE stellar catalog based on the procedure To quantify differences in [Mg/Si] between stars with
developed in Bovy et al. (2014). The selection makes use solar α abundances and those with enriched α abun-
of both photometric and spectroscopic data, and identi- dances, we first separate the full APOGEE sample into
fies probable red clump stars based on their metallicity, a ‘low-α’ sequence and a ‘high-α’ sequence. We clus-
color, effective temperature, and surface gravity. This ter the subsample of APOGEE stars described in Section
selection results in minimal contamination from RGB ~ = {[α/Fe], [Fe/H]},
3.1 based on two input features, X
stars. The DR14 RC catalog contains 29,502 red clump which is the parameter space in which the two sequences
[Mg/Si] variations 7
are typically identified. Visually, the two proposed clus- P(z = high-α)
ters in this feature space are distributed anisotropically 0.4 0.0 0.5 1.0
and not cleanly separable, so we decide to achieve a soft
clustering through fitting a mixture model. 0.3
[α/Fe]
of the clustering to be sensitive to the initialization of 0.1
the component means, so we compute a 2D kernel den-
sity estimate (KDE) of the data and use the highest 0.0
density locations in each of the two sequences to ini-
tialize the means. After convergence, for each star we −0.1
record the component assignment (z) probability that
the star belongs to the high-α sequence P (z = high-α). −0.2
−1.00 −0.75 −0.50 −0.25 0.00 0.25 0.50
As seen in Figure 3, stars with high-α abundances are
assigned with high probability to the same mixture com- [Fe/H]
ponent, and stars with low-α abundances are assigned
with high probability to the other mixture component. Figure 3. Soft clustering of the APOGEE stars in the [α/Fe]-
As expected, stars with intermediate α abundances at a [Fe/H] plane. Each bin is colored by the mean assignment
fixed [Fe/H] are assigned a lower probability of belong- (z) probability of the stars contained within the bin, where
blue indicates a high probability of being assigned to the
ing to either the low or high-α sequence.
high-α sequence and red indicates a high probability of being
For the results throughout this paper, we use the as- assigned to the low-α sequence. The contours indicate the
signment probabilities to weight the stars when mak- density of stars (with levels indicated at 30, 100, 500, and
ing comparisons between the low and high-α sequence. 1500 stars).
Since 92% of the stars in our sample are assigned with
high probability to either component (i.e. P (z = high- as the of determination, r2 = 1 -
α.) = 0-5% or 95-100%), only a small fraction of stars 1
P standard coefficient 2
N σ2 i (ytrue,i - ypred,i ) , where ytrue and ypred are the
with more ambiguous component assignments are given true and model predicted values of the dependent vari-
less weight. able, N is the number of observations, and σ 2 is the
~ . An r2 score closer to 1 indicates that
variance of ytrue
3.4. Demonstration that [Mg/Si] does not the model predicts the variation in ytrue well, whereas an
simply trace [α/Fe] r2 score of 0 indicates that the model does not capture
In Section 1 we described the physical motivation for any of the variation.
looking at the ratio of magnesium to silicon. We now First we consider a linear model to predict [α/Fe].
motivate, in a quantifiable way, that there is additional Since the dimensionality of the input feature space is
information encoded in the ratio of Mg to Si beyond its low, we first implement unregularized ordinary least
relation to a global α abundance. We do this by building squares (OLS). The top row of Figure 4 shows the results
a model to predict the [α/Fe] abundances of stars using of the OLS model applied to the hold-out set. Using the
~
two sets of input features. In the first instance we use X individual Mg and Si abundances, we find that even this
= ([Fe/H], [Mg/Fe], [Si/Fe]), and in the second instance simple linear model predicts [α/Fe] quite well, with an
we use X~ = ([Fe/H], [Mg/Si]). The former set contains r2 = 0.95. However, the results are considerably when
worse using the input feature vector of X ~ = ([Fe/H],
the individual Mg and Si abundances, and the latter 2
contains just the ratio of the two. [Mg/Si]). The r is reduced to 0.59, and as seen in the
We split the APOGEE sample described in Section 3.1 figure, the model tends to over-predict the α abundances
into a training set (70%) and a hold-out set (30%). of low-α sequence stars, and under-predict the α abun-
The training set is used for model training and hyper- dances of high-α sequence stars.
parameter tuning. The hold-out set, which is data not One possibility that the [Mg/Si] abundance does not
seen during training or model selection, is used to eval- predict [α/Fe] well is that the relationship is not cap-
uate the performance of the final model. To select the tured by a linear model. To test this, we also train a
best model, with the training set we perform a grid vanilla feed-forward neural network (also referred to as
search over model hyper-parameters with a 10-fold cross- a multilayer perceptron (MLP)) using both sets of in-
validation, and ultimately select the model that results put features. We create a sequential MLP model and
in the best average r2 score. The r2 score is computed perform a small grid search over several hyperparame-
8 Blancato et al.
−0.1 r2 = 0.95 r2 = 0.59 and [Si/Fe] abundances do. What this suggests is that
−0.1 0.0 0.1 0.2 0.3 −0.1 0.0 0.1 0.2 0.3
0.3 there is additional information contained in the residu-
0.2 als of the prediction when using [Mg/Si], that is distinct
MLP
0.1
from the two elements’ relationship to a bulk α abun-
dance. However, this demonstration does not tell us
0.0
what the residuals from the [Mg/Si] prediction do trace.
−0.1 r2 = 0.95 r2 = 0.63 In this paper we investigate this information and seek
−0.1 0.0 0.1 0.2 0.3 −0.1 0.0 0.1 0.2 0.3 to understand what [Mg/Si] reveals about the chemical
[α/Fe]true
evolution of the Milky Way.
Figure 4. The prediction of [α/Fe] using two sets of in-
~ = ([Fe/H], [Mg/Fe], [Si/Fe]) (left) and X
put features, X ~ = 4. EMPIRICAL CHARACTERIZATION OF [MG/SI]
([Fe/H], [Mg/Si]) (right). In each panel, the predicted ver- 4.1. The [Mg/Si] abundance of the low- and high-α
sus true [α/Fe] abundances of stars in the hold-out set are sequences
shown, which is data not used during training and model se-
lection. The binning color indicated the dominate α class, We begin our investigation by first characterizing the
and the contours show the density of stars. The input vector ratio of Mg to Si for both the low and high-α sequences.
~ = ([Fe/H], [Mg/Si]) fails to predict the α-enhancement of
X The left panel of Figure 5 shows the [α/Fe]-[Fe/H] dis-
stars assuming both a linear model (OLS, top-row), and a tribution for the sample of ∼70,000 APOGEE stars de-
more-flexible model (MLP, bottom-row). scribed in Section 3, where the bins are colored by the
mean [Mg/Si] of the stars that fall within each bin.
ters including: the hidden layer size (5, 10, 25, 100), For the entire sample, the inner 90th percentile of the
the number of hidden layers (1, 2), the activation func- [Mg/Si] abundances spans nearly 0.2 dex from [Mg/Si]=
tion (ReLU, tanh), and the regularization α (5 values -0.05 - 0.14 dex, indicating that the stars have varying
from 10−3 to 103 ). As seen in the bottom row of Figure [Mg/Fe] and [Si/Fe] abundances. The median [Mg/Fe]
4, we find that a MLP model only performs marginally and [Si/Fe] values for the high-α sequence are 0.25 dex
better (r2 = 0.63) than the OLS model at predicting the and 0.1 dex, respectively. The median [Mg/Fe] and
[α/Fe] abundances of the hold-out set from X ~ = ([Fe/H], [Si/Fe] values for the low-α sequence are 0.035 dex and
[Mg/Si]). 0.015 dex, respectively. These differences in Mg and Si
To test if the dimensionality of the input feature vec- abundances lead to mean [Mg/Si] ∼ 0.096 dex for the
tor is driving the difference in performance between the high-α sequence and mean [Mg/Si] ∼ 0.02 dex for the
models trained with the two sets of features described low-α sequence.
above, we trained models with three additional sets of To quantify how the ratio of Mg to Si varies for
input features: (1) X ~ = ([Fe/H], [Mg/Fe]), (2) X ~ = stars within the low and high-α sequences, the right
~
([Fe/H], [Si/Fe]), and (3) X = ([Fe/H], average([Mg/Fe], panel of Figure 5 shows how [Mg/Si] varies with [α/Fe]
[Si/Fe])). Training with (1) we find the OLS r2 to be across the [α/Fe]-[Fe/H] plane. Here, we compute the
0.93 and the MLP r2 to be 0.93. Training with (2) we absolute value of the covariance between [Mg/Si] and
find the OLS r2 to be 0.74 and the MLP r2 to be 0.77. [α/Fe]⊥ in 8 [Fe/H] bins from -1.0 ≤ [Fe/H] ≤ +0.5
And training with (3) we find the OLS r2 to be 0.94 and dex, where [α/Fe]⊥ is measured perpendicular to a lin-
the MLP r2 to be 0.94. From this experiment we find ear class boundary separating the low and high-α se-
that for this specific problem, the difference in the di- quences. In computing the covariance for the high and
mensionality of the input feature vector only marginally low-α sequence separately, we weight the stars accord-
effects the prediction performance because [Mg/Fe] more ing to their cluster assignment probabilities as described
strongly traces [α/Fe] than [Si/Fe]. in Section 3.3. As seen in Figure 5, [Mg/Si] and [α/Fe]
In the end, using both a simple linear model and a jointly vary more strongly for low-α sequence stars than
more flexible non-linear model, the individual [Mg/Fe] for high-α sequence stars across nearly the entire range
and [Si/Fe] abundances predict the α-enhancements of of metallicities. The absolute value of the covariance for
[Mg/Si] variations 9
[Mg/Si]
0.40
0.4 0.05 0.00 0.05 0.10 high-↵
0.35
low-↵
|COV([Mg/Si], [↵/Fe]?)|
0.3
0.30
0.2 0.25
[↵/Fe]
0.20
0.1
0.15
0.0
0.10
0.1
0.05
0.2 0.00
1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.8 0.6 0.4 0.2 0.0 0.2 0.4
[Fe/H] [Fe/H]
Figure 5. Left: The [α/Fe]-[Fe/H] distribution for the ∼70,000 stars in our APOGEE subsample. The bins are colored by the
[Mg/Si] abundance, while the contours represent the density of stars. Here, we see that the high-α sequence stars have higher
[Mg/Si] abundances than low-α sequence stars. Right: The absolute value of the covariance between [Mg/Si] and [α/Fe]⊥ as a
function of metallicity for the low- (red) and high- (blue) α sequences. In the low-α sequence, [Mg/Si] and [α/Fe]⊥ jointly vary
more strongly than in the high-α sequence.
the high-α sequence never surpasses ∼0.2, whereas for to build intuition for how the production of Mg rela-
the low-α sequence it reaches ∼0.4 at [Fe/H] ∼ 0.2 dex. tive to Si varies both through time and in different star-
This shows that the ratio [Mg/Si] behaves differently formation environments.
in the low and high-α sequences, which suggests that As described in Section 3.1, we use stellar age esti-
this ratio could be probing differences in the chemical mates from Ness et al. (2016), which are derived from
enrichment histories of the two sequences. CN absorption lines and are accurate to ∼40%. Con-
Before continuing, we corroborate the finding, that the sidering these uncertainties, we divide our sample of
high-α sequence has an excess of Mg relative to Si com- APOGEE stars into three wide age bins, roughly separat-
pared to the low-α sequence, with both the APOGEE red ing young stars born 0 - 3 Gyrs ago, intermediate-aged
clump sample described in Section 3.2.1 and the HARPS stars born 3 - 7 Gyrs ago, and old stars born 7 - 14
sample described in Section 3.2.2. First, to obtain clus- Gyrs ago. This binning results in 561 young stars, 3,981
ter membership probabilities for both of these datasets, intermediate-aged stars, and 8,254 old stars in the high-
we apply the same Gaussian mixture model trained on α sequence (P(z = high-α) > 0.95), and 19,957 young
the full APOGEE sample. For the red clump stars, the stars, 23,685 intermediate-aged stars, and 8,747 old stars
high-α sequence is found to have a mean [Mg/Si] of 0.07 in the low-α sequence (P(z = high-α) < 0.05).
dex, while the low-α sequence has a mean [Mg/Si] of Figure 6 shows the mean [Mg/Si] abundance in each
0.003 dex. And for the HARPS stars, the mean [Mg/Si] age bin, for both the low and high-α sequences. As ex-
of the high-α sequence is 0.07 dex and the mean [Mg/Si] pected from Section 4.1, in each bin the high-α sequence
of the low-α sequence is 0.01 dex. These differences are has a higher mean [Mg/Si] than the low-α sequence. It
similar to what is found for the full APOGEE sample. Af- should be noted that while the differences in the mean
ter verifying that the [Mg/Si] trends are robust to log g [Mg/Si] values are significant, specifically the standard
and Teff variations, and are also present in a sample of errors on the means are smaller than the size of the sym-
stars with higher quality spectra, in the following sec- bols in the figure, the 1σ dispersions around the mean
tion we explore how [Mg/Si] evolves with stellar age and of the distributions (indicated in grey) do overlap. So,
metallicity. while the distributions peak at different values, they are
not completely disparate. The takeaway of Figure 6 are
4.2. [Mg/Si] trends with age and metallicity the trends we observe with age and the mean [Mg/Si].
Without yet taking metallicity into consideration, we
To further examine differences in the [Mg/Si] abun-
notice two things. The first is that stars born earlier in
dance between the low and high-α sequences, we now
time (older ages) are more enhanced in Mg relative to
consider variations with other stellar properties avail-
Si than stars born at later times (younger ages). This
able to us, including ages and [Fe/H]. Investigating cor-
is consistent with the relative theoretical Mg/Si yields
relations with these additional parameters will allow us
10 Blancato et al.
[Fe/H] 0.05 < [Fe/H] < 0.05 dex), and a metal-rich bin (0.25
0.15 −0.3 −0.2 −0.1 < [Fe/H] < 0.35 dex).
In each metallicity bin, we see that the [Mg/Si] abun-
dance of the low-α sequence increases with stellar age.
0.10
This suggests that the general trend of the relationship
[Mg/Si]
0.20 -0.55 < [Fe/H] < -0.45 -0.05 < [Fe/H] < 0.05 0.25 < [Fe/H] < 0.35
0.16
0.12
[Mg/Si]
0.08
0.04
0.00
−0.04 high-α
−0.08
low-α
0 2 4 6 8 10 12 0 2 4 6 8 10 12 0 2 4 6 8 10 12
age (Gyrs)
Figure 7. [Mg/Si] abundance as a function of stellar age in three narrow metallicity bins of 0.1 dex. The left panel shows a
metal-poor bin, the middle panel shows a solar-metallicity bin, and the right panel shows a metal-rich bin. In each panel, the
high-α sequence is shown in blue, and the low-α is shown in red. The light grey error bars represent the 1σ standard deviation
of [Mg/Si] in each age bin, while the black error bars represent the error on the mean.
lation from SN Ia pollution. A possible mechanism for each Monte Carlo iteration every star’s coordinates are
this increase in Mg enrichment is if high-α sequence stars re-sampled from:
formed according to an IMF with a high-mass end slope
that became flatter over time. This would produce rel- zi ∼ N (|z|, zerr ) (1)
atively more high-mass stars which, as discussed in Sec-
tion 1, would yield more Mg than Si. We attempt to
Ri ∼ N (R, Rerr ) (2)
explore these possibilities in Section 5 through galactic
chemical evolution modeling. where the coordinates are described as normal distri-
butions centered at the Sanders & Das (2018) derived
4.3. Spatial and orbital trends with [Mg/Si]
values, with a variance equal to the reported errors. Af-
4.3.1. Trends with Galactic location ter sampling zi and Ri for each star, the stars are re-
Lastly, we now turn towards empirically characterizing binned by (|z|, R) and the mean [Mg/Si] and [Fe/H]
the relationship between [Mg/Si] and disk structure and values of each bin are recorded. Repeating this proce-
dynamics. We do this by establishing how the ratio Mg dure N=103 times, we obtain the median and range of
to Si varies with location throughout the disk, and by in- the mean [Mg/Si] and [Fe/H] values in each bin.
vestigating how the stellar actions are related to [Mg/Si]. The top panel of Figure 8 shows the low and high-
By making this connection between the chemistry and α sequence [Mg/Si]-[Fe/H] distributions in the different
the structural and orbital properties of stars, we hope to (|z|, R) bins, as well as the results of the Monte Carlo
understand how [Mg/Si] might encode unique informa- sampling simulation described above. In the Figure, the
tion regarding the formation and build-up of the Milky displayed contours are based on the mean posterior dis-
Way disk. tances to each star, while the error bars represent the
To start, we consider how [Mg/Si] varies with Galacto- results of the Monte Carlo sampling. First we examine
centric radius (R) and distance from the disk midplane the high-α sequence. At all distances from the midplane,
(|z|). As discussed in Section 3.1, we match our sample the [Mg/Si]-[Fe/H] distribution evolves similarly from
of APOGEE stars with the Sanders & Das (2018) catalog the inner disk to the outer disk, where the peak of the
to obtain the coordinates of each star. Inspired by Fig- distribution appears to increase marginally in [Mg/Si]
ure 4 of Hayden et al. (2015), we divide the sample into abundance from the inner-disk (3 kpc) to the mid-disk (7
three |z| bins: 0 - 0.5 kpc, 0.5 - 1.0 kpc, and 1.0 - 2.0 - 9 kpc), and then decreases towards the outer-disk (15
kpc. Each of these three bins in |z| is further divided kpc). However, in the outermost region of the disk con-
into six radius bins, ranging from 3 to 15 kpc in 2 kpc sidered (13 - 15 kpc) the distance errors combined with
wide annuli. To quantify the impact of each star’s dis- the limited sample size significantly impact the binning.
tance measurement uncertainty on the (|z|, R) binning, Contrary to high-α sequence, the low-α sequence dis-
we perform a Monte Carlo simulation. In short, during plays more significant spatial [Mg/Si]-[Fe/H] trends. As
12 Blancato et al.
seen in bottom row of the top panel of Figure 8, close with radius until R ∼ 15 kpc. For the stars closest to
to the disk mid plane (|z| = 0 - 0.5 kpc) the peak of the the disk midplane (|z| < 0.5 kpc), the mean [Mg/Si] is
[Mg/Si] distribution decreases from ∼0.02 dex in the in- nearly constant from R ∼ 3 - 9 kpc, and then decreases
ner part of the Galaxy to ∼-0.01 dex in the outer regions. until R ∼ 15 kpc.
While this gradient in [Mg/Si] with radius is weak, it is The main takeaways from this exploration of how
significant compared to the stellar distance uncertain- [Mg/Si] varies throughout the disk are summarized as
ties and sample sizes of each bin. A similar trend with follows. First, from the top panel of Figure 8 we learn
radius is seen farther from the midplane. This figure that similar to how the distribution of [α/Fe]-[Fe/H]
shows that the low-α sequence stars currently residing varies with location in the disk (as in Figure 4 of Hay-
in the inner region were formed from gas that was more den et al. (2015)), the distribution of [Mg/Si]-[Fe/H]
enriched in Mg relative to Si compared to the stars cur- also varies with Galactic coordinates. We examine
rently residing in the outer disk. the high and low-α sequence stars separately, and ob-
The bottom panel of Figure 8 summarizes the mean serve that while their [Mg/Si]-[Fe/H] distributions sig-
trends seen in the top panel of Figure 8, for both the low- nificantly overlap, there are small differences between
and high-α sequences. The [Mg/Si]-radius relationship how the distributions of the two sequences change with
is shown for each bin in |z| and the marker color indicates R and |z|. From the bottom panel of Figure 8 we learn
the mean [Fe/H]. The shaded grey regions indicate the that the large number of stars in each bin affirms that
1σ distribution, the black error bars indicate the stan- the differences between the two α sequences in their
dard error on the mean (SEM), and the grey error bars mean [Mg/Si]-spatial trends are significant.
indicate the 3σ range from the Monte Carlo simulation The trends seen with mean [Mg/Si] abundance and
described above. While the information shown in this |z| for the low-α sequence stars could suggest that the
figure is the same as the information shown in the top low-α sequence part of the disk was built both up and
panel of Figure 8, this representation of the data better out from layers of gas that had varying overall [Mg/Si]
lends itself to visualizing the relevant trends. First con- normalizations that scale with |z|, but similar gradients
sider the high-α sequence. For these stars we see that with respect to radius. On the other hand, the high-α
the mean [Fe/H] decreases from the inner- to outer-disk, sequence trends suggest that the gas that formed these
and that the trend with mean [Mg/Si] and radius is sim- stars was similarly enriched in Mg and Si at all distances
ilar regardless of distance from the disk midplane. Here, from the disk midplane. These observations are consis-
we see more clearly that high-α sequence stars in the tent with how the ages of stars vary throughout the disk
intermediate disk (at ∼8 kpc) have the highest [Mg/Si] for both the low- and high-α sequences. Considering the
abundances on average (∼ 0.11 dex) compared to the av- same spatial bins defined in Figure 8, at every |z|, low-α
erage [Mg/Si] abundances of stars in the inner- (∼0.07 sequence stars that reside closer to the Galactic center
dex) and outer-disk (∼0.06 dex). This trend of an aver- are older than their counterparts at larger radii. And at
age [Mg/Si] which first increases from ∼3 kpc to ∼8 kpc, fixed R, low-α sequence stars that reside closer to the
and then decreases from ∼8 kpc to ∼15 kpc, is robust disk midplane are younger than the stars that at larger
to the standard errors on the mean (which span lengths heights from the midplane. However, for the high-α se-
smaller than the height of the markers in the figure). quence stars, we find little to no trend with stellar age
The low-α sequence stars exhibit markedly different and Galactic coordinates.
behavior with mean [Mg/Si] and (|z|, R). As seen in the Aside from their different trends with |z|, the mean
bottom panel of Figure 8, while the shape of the [Mg/Si]- [Mg/Si] of the high and low-α sequences also peak at dif-
radius curve is similar for all bins in |z|, at each radius ferent radii, with the low-α sequence stars having higher
stars that reside farther from the disk midplane have [Mg/Si] abundances more towards the inner disk and the
higher [Mg/Si] abundances compared to the stars that high-α sequence stars having higher [Mg/Si] abundances
reside closer to the disk midplane. For instance, in the R in the intermediate disk. These differences must be re-
= 5 - 7 kpc bin, low-α stars residing at |z| = 0 - 0.05 kpc flective of the time sequence of star-formation, and of
have an average [Mg/Si] abundance of 0.03 dex, while radial migration effects. Low-α sequence stars largely
the stars residing at |z| = 1 - 2 kpc have an average form in the disk midplane where there are gradients in
[Mg/Si] abundance of 0.076 dex. Compared to the high- [Mg/Si] and [Fe/H]. As time goes on, these stars form
α sequence stars, the shape of the mean [Mg/Si]-radius at progressively larger radii. On the other hand, high-α
trends are also different. For |z| > 0.5 kpc, the stars with sequence stars form more towards the central regions of
the highest [Mg/Si] abundances on average are located the Galaxy with higher and clumpier efficiencies. Since
at R ∼ 6 kpc, and then the mean [Mg/Si] decreases high-α sequence stars are generally older, they presum-
0.10
[Mg/Si] variations 13
90
1.0 < |z| < 2.0 kpc high-↵ sequence low- ↵ sequence
0.3
3 < R < 5 kpc 5 < R < 7 kpc 7 < R < 9 kpc 9 <92R < 11 kpc 11 < 0.08
R < 13 kpc 13 < R < 15 kpc
0.1
94
0.1
N = (307, 49) N = (514, 87) N = (2321, 1269) N = (496,
96 904) N = (138, 905) N = (27, 308)
[Mg/Si]
1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5
98
0.5 < |z| < 1.0 kpc 0.06 1.0 < |z| < 2.0 kpc
0.3 0.3
3 < R < 5 kpc 5 < R < 7 kpc 7 < R < 9 kpc 3 9<<
RR< 5<kpc
11 kpc 5 < R11< <
7 kpc
R < 137kpc
< R < 13
9 kpc
< R <915
<R < 11 kpc
kpc 11 < R < 13 kpc 13 < R
100
[Mg/Si]
0.1
N = (307, 49) N = (514, 87) N = (2321, 1269) N = (496, 904) N = (138, 905) N = (27,
0.1
N = (357, 225) N = (500, 525) N = (1704, 4121) 1.0 (424,
N= 0.5 3290)
0.0 0.5 1.0N =0.5(160,
0.0 0.5
2819) 1.0 0.5
N =0.0(49,0.5
1282) 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0
1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5
0.125 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5
0.5 < |z| < 1.0 kpc
0.3
3 < R < 5 kpc 5 < R < 7 kpc
0.04 7 < R < 9 kpc 9 < R < 11 kpc 11 < R < 13 kpc 13 < R
0.0 < |z|0.100
< 0.5 kpc
[Mg/Si]
0.3
3 < R < 5 kpc 5 < R < 7 kpc 7 < R < 9 kpc0.1 9 < R < 11 kpc 11 < R < 13 kpc 13 < R < 15 kpc |z| =
0.075
[Mg/Si] 1 - 2 kpc
0.1 0.1
0.050
N = (357, 225) N = (500, 525) N = (1704, 4121) N = (424, 3290) N = (160, 2819) 0.5 -N1= kpc
(49,
1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0 0.5
- 0.5 1.0kpc
0.1
N = (306, 921) N = (484, 2460) N = (1296, 10520) 0.025N = (630, 14144) N = (284, 9978) 0.0N<= |z|
(79, < 0.5 kpc
2589)
0.3
1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 1.0 0.5 0.0 0.5 3 <1.0
R <0.5
5 kpc
0.0 5<R<
0.5 1.07 kpc
0.5 0.0 7 <
0.5R < 9 1.0
kpc 0.5 90.0
< R 0.5
< 11 kpc 11 < R < 131kpc 13 < R
- 2 kpc
0.02
0.000 0.5 - 1 kpc
[Fe/H] 0.1 0 - 0.5 kpc
0.025
0.1
N = (306, 921) N = (484, 2460) N = (1296, 10520) N = (630, 14144) N = (284, 9978) N = (79,
0.125 1.0 40.5 0.0 0.5 6
1.0 0.5 0.0 0.5 8 1.0 0.5 0.0 10
0.5 1.0 0.5 0.0 120.5 1.0 0.5 14
0.0 0.5 1.0
[Fe/H]
0.100
|z| = 0.3 0.1 0.1
0.075 1 - 2 kpc0.00
[Mg/Si]
4 6 8 10 12 14 16
radius (kpc
radius (kpc)
Figure 8. Top: The distribution of high-α (grey) and low-α (blue) sequence stars in the [Mg/Si]-[Fe/H] plane as a function
of Galactocentric radius (R) and distance from the disk midplane (|z|). Each row shows the distribution in 2 kpc wide radius
bins, from the inner-disk (left) to the outer-disk (right). The bottom row shows the stars closest to the disk midplane from
|z| = 0 − 0.5 kpc, the middle row shows stars from |z| = 0.5 − 1 kpc, and the top row shows stars farthest from the midplane
at |z| = 1 − 2 kpc. In each panel the error-bars represent the range of the mean [Mg/Si]-[Fe/H] values based on 103 Monte
Carlo samplings of R and |z|. The number of stars in each bin is indicated as N = (# high-α stars, # low-α stars). Bottom:
This figure summarizes the trends shown in the panel above, showing the mean [Mg/Si] as a function of radius for each R,|z|
bin. The black error bars represent the standard error on the mean (SEM), the grey error bars represent the 3σ range from
the Monte Carlo simulation, and the grey bands represent the 1σ range of the [Mg/Si] distribution. Here, trends with [Mg/Si],
[Fe/H], and position within the Galaxy are more apparent. For high-α sequence stars (triangles) the peak [Mg/Si] occurs at an
intermediate disk radius of ∼ 8 kpc, regardless of height from the disk midplane. For low-α sequence stars (circles), the peak
[Mg/Si] occurs more towards the inner-disk at < 6 kpc, and in each radius bin stars at larger distances from the disk midplane
are more enhanced in [Mg/Si] than stars residing closer to the disk midplane.
14 Blancato et al.
ably have had more time to migrate both vertically and lower Jφ values (residing in the inner disk) formed at
radially. Distinct radial migration patterns, for example early times, and since then the radial size of the disk
caused by different dynamical effects of the Galactic bar was built up from progressively younger stars.
on the low- versus high-α sequence populations, could We also find the high and low-α sequences to be dis-
also potentially result in the trends we observe. These tinct in radial action, JR , and vertical action, Jz . As
broad speculations should be revisited in the context of seen in the middle panel of the top row of Figure 9, in
a more rigorous modeling approach. every age bin the high-α sequence stars exhibit larger
JR values than the low-α sequence stars. This trend is
4.3.2. Trends with orbital properties robust to the standard errors on the mean, however the
distributions do overlap within 1σ. What these trends
Aside from considering location within the Galaxy, we
with JR suggest is that at all ages, high-α sequence stars
also characterize the relationship between [Mg/Si] and
are on more eccentric orbits than low-α sequence stars,
stellar orbital properties. We do this by examining how
and that for both sequences older stars are described by
[Mg/Si] varies with the three actions, Jφ , JR , and Jz .
more eccentric orbits than younger stars. One explana-
As mentioned in Section 3.1, we obtain actions for our
tion for the relationships between radial action and age
APOGEE sample from Sanders & Das (2018), who adopt a
is that older stars have had more time for their orbits
McWilliam et al. (2013) Milky Way potential to compute
to be perturbed to more eccentric orbits, and this per-
Jφ , JR , and Jz from Gaia astrometry using the Stäckel
turbation occurs regardless of a star’s α-enhancement.
Fudge method (Sanders & Binney 2016). Assuming an
The observed trends with Jz are similar to those with
axisymmetric and relatively time-independent potential,
JR . As seen in the figure, at every age the high-α se-
the three actions uniquely define a stellar orbit. Concise
quence stars are described by larger vertical actions than
physical explanations of the three actions are given in
the low-α sequence stars. This trend also evolves with
Trick et al. (2019). The azimuthal action, Jφ , quantifies
age for both sequences, where older stars have higher Jz
a star’s rotation about the center of the Galaxy. This
values than younger stars. These trends are robust to
quantity is the same as angular momentum in the verti-
the standard errors on the mean, and the distributions
cal direction, and throughout this paper we refer to az-
between the two sequences are distinct by at least 1σ.
imuthal action of a star as its angular momentum. The
The differences in the Jz values of the low and high-
radial action, JR , describes the amount of oscillation a
α sequence stars is reflective of their general location
star exhibits in the radial direction, which is related to
within the Galaxy, where low-α sequence stars are more
the eccentricity of the orbit. Stars with JR = 0 are on
confined to the disk midplane and high-α sequence stars
circular orbits. And lastly, the vertical action, Jz , mea-
mostly comprise the “thick”, vertically extended disk of
sures the excursion of a star in the vertical direction,
the Milky Way.
where Jz = 0 indicates that a star is confined to the
We speculate why high-α sequence exhibit higher Jz
disk midplane.
and JR values than low-α sequence stars at every age.
To begin, the top row of Figure 9 shows how the ac-
One possibility is that because the low- and high-α se-
tions of high and low-α sequence stars differ across stel-
quences form in different spatial locations (and presum-
lar ages. Similar to what Gandhi & Ness (2019) report
ably from different reservoirs of gas), they subsequently
for stars with LAMOST abundances, we find that the low
experience distinct modes of star-formation that result
and high-α sequences are distinct in their actions at all
in different initial orbital properties. An additional pos-
ages. First considering angular momentum, we see that
sibility is that heating processes may be more active in
low-α sequence stars have higher Jφ values than high-α
the thicker, inner disk where high-α sequence stars re-
sequence stars at each age. This separation between the
side, which perturbs these stars to higher radial and ver-
two sequences is robust to the standard errors on the
tical excursions. Lastly, dynamical times are shorter in
mean Jφ in each age bin, and at most ages the distri-
the inner disk. So the dynamical ages of high-α stars
butions are separated by at least one standard devia-
are generally older than their stellar ages, which means
tion. The separation in Jφ between the two sequences is
they would have more time to be perturbed to higher Jz
consistent with the notion that the low-α sequence com-
and JR values.
prises a more radially extended disk, while the high-α
Considering the evolution with age, as with the radial
sequence is confined closer to the Galactic center. Since
action, older stars have had more time to be perturbed
Jφ traces radial location, these trends are also reflective
to larger vertical excursions. However, while the Jz val-
of a gas disk where star-formation proceeded outwards
ues of low-α sequence stars increase steadily with stel-
over time with a radial gradient in chemical abundances.
lar age, the Jz values of high-α sequence stars remain
This is inside-out formation of the disk, where stars with
[Mg/Si] variations 15
J log(JR) log(Jz )
2
2600 p p p
Lz 2 Jr Jz
220051 14
10
48 12 1
180045 8
10
42 6
1400 8
39 1
36 6 0 4
1000
330 2 4 6 8 10 12 14 04 2 4 6 8 10 12 14 02 2 4 6 8 10 12 14
30 2
2600 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14
2
220051 14 10
48 12 8
1
180045
10 6
42
1400 8 4
39 1
36 6 0 2
1000
330 2 4 6 8 10 12 14 04 2 4 6 8 10 12 14 0 0 2 2 4 4 6 6 8 8 10 10 12 12 14 14
30
high-↵ seq. age (Gyrs)
0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 0.02 0.00 0.02 0.04 0.06 0.08
low-↵ seq.
high ↵ seq. age (Gyrs) [Mg/Si]
low ↵ seq.
Figure 9. Top row : The evolution of orbital actions (Jφ , JR , Jz [kpc km s−1 ]) with stellar age for low and high-α sequence
stars. For 16 age bins, the average action value is shown and in grey the 1σ standard deviation is indicated. At every age, the
actions of high and low-α sequence stars are distinct. Bottom row : The low-α sequence action-age relations now divided into
stars with lower versus higher [Mg/Si] abundances. For Jφ , part of the scatter in the relation with age is described by a gradient
in [Mg/Si], where stars with lower [Mg/Si] values have higher angular momentum than stars with higher [Mg/Si] values.
nearly constant from 4 - 14 Gyrs. This suggests that α- a small fraction of their time near the disk midplane that
enhancement could be a signature of initial orbital prop- GMC scattering cannot further heat these stars.
erties, as well as the dynamical processes that perturb Given the relationships revealed in Section 4.2 be-
orbits over time. The behavior of the low-α sequence tween [Mg/Si] and stellar age, the bottom row of Fig-
can be broadly understood in the context of recent work ure 9 shows the [Mg/Si] abundance of low-α sequence
by Ting & Rix (2018), who explore the observed age de- stars separated into two bins. Divided based on the
pendence of Jz for low-α APOGEE red clump stars with mean [Mg/Si] value of all the low-α sequence stars, one
ages < 8 Gyrs. Ting & Rix (2018) find that a simple an- bin contains the stars with lower [Mg/Si] abundances
alytic model of vertical heating can describe the trends and the other contains the stars with higher [Mg/Si]
in the data. They posit that heating is dominated by or- abundances. As seen in the figure, for the radial and
bit scattering, presumably from giant molecular clouds vertical actions, at every age there is little to no dy-
(GMCs), with Jz ∝ t1/2 . They also take into account namical separation between stars with low versus high
how the exponentially declining SFR of the Galaxy re- [Mg/Si] enrichment. However for angular momentum, a
sults in the decline of GMC occurrence over time, and gradient with [Mg/Si] partly constitutes the scatter in
that the disk mass density decreases with time as well. the Jφ -age relationship. At every age, stars with lower
However, they do not consider high-α sequence stars. [Mg/Si] abundances have higher angular momenta than
We find in Figure 9 that unlike the age-Jz relationship stars with higher [Mg/Si] abundances. Due to the large
for the low-α sequence, the high-α age-Jz relationship number of stars in the sample, this trend in the mean
saturates at some stellar age. This saturation matches [Mg/Si] is robust to the standard errors. For the high-α
our physical expectation that, at a certain point, al- sequence, we find no separation in any action-age re-
ready dynamically hot high-α sequence stars spend such lationship with [Mg/Si] abundance. The trends with
Jφ -[Mg/Si] and age for the low-α sequence reflect the
16 Blancato et al.
chemical segregation of the star-forming thin disk, at riched from the products of star-formation, is exchanged
a given time. The absence of a trend with JR and Jz between the gas reservoir and the star-formation site.
suggest that heating and/or migration effects act simi- As briefly described in Section 2, Chempy is a recently
larly on mono-age stellar populations at the radii they developed code that allows for Bayesian inference of
reside. For the high-α sequence, the additional absence GCE model parameters. The publicly available Chempy
of a trend with Jφ is reflective of the lack of chemical model presented in Rybizki et al. (2017) is a one-zone
segregation in the thicker disk in which high-α stars were GCE with seven free parameters. This includes three
generally born. These observations support the notion parameters related to stellar physics, and four parame-
that the low- and high-α sequences experience different ters the describe the ISM. The stellar physics parameters
modes of enrichment. include: the high-mass slope of the IMF, a normaliza-
As we’ve done throughout Section 4, large samples tion constant for the number of exploding SN Ia, and
of stars with high quality spectra and astrometric mea- the SN Ia enrichment time delay. The ISM parameters
surements allow us to empirically characterize the re- include: a parameter which sets the star-formation effi-
lationship between chemical abundances, stellar ages, ciency, the peak of the star-formation rate, the fraction
and the dynamical properties of stars. However, the of stellar yields which outflow to the surrounding gas
physical origins of these connections between chemistry, reservoir, and the initial mass of the gas reservoir. With
age, and dynamics remains an open question in Milky these parameters that specify the GCE, Chempy com-
Way science. Nonetheless, empirical investigations can putes the enrichment of the ISM through time based on
place constraints on: theoretical expectations, trends a set of yield tables. These yield tables describe stellar
observed in simulations of Milky Way galaxies, and on feedback from three nucleosynthetic channels: CC-SN,
Galactic chemical evolution models. For the remainder SN Ia, and AGB stars. The CC-SN and AGB yields are
of this paper, we now shift our focus and attempt to mass and metallicity-dependent, and in the model these
understand the [Mg/Si] abundance trends we report by yields are ejected immediately following stellar death.
connecting the observed abundance patterns to under- SN Ia feedback is independent of mass and metallicity,
lying stellar physics. with the yields being deposited into the ISM accord-
ing to the Maoz et al. (2010) delay time distribution.
5. CHEMICAL EVOLUTION MODELING As we’ll describe in Section 5.2, with a list of observed
5.1. Motivation stellar abundances, a stellar age estimate, and choice of
yield tables, Chempy can be used to obtain posterior
In Section 2 we established a theoretical motivation for
distributions for the GCE model parameters.
examining the ratio of magnesium to silicon. In Section
We present the results of our Chempy modeling in the
3.4 we used a large sample of stars with abundance mea-
following sections. Our approach is to fit each star’s
surements from APOGEE to show that the [Mg/Si] abun-
abundance pattern with its own one-zone GCE model,
dance contains different information than the individ-
which is in contrast with approaches that fit a single
ual [Mg/Fe] and [Si/Fe] abundances. Then, in Section
model to either the abundance pattern of multiple stars,
4 we characterized how the [Mg/Si] abundance varies
or a mean abundance pattern of many stars. However,
between the low- and high-α sequences, with age and
we caution that ultimately we encounter problems in
[Fe/H], with location in the Galaxy, and with the or-
reproducing the APOGEE abundances, which limits the
bital actions. After establishing these empirical [Mg/Si]
scope and interpretation of the fits. The challenges we
trends, we now seek to understand these trends by inves-
come across are discussed in detail in Section 5.4, but
tigating the physical origin of the observed magnesium
in short we find that the inferred Chempy model pa-
and silicon abundances.
rameters result in predicted abundances that are unable
One way observed abundances can be linked to a phys-
to match the observed abundance patterns, particularly
ical origin is through galactic chemical evolution mod-
the [Si/Fe] abundance. Presumably, this failure is due
eling. Broadly, the goal of galactic chemical evolution
to the inability of current yield tables to describe the
(GCE) modeling is to employ physically motivated mod-
abundance patterns of stars that exhibit a diversity of
els of star-formation and stellar evolution together with
enrichment histories.
a parametrization of galactic inter-stellar medium (ISM)
physics to predict observed abundance patterns of stars
throughout cosmic time. The simplest form of GCEs 5.2. Fitting APOGEE stellar abundances
involve a single “zone” that instantiates a site of star- Given the limitations we encounter when using
formation surrounded by a reservoir of gas. Through Chempy to model observed APOGEE abundance patterns,
inflow and outflow processes, primordial gas, and gas en- we decide to narrow the scope of this study by focusing
[Mg/Si] variations 17
5.3. Results
space, and by comparing the current evaluation to the
evaluation at the previous step, the proposed step is ei- We now present the results of the Chempy GCE mod-
ther accepted or rejected with some probability. In the eling for the mono-age, mono-abundance stellar samples
context of Chempy, the sampling routine is summarized described above. As an example of the full posterior dis-
as follows. During each walkers’ steps, the current pa- tributions obtained, Figure 13 in the Appendix shows
rameter configuration is used to evaluate the log prior. the trace plots and joint posterior distributions for two
Then, keeping track of the chemical abundances at each stars that we fit: a solar-metallicity, 5 Gyr-aged star in
timestep, the Chempy GCE model is run with these pa- the low-α sequence, and a metal-poor, 9 Gyr-aged star
rameter values from t = 0 to the provided estimated in the high-α sequence. Details regarding these partic-
time of stellar birth. The log likelihood is then com- ular fits are discussed in the Appendix. While there
puted by summing the square difference between the are potentially interesting differences between the pos-
observed and model abundances, weighted by the obser- teriors of the two stars, these differences must be inter-
vational uncertainties. The posterior is then evaluated preted in the context of how well the models describe
by summing the log prior and log likelihood. the data. To examine this we use the inferred parame-
For each of the stars we fit, we initialize the MCMC ter values to run instances of the Chempy GCE model
routine with 28 walkers. The starting locations of the and predict the stellar abundance patterns at the time
walkers are typically set to be confined to a tight 4- of stellar birth. We then compare these predicted abun-
dimensional ball centered on the prior means, and after dances to the observed abundances that were used to
several iterations the walkers explore the larger, relevant infer the posteriors. This approach to model evaluation
parameter space. Instead of running each MCMC in- is similar to posterior predictive checking (Gelman &
stance for a pre-defined number of steps, we monitor the Stern 1996). Figure 10 shows the observed and predicted
convergence of the chains in real-time and terminate the abundances for both of the stars included in Figure 13.
runs once some convergence criterion is satisfied. The The observed abundances are the [Fe/H], [Mg/Fe], and
commonly used Gelman-Rubin statistic is not appro- [Si/Fe] measurements from APOGEE that were passed to
priate for chains produced with emcee since the chains the likelihood calculation, and the predicted abundances
are not independent of one another. Instead, Foreman- are based on a Chempy model run using parameter val-
Mackey et al. (2013) suggest assessing convergence by ues from each star’s posterior distributions. The hori-
computing the integrated autocorrelation time of the zontal markers indicate predicted abundances based on
the 50th percentile posterior values, while the vertical
[Mg/Si] variations 19
range indicates the predicted abundance range consid- same [Fe/H], age, and α sequence bin, which assumes
ering Chempy models with the 16th and 84th percentile that individual posterior probability distributions are
values of the parameter posteriors. independent of one another. Given that the stars have
As seen in the figure, the [Fe/H] and [Mg/Fe] abun- similar properties modulo any differences in [Mg/Fe] and
dances of the 5 Gyr-aged solar-metallicity star are well [Si/Fe], the combined posteriors more strongly constrain
predicted by the model, falling nearly within the ob- the parameters than the individual posteriors.
servational uncertainty ranges. However, the [Si/Fe] First consider the αIMF parameter, which is the GCE
abundance is not reproduced as well. The model over- parameter we find to be most constrained by the data.
predicts the silicon abundance by nearly 0.2 dex, and For all the stars that we fit, the posteriors appear to
the observed abundance is not within the model’s 1σ disfavor IMF slopes that are more top-heavy than αIMF
posterior range. The predicted abundances for the 9 > -2.25. Additionally, for both the solar-metallicity
Gyr-old metal-poor star exhibit similar discrepancies and metal-poor stars, we find a weak trend between
with respect to the observed abundances. As seen in the inferred αIMF and stellar age. Younger stars are
the bottom panel of Figure 10, the Chempy model run found to have IMF slopes described by a more bottom-
with the 50th percentile posterior values results in pre- heavy IMF, whereas older stars are described by more
dicted abundances that agree or nearly agree with the Chabrier-like IMF slopes. At each age, there is a fur-
observed [Fe/H] and [Mg/Si] abundances. But, simi- ther trend with αIMF and [Fe/H]. Metal-poor stars are
lar to the solar-metallicity star, the range of predicted found to be described by a more bottom-heavy IMF
[Si/Fe] abundance is disparate from the observed [Si/Fe] slope than their solar-metallicity counterparts of the
abundance, and is over-predicted by about 0.2 dex. same age. There are also smaller differences between
The main take-away from Figure 10 is as follows. the inferred IMF slope of low and high-α sequence stars,
While we are relatively confident that the chains are which is most apparent when considering the combined
not unconverged, we find that the inferred parameters posteriors. For both metallicities and at all ages, high-α
do not always result in Chempy models that are able to sequence stars are found to have more top-heavy IMF
reproduce the observed abundances, particularly when slopes than low-α sequence stars.
it comes to [Si/Fe]. Possible reasons for this will be dis- Aside from the IMF parameter, NIa is the other pa-
cussed further in Section 5.4. While we next present rameter we find to be tightly constrained by the APOGEE
how the inferred GCE parameters vary for stars of dif- abundances. As seen in the second row of Figure 11,
ferent ages and abundances, the quality of the model fits the ∼1σ posterior ranges appear to disfavor different re-
cautions any significant physical interpretation of these gions of the NIa parameter space, with dependence on
trends. stellar age and metallicity. First we notice that at every
Moving beyond the two example stars discussed un- age, the metal-poor stars are found to be fit by fewer SN
til this point, we now present the posteriors for the full Ia than their solar-metallicity counterparts of the same
sample of mono-age, mono-abundance stars that we fit. age. This difference is ∼0.25 dex on average. In addition
As discussed in Section 5.2, we fit a collection of 60 stars to the variations with metallicity, there are also trends
that fall into two [Fe/H] bins, three age bins, and have α- with age at fixed [Fe/H]. For the solar-metallicity stars,
enhancements characteristic of both the low- and high-α younger stars are found to be described by fewer SN Ia
sequences. Figure 11, which displays summary statistics compared to older stars, spanning values from log(NIa )
of the posterior distributions at each age and [Fe/H], = -3 - -2.5. For the metal-poor stars, a similar trend is
summarizes the results of these fits. In each panel, the observed with age, except these values span log(NIa ) =
individual posteriors for each star are described by the -3.5 - 2.75. Lastly, similar to the IMF parameter, there
markers placed at the 50th percentile value, and the error are small differences between the NIa posteriors of high-
bars which span the 1σ (or 16th - 84th ) percentile range. α sequence stars and low-α sequence stars. As apparent
The markers are colored by the “total absolute error”, in the combined posteriors, low-α sequence stars are de-
which is the sum of the absolute difference between the scribed by fewer SN Ia than high-α sequence stars, at
observed abundances and predicted abundances based every age and metallicity.
on the 50th posterior values. We use this error as a Unlike the IMF and SN Ia parameters, the two ISM
goodness of fit measure, which we discuss further in Sec- parameters are not as tightly constrained by the data.
tion 5.4. We also show the combined posterior, which As seen in the third row of Figure 11, the 16th - 84th
is indicated by the greyscale colorbar. These combined ranges of the star-formation efficiency posteriors are
posteriors are determined by multiplying the individual more comparable to the range of the prior, especially
posterior distributions of all the stars that fall within the for the metal-poor stars. The binned posteriors of metal-
20 Blancato et al.
↵IMF
↵IMF
2 4 6 2 8 4 10 6 2 8 4 10 6 8 10
age (Gyrs) age (Gyrs) age (Gyrs)
Figure 11. The Chempy posterior distributions for mono-age, mono-abundance stars with abundance measurements from
APOGEE. Each panel shows the values of the inferred GCE parameters (αIMF , NIa , SFE, and SFRpeak ) for stars with ages of 2, 5,
and 9 Gyrs. The plots on the left show stars with solar [Fe/H] abundances, while the plots on the right show stars with [Fe/H]
abundances of -0.3 dex. For each star, the 50th percentile of the posterior distribution is indicated by the markers, where high-α
sequence stars are indicated by triangles and low-α sequence are indicated by circles. The error bars on the markers span the
1σ (or 16th - 84th ) posterior percentile range, and the markers are colored by the total absolute error between the observed and
predicted abundances. The prior on each parameter is shown for comparison, with the variance of the distribution indicated
by the solid black line. The combined posteriors, binning stars with the same age, [Fe/H] abundance, and α-enhancement, are
shown in greyscale, with darker shades indicating a higher posterior probability.
poor stars are more constraining, and show the SFE val- of each age and α enhancement. Compared to stars of
ues for these stars to be > -0.25 dex, with weak trends the same age that are metal-poor, the solar-metallicity
seen with age and α-enhancement. For the 2 Gyr-aged stars exhibit slightly higher SFE values.
metal-poor stars, we see a discrepancy between the 50th Finally, the fourth row of Figure 11 shows the pos-
percentile values of the individual posteriors and the teriors for star-formation rate peak. As with the
combined posterior distributions, and bi-modality in the star-formation efficiency parameter, the ranges of the
combined posterior for the high-α sequence. This is a re- SFRpeak posteriors are more comparable to the range of
sult of these posteriors being non-Gaussian, exhibiting the prior. Despite this, some regions of the parameter
a one-sided tail in addition to a peak. For the solar- space are still disfavored. For the solar-metallicity stars,
metallicity stars, the posteriors are more constrained. their posteriors suggest a low probability of a SFR peak
For these stars, SFE values less than ∼0 dex are mostly occurring before ∼3 Gyrs, with a median posterior value
disfavored, with posterior medians at ∼0.25 dex for stars of ∼5.5 Gyrs for all of the stars, with only a slight dif-
[Mg/Si] variations 21
ference between the low-α and high-α sequence. For the the likelihood evaluation for discrepant [Fe/H] predic-
metal-poor stars, their posteriors suggest a low proba- tions compared to the penalties for discrepant [Mg/Fe]
bility of a SFR peak occurring before ∼2 Gyrs, with the predictions. The observational uncertainties on the
median posterior value of ∼4 Gyrs, with a slight trend [Mg/Fe] and [Si/Fe] abundances are comparable, so the
with age and α enhancement. observational uncertainties are not the primary driver of
the [Si/Fe] over-prediction.
5.4. Limitations To test if perhaps another set of yield tables could
result in better [Si/Fe] predictions, we also infer the
As emphasized throughout Section 5.3, we find that
parameter posteriors assuming the alternative Chempy
the predicted Chempy abundances are often in tension
yield tables (based on Chieffi & Limongi (2004), Thiele-
with the observed abundances, particularly in regards
mann et al. (2003), and Ventura et al. (2013)). For
to [Si/Fe]. We now quantify these discrepancies, and
the solar-metallicity stars, the predicted [Fe/H] abun-
discuss possible reasons for them that ultimately limit
dances are off by 0.06 dex (96% under-predicted, 4%
the interpretation of our chemical evolution modeling
over-predicted), the predicted [Mg/Fe] abundances are
results. In the end, we advocate that given the current
off by 0.11 dex (100% over-predicted), and the predicted
quality and quantity of abundance measurements for
[Si/Fe] abundances are off by 0.32 dex (100% over-
stars located all throughout the Milky Way, a more data-
predicted). Compared to the default yield tables, the
driven approach to galactic chemical evolution modeling
alternative yields result in poorer abundances matches.
could be a possible path forward.
Considering the metal-poor stars, the predicted [Fe/H]
Before examining how well the Chempy models are
abundances are off by 0.09 dex (100% under-predicted),
able to reproduce the abundance patterns of stars with
the predicted [Mg/Fe] abundances are off by 0.14 dex
varying metallicities, ages, and α-enhancements, we first
(100% over-predicted), and the predicted [Si/Fe] abun-
compare the default and alternative Chempy yield ta-
dances are off by 0.34 dex (100% over-predicted). Again,
bles. As discussed in Rybizki et al. (2017), the choice of
the alternative yield tables result in worse matches to the
yield tables is a hyperparameter of the Chempy model,
observed abundances.
and ideally the observed abundances can be used to de-
We make several observations from this comparison
termine which yield tables are most probable. Consider
of the predicted and observed abundances for both the
the posteriors inferred under the default yield tables
default and alternative yield tables. First is that the
(based on Nomoto et al. (2013), Seitenzahl et al. (2013),
default yield tables result in better matches to the ob-
and Karakas (2010)), which are the tables used for the
served abundances for all elements, for both the solar-
results shown in Section 5.3.
metallicity and metal-poor stars. As found in Philcox
For the solar-metallicity stars, the average absolute
et al. (2018) who present a scoring system for com-
difference between the predicted and observed [Fe/H]
paring yield tables, the optimal choice of tables even
abundances is 0.007 dex, with all of the stars hav-
when fitting only proto-solar abundances is dependent
ing their abundances over-predicted. For [Mg/Fe] the
on what specific abundances are being fit. Presumably,
predicted abundances match the observed abundances
the optimal yield tables will also differ for stars with
a bit worse, with the average absolute difference be-
varying abundance patterns. A detailed ranking of the
ing 0.056 dex (100% under-predicted). However, the
various yields tables for a collection of stars with differ-
Chempy models are unable to reproduce the observed
ent abundance properties may reveal informative trends
silicon abundances. The average absolute difference be-
regarding what tables are preferred by the data. The
tween the predicted and observed [Si/Fe] abundances is
other observation we make is that regardless of yield ta-
0.187 dex (100% over-predicted). For the metal-poor
ble choice, the silicon abundance is the most difficult to
stars, the predicted abundances are less consistent with
reproduce and is systematically over-predicted by ∼0.2 -
the observed abundances. On average, the [Fe/H] pre-
0.4 dex. Because of this, despite the [Mg/Si] trends that
dictions are 0.035 dex off from the observed abundances
the default Chempy yield tables exhibit in Section 2, the
(100% over-predicted), the [Mg/Fe] predictions are 0.073
inferred parameter values are not able to correctly repro-
dex off from the observed abundances (100% under-
duce the relative amounts of magnesium and silicon. As
predicted), and the [Si/Fe] predictions are 0.178 dex off
found in Philcox et al. (2018) there seems to be an Mg to
from the observed abundances (100% over-predicted).
Si offset for all tested CC-SN yield tables, which include
The differences between the [Fe/H] and [Mg/Fe] pre-
the ones used in this work and more recent ones from
dictions are at least partly due to the smaller measure-
the literature, cf. their Figure 4. It is unlikely that other
ment uncertainties on the [Fe/H] abundances. These
nucleosynthetic channels or the GCE model parameter-
smaller [Fe/H] uncertainties result in larger penalties in
22 Blancato et al.
ization are causing this offset. Instead, it is found that In summary, while we find the yield tables of Nomoto
the explosion energies of the CC-SN could be the reason et al. (2013), Seitenzahl et al. (2013), and Karakas
and lowering them could remedy the discrepancy (Heger (2010) to be preferred by the data, ultimately we are not
& Woosley 2010; Fryer et al. 2018). able to describe the abundance patterns of these APOGEE
While the default yield tables are preferred by the stars due to the systematic over-prediction of [Si/Fe].
data, these yield tables result in some systematic trends The reasons for this fall into three categories, which in-
between stellar properties and how well the Chempy clude shortcomings with: the yield tables, the model, or
models can reproduce the abundance patterns of stars the data. Considering the yield tables, one possibility
(as seen in Figure 11). While we find no significant is that there are nucleosynthetic contributions missing
trends with stellar age, there are some differences in from the yield tables that are necessary to reproduce
how well the models describe metal-poor versus solar- the Si abundances of these stars. The yield tables might
metallicity stars, and low-α versus high-α sequence also not be descriptive enough to accurately character-
stars. Considering metallicity, we find that the abun- ize stars with abundance patterns arising from a variety
dances of the solar metallicity stars are reproduced of initial environmental conditions and that have expe-
slightly better than the abundances of the metal-poor rienced a multitude of different evolutionary pathways.
stars. To compare the fits, we compute the “total ab- On the model side, while here we fit each star with its
solute error” between the predicted and observed abun- own unique one-zone model, the ISM parameterization
dances by summing the absolute values of the individual might be too simplistic to reproduce the abundance pat-
[Fe/H], [Mg/Fe], and [Si/Fe] differences. The average to- terns of these stars. Parameterizations that allow for
tal error of the solar-metallicity stars is 0.25 dex, while things like a bursty star-formation history, an adaptive
the average total error of the metal-poor stars is 0.29 coronal gas mass and volume, or different mixing of the
dex. As mentioned previously, a majority of this er- ISM will better describe the abundances. And lastly,
ror is from the model’s over-prediction of silicon. The shortcomings may lie with the data. In this paper, our
worse model predictions for metal-poor stars compared primary focus was to empirically characterize the ratio
to solar-metallicity stars could be due to a number of of magnesium to silicon, and then understand the phys-
factors, and we note that a similar finding is reported ical origin of these variations. With this goal, we fit
in Rybizki et al. (2017), where Arcturus with an [Fe/H] Chempy with just three abundances ([Fe/H], [Mg/Fe],
∼ -0.5 dex is fit more poorly than the Sun. Generally, and [Si/Fe]) in an attempt to generate the simplest ver-
stellar models are gauged to solar abundances, which sions of the model that would be able to reproduce the
could result in the yields preferentially producing solar data. However, GCE models could potentially be better
abundance patterns. constrained by different or more abundances, or even the
We also find differences in the ability of the models entire chemical abundance vector available from APOGEE.
to reproduce the abundances of low- versus high-α se- That said, two abundances are highly constraining in the
quence stars. First considering solar-metallicity stars of sense that they highlight deepened tensions between the
all ages, we find that on average the high-α stars have data, yield tables, and GCE models, even in a simple
marginally higher total errors than low-α stars (0.26 dex application.
compared to 0.24 dex). This small difference between
the two sequences is mostly due to worse [Mg/Fe] predic- 6. SUMMARY & CONCLUSION
tions for the high-α stars. For the metal-poor stars, we
In this paper we present a detailed investigation of
find a larger discrepancy in the accuracy of the predicted
the [Mg/Si] abundance in the Milky Way disk. We do
abundances between the low- versus high-α sequences.
this using a large sample of stars with APOGEE abun-
On average the high-α stars have a total error of 0.25
dance measurements, estimated stellar ages, and Gaia
dex, while the low-α stars have an average total error of
astrometry. Inspired by the increasing precision and
0.32 dex. This difference is mainly from both the [Fe/H]
sheer number of stars with available abundances, our
and [Si/Fe] abundance predictions, which are each ∼0.03
primary goal is to go beyond the information contained
dex worse for the low-α sequence stars. The more signifi-
in a bulk α abundance and examine the higher level of
cant difference between the low- and high-α sequence fits
granularity encoded in an inter-family abundance ratio.
at low metallicities could be a consequence of the high-α
The specific choice of magnesium and silicon is moti-
sequence abundance pattern resulting directly from CC-
vated by the expected subtle differences in their nucle-
SN yields. Fine-tuning of the abundances with Fe from
osynthetic origins, which has been previously used to
SN Ia is additionally needed to produce low-α sequence
interpret the [Mg/Si] abundances of stars in the Sat-
abundance patterns.
titargius dwarf galaxy. Our endeavor for Milky Way
[Mg/Si] variations 23
stars includes both an empirical characterization of the how the mean [Mg/Si] of the two sequences varies
[Mg/Si] abundance throughout the Galaxy, as well an throughout the Galaxy. For the high-α sequence,
attempt to link [Mg/Si] variations to a physical origin the trend with mean [Mg/Si] and radius is the
through galactic chemical evolution modeling. same at all heights from the disk midplane, with
After gaining intuition in Section 2 for how Mg and the highest [Mg/Si] occurring at R ∼ 8 kpc. How-
Si yields evolve through time in a single burst of star- ever, for the low-α sequence the highest [Mg/Si]
formation, we then focused on establishing how the ob- abundances occur at R ∼ 6 kpc and the mean
served [Mg/Si] abundance varies in the Galaxy. We [Mg/Si] at each radius scales with |z|. Since the
make connections between [Mg/Si] and various stellar [Mg/Si] abundance is correlated with age, these
properties including: stellar age, [Fe/H], location, and trends seemingly reflect the age gradients with R
stellar orbital actions. With the goal of better under- and |z| across the disk, which are distinct for the
standing the origin of the Milky Way’s bimodal α se- low- and high-α sequences.
quence, we identify differences in [Mg/Si] between low
and high-α sequence stars. Our findings are summarized • Considering trends with orbital actions, our find-
as follows: ings confirm the results of Gandhi & Ness (2019)
that the high and low-α sequences are distinct in
• High-α sequence stars are more enhanced in Jφ , Jr , and Jz at all stellar ages. Examining how
[Mg/Si] than low-α sequence stars, with the dif- [Mg/Si] varies with actions, we find that at all ages
ference in the average [Mg/Si] between the two low-α sequence stars with lower [Mg/Si] abun-
sequences being ∼0.08 dex. The two sequences dances have higher angular momenta than their
also exhibit distinct behavior in the variation of enriched [Mg/Si] counterparts. A similar trend is
their [Mg/Si] abundances across the [α/Fe]-[Fe/H] not found for the radial or vertical actions, or the
plane, where [Mg/Si] varies more strongly with high-α sequence and any action. Presumably, this
[α/Fe] and [Fe/H] in the low-α sequence than it reflects the radial metallicity gradient in the disk
does in the high-α sequence. Given the expected at a given age, and a bulk chemical composition of
theoretical Mg and Si yields (Figure 1), observed star-forming gas that varies with time and Galac-
variations in [Mg/Si] at early times could po- tocentric radius, but not with distance from the
tentially be used to discriminate between differ- disk midplane.
ent IMFs. This is tentatively supported by the
Chempy inferences in Section 5, where high-α se- These observed trends with [Mg/Si] abundance support
quence stars are found to be described by a slightly established and reveal new connections between chem-
more top-heavy IMF than low-α sequence stars. istry and orbital dynamics. Given the expected differ-
ences in the chemical enrichment processes responsible
• The enhanced [Mg/Si] abundance of the high-α for generating Mg and Si yields, the varying [Mg/Si]
sequence compared to the low-α sequence is per- abundance relations between low versus high-α sequence
sistent at all stellar ages. Considering the evo- stars bolsters the notion that the two sequences are dis-
lution of [Mg/Si] with stellar age, from old to tinct in their origin and formation. Detailed matching
young ages the [Mg/Si] abundance of the low-α se- of these observational trends between [Mg/Si], age, and
quence decreases nearly 2× more than the [Mg/Si] dynamics to the properties of simulated Milky Way-like
abundance of the high-α sequence. This differ- galaxies can place powerful constraints on disk forma-
ence is persistent for solar-metallicity stars, as well tion and chemical evolution mechanisms.
as metal-poor stars, but is minimized at higher In the second half of this paper we focused our atten-
[Fe/H] abundances where the two α sequences be- tion on understanding the physical origin of the observed
come less distinct. [Mg/Si] variations. Specifically, we perform galactic
chemical evolution (GCE) modeling, which combines
• Inspired by Hayden et al. (2015)’s characteriza- physically motivated models of star-formation, the ISM,
tion of the [α/Fe]-[Fe/H] distribution in the disk, and galactic evolution to predict the chemical content
we examine how the [Mg/Si]-[Fe/H] distribution of the ISM through time. Here, we employ the recently
varies with location in the Galaxy, from R = 3 developed Chempy code that, given a set of observed
- 15 kpc and |z| = 0 - 2 kpc. We find that the abundances and estimated stellar age, allows for infer-
[Mg/Si]-[Fe/H] distributions of the high and low- ence of GCE model parameters.
α sequences considerably overlap at each R and We infer GCE parameters for a set of mono-age, mono-
|z|, but that there are significant differences in metallicity stars with various [Mg/Fe] and [Si/Fe] abun-
24 Blancato et al.
dances. For each star, we obtain posterior distributions for his assistance in planning the Chempy runs on the
for four parameters: the slope of the high-mass IMF, Flatiron Institute computing cluster.
the number of SN Ia explosions per solar mass over a 15 KB is supported by the NSF Graduate Research Fel-
Gyr time period, the star-formation efficiency, and the lowship under grant number DGE 16-44869. KB thanks
peak of the star-formation rate. While there are poten- the LSSTC Data Science Fellowship Program, her time
tially interesting relationships between these parameters as a Fellow has benefited this work. MN is supported
and stellar properties, our interpretation of them is lim- by the Alfred P. Sloan Foundation. KVJ’s contribu-
ited. This is because the Chempy models are unable to tions were supported in part by the National Science
reproduce the APOGEE abundances for the diversity of Foundation under grants NSF PHY-1748958 and NSF
stars that we fit. A majority of the discrepancy between AST-1614743. Her work was performed in part during
the predicted and observed abundances comes from the the Gaia19 workshop and the 2019 Santa Barbara Gaia
consistent over-prediction of [Si/Fe], which has also been Sprint (also supported by the Heising-Simons Founda-
reported by Philcox et al. (2018) for a number of tested tion), both hosted by the Kavli Institute for Theoretical
CC-SN yield tables. We also find systematic trends with Physics at the University of California, Santa Barbara.
metallicity and α enhancement, and the predictive qual- Funding for the Sloan Digital Sky Survey IV has been
ity of the Chempy models. This reveals tensions between provided by the Alfred P. Sloan Foundation, the U.S.
the GCE models, yield tables, and observed abundance Department of Energy Office of Science, and the Par-
measurements. ticipating Institutions. SDSS-IV acknowledges support
The main conclusions of this paper are as follows. and resources from the Center for High-Performance
Given the large number of stars with high quality abun- Computing at the University of Utah. The SDSS web
dance measurements available, small variations in these site is www.sdss.org. SDSS-IV is managed by the As-
abundances can be characterized by averaging stars with trophysical Research Consortium for the Participating
similar properties, such as α-enhancement, location, age, Institutions of the SDSS Collaboration including the
or actions. In this way, we were specifically motivated Brazilian Participation Group, the Carnegie Institution
to examine the inter-family ratio of two α-elements, Mg for Science, Carnegie Mellon University, the Chilean
and Si, because of expected differences in how they are Participation Group, the French Participation Group,
produced in CC-SN and SN Ia events. This approach we Harvard-Smithsonian Center for Astrophysics, Instituto
take in dissecting the [Mg/Si] abundance can be gener- de Astrofı́sica de Canarias, The Johns Hopkins Univer-
alized to isolate other particular enrichment channels. sity, Kavli Institute for the Physics and Mathematics of
In theory, other inter-family or intra-family abundances the Universe (IPMU) / University of Tokyo, Lawrence
can be used to probe specific nucleosynthesis or disk for- Berkeley National Laboratory, Leibniz Institut für As-
mation mechanisms, and these empirical relationships trophysik Potsdam (AIP), Max-Planck-Institut für As-
can serve as detailed constraints for simulations of Milky tronomie (MPIA Heidelberg), Max-Planck-Institut für
Way-like galaxies. However, in practice we encountered Astrophysik (MPA Garching), Max-Planck-Institut für
challenges in connecting variations in Mg and Si to un- Extraterrestrische Physik (MPE), National Astronomi-
derlying stellar physics. Specifically, we found that a cal Observatories of China, New Mexico State Univer-
flexible model of galactic chemical evolution is unable sity, New York University, University of Notre Dame,
to predict even just three stellar abundances that rep- Observatário Nacional / MCTI, The Ohio State Uni-
resent a diversity of enrichment histories. This failure versity, Pennsylvania State University, Shanghai As-
highlights tensions between the chemical evolution mod- tronomical Observatory, United Kingdom Participation
els, the yield tables, and the data. As we have now en- Group, Universidad Nacional Autónoma de México,
tered a regime of rich stellar abundance information, a University of Arizona, University of Colorado Boulder,
more data-driven approach to chemical evolution models University of Oxford, University of Portsmouth, Uni-
and nucleosynthetic yield tables may be a way forward. versity of Utah, University of Virginia, University of
Washington, University of Wisconsin, Vanderbilt Uni-
versity, and Yale University.
REFERENCES
Abolfathi, B., Aguado, D. S., Aguilar, G., et al. 2017, Gunn, J. E., Carr, M., Rockosi, C., et al. 1998, AJ, 116,
ArXiv e-prints, arXiv:1707.09322 3040
Adibekyan, V. Z., Sousa, S. G., Santos, N. C., et al. 2012, Hasselquist, S., Shetrone, M., Smith, V., et al. 2017, ApJ,
A&A, 545, A32 845, 162
Anders, F., Chiappini, C., Santiago, B. X., et al. 2014, Hayden, M. R., Bovy, J., Holtzman, J. A., et al. 2015, ApJ,
A&A, 564, A115 808, 132
Bedell, M., Meléndez, J., Bean, J. L., et al. 2014, ApJ, 795, Haywood, M., Lehnert, M. D., Di Matteo, P., et al. 2016,
23 A&A, 589, A66
Bedell, M., Bean, J. L., Meléndez, J., et al. 2018, ApJ, 865, Heger, A., & Woosley, S. E. 2010, ApJ, 724, 341
68 Karakas, A. I. 2010, MNRAS, 403, 1413
Bensby, T., Feltzing, S., & Lundström, I. 2003, A&A, 410, Larson, R. B. 1976, MNRAS, 176, 31
527 Mackereth, J. T., Crain, R. A., Schiavon, R. P., et al. 2018,
Bensby, T., Feltzing, S., & Oey, M. S. 2014, A&A, 562, A71 MNRAS, 477, 5072
Blanton, M. R., Bershady, M. A., Abolfathi, B., et al. 2017, Mackereth, J. T., Bovy, J., Schiavon, R. P., et al. 2017,
AJ, 154, 28 MNRAS, 471, 3057
Bournaud, F., Elmegreen, B. G., & Martig, M. 2009, ApJL, Mackereth, J. T., Bovy, J., Leung, H. W., et al. 2019, arXiv
707, L1 e-prints, arXiv:1901.04502
Bovy, J., Rix, H.-W., & Hogg, D. W. 2012a, ApJ, 751, 131 Maoz, D., Sharon, K., & Gal-Yam, A. 2010, ApJ, 722, 1879
Bovy, J., Rix, H.-W., Liu, C., et al. 2012b, ApJ, 753, 148 McMillan, P. J., Kordopatis, G., Kunder, A., et al. 2018,
Bovy, J., Nidever, D. L., Rix, H.-W., et al. 2014, ApJ, 790, MNRAS, 477, 5279
127 McWilliam, A., Wallerstein, G., & Mottini, M. 2013, ApJ,
Carlin, J. L., Sheffield, A. A., Cunha, K., & Smith, V. V. 778, 149
2018, ArXiv e-prints, arXiv:1805.04120 Navarro, J. F., Abadi, M. G., Venn, K. A., Freeman, K. C.,
Chabrier, G. 2003, PASP, 115, 763 & Anguiano, B. 2011, MNRAS, 412, 1203
Chieffi, A., & Limongi, M. 2004, ApJ, 608, 405 Ness, M., Hogg, D. W., Rix, H.-W., Ho, A. Y. Q., &
Chollet, F., et al. 2015, Keras, https://keras.io Zasowski, G. 2015, ApJ, 808, 16
Clarke, A. J., Debattista, V. P., Nidever, D. L., et al. 2019, Ness, M., Hogg, D. W., Rix, H.-W., et al. 2016, ApJ, 823,
MNRAS, arXiv:1901.00931 114
Foreman-Mackey, D. 2016, The Journal of Open Source Nidever, D. L., Bovy, J., Bird, J. C., et al. 2014, ApJ, 796,
Software, 1, 24 38
Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, Nissen, P. E. 2015, A&A, 579, A52
J. 2013, PASP, 125, 306 Nomoto, K., Kobayashi, C., & Tominaga, N. 2013,
Fryer, C. L., Andrews, S., Even, W., Heger, A., & ARA&A, 51, 457
Safi-Harb, S. 2018, ApJ, 856, 63 Pagel, B. 1998, Astronomy and Geophysics, 39, 5.27
Fuhrmann, K. 1998, A&A, 338, 161 Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011,
Gandhi, S. S., & Ness, M. K. 2019, arXiv e-prints, Journal of Machine Learning Research, 12, 2825
arXiv:1903.04030 Philcox, O., Rybizki, J., & Gutcke, T. A. 2018, ApJ, 861, 40
Garcı́a Pérez, A. E., Allende Prieto, C., Holtzman, J. A., Prochaska, J. X., Naumov, S. O., Carney, B. W.,
et al. 2016, AJ, 151, 144 McWilliam, A., & Wolfe, A. M. 2000, AJ, 120, 2513
Gelman, A., M. X., & Stern, H. 1996, Statistica Sinica, vol. Ramı́rez, I., Meléndez, J., Bean, J., et al. 2014, A&A, 572,
6, no. 4, 1996, pp. 733-760 A48
Gilmore, G., & Reid, N. 1983, MNRAS, 202, 1025 Reddy, B. E., Lambert, D. L., & Allende Prieto, C. 2006,
Girardi, L. 2016, ARA&A, 54, 95 MNRAS, 367, 1329
Goodman, J., & Weare, J. 2010, Communications in Rybizki, J., Just, A., & Rix, H.-W. 2017, A&A, 605, A59
Applied Mathematics and Computational Science, Vol. 5, Sanders, J. L., & Binney, J. 2016, MNRAS, 457, 2107
No. 1, p. 65-80, 2010, 5, 65 Sanders, J. L., & Das, P. 2018, MNRAS, 481, 4093
Grand, R. J. J., Bustamante, S., Gómez, F. A., et al. 2018, Seitenzahl, I. R., Ciaraldi-Schoolmann, F., Röpke, F. K.,
MNRAS, 474, 3629 et al. 2013, MNRAS, 429, 1156
26 Blancato et al.
Spina, L., Meléndez, J., Karakas, A. I., et al. 2018, Venn, K. A., Irwin, M., Shetrone, M. D., et al. 2004, AJ,
MNRAS, 474, 2580 128, 1177
Thielemann, F.-K., Argast, D., Brachwitz, F., et al. 2003, Ventura, P., Di Criscienzo, M., Carini, R., & D’Antona, F.
Nuclear Physics A, 718, 139 2013, MNRAS, 431, 3642
Ting, Y.-S., & Rix, H.-W. 2018, arXiv e-prints, Weinberg, D. H., Holtzman, J. A., Hasselquist, S., et al.
arXiv:1808.03278 2018, arXiv e-prints, arXiv:1810.12325
Tinsley, B. M. 1979, ApJ, 229, 1046 Wilson, J. C., Hearty, F., Skrutskie, M. F., et al. 2010, in
Toyouchi, D., & Chiba, M. 2016, ApJ, 833, 239 Proc. SPIE, Vol. 7735, Ground-based and Airborne
Instrumentation for Astronomy III, 77351C
Trick, W. H., Coronado, J., & Rix, H.-W. 2019, MNRAS,
Woosley, S. E., & Weaver, T. A. 1995, ApJS, 101, 181
484, 3291
[Mg/Si] variations 27
APPENDIX
↵IMF
log10 (NIa)
log10 (SFE)
log10 (SFRpeak)
Figure 12. The distribution of average integrated autocorrelation times (left), and the average number of estimated independent
posterior samples (right) for each inferred Chempy parameter.
28 Blancato et al.
Figure 13. Example trace plots (left) and joint posterior distributions (right) of the four inferred Chempy parameters. The
top panel shows the posterior distributions for a solar-metallicity, ∼5 Gyr-aged star in the low-α sequence, and the bottom panel
shows the posterior distributions for a metal-poor, ∼9 Gyr-aged star in the high-α sequence. The blue curves indicates the prior
for each parameter, while the histograms show the marginalized posterior distributions with the 50th percentile values indicated
by the dashed lines. The sharp edges in the SFRpeak posterior distributions are a result of the model being constrained to
explore SFR peaks occurring before ∼12.6 Gyrs.
[Mg/Si] variations 29