Professional Documents
Culture Documents
https://doi.org/10.1038/s41467-020-14660-y OPEN
1 Neutron Scattering Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN 37831, USA. 2 Theoretical Division and CNLS, Los Alamos
National Laboratory, Los Alamos, NM 87545, USA. 3 National Center for Computational Sciences, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak
Ridge, TN 37831, USA. 4 Materials Science and Technology Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN 37831, USA.
5 Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA 70803, USA. 6 Department of Physics and Astronomy, University of
Tennessee, Knoxville, TN 37996, USA. 7 Instituto de Física de Líquidos y Sistemas Biológicos, UNLP-CONICET, La Plata, Argentina. 8 School of Physics and
Astronomy, University of St Andrews, St Andrews, UK. ✉email: samarakoonam@ornl.gov
E
xtracting the correct interactions from experimental data is Methods: Experimental details) on the CORELLI instrument at
essential for modeling. For magnetic insulators, the model is the Spallation Neutron Source, Oak Ridge National Laboratory to
described by the spin Hamiltonian equation, dictated by measure the magnetic state of Dy2Ti2O7. Three-dimensional (3D)
symmetry, single-ion properties, and electron overlap between volumes of diffuse scattering were measured in the 100 –960 mK
ions. The problem of extracting a spin Hamiltonian from neutron temperature range. In view of the low temperature equilibration
scattering data (inverse scattering problem) is often ill-posed and challenge, we undertake our analysis on data sets at 680 mK,
compounded by the need to use theory to interpret scattering which is low enough for correlations to be well developed but
data. Further, the available experimental data may not be enough sufficiently high to reach equilibrium over a short time scale.
to accurately determine the model parameters because of limited Figure 2a shows the background-subtracted data at 680 mK. This
access to experimental data, a large noise magnitude at each is proportional to the modulus squared of the spin components in
scattering wavevector, or systematic errors associated with, e.g., the wavevector space. However, an additional aspect of neutron
background subtraction. Selecting the optimal Hamiltonian to scattering is that it samples only the spin components perpen-
model the experimental data is often a formidable task, especially dicular to the wavevector transfer Q.
when many parameters must be simultaneously determined. We employ a dipolar spin-ice Hamiltonian that includes
Tools for doing so are needed to uncover the physics that is exchange terms up to third-nearest neighbors:
emerging from large classes of complex magnetic materials1,2. P P P
H ¼ J1 Si Sj þ J2 Si Sj þ J3 Si Sj
Dy2Ti2O7 is a highly frustrated magnet showing complex beha- hi;ji1 hi;ji2 hi;ji3
vior including spin liquid formation3–8. The magnetism originates ð1Þ
from Dy3+ ions which behave as classical Ising spins on a pyro- P P Si Sj 3ðSi rij ÞðSj rij Þ
þ J30 Si Sj þ Dr1 3
jrij j jrij j
3 5
chlore lattice of corner-linked tetrahedra, as in Fig. 1a. Figure 1b hi;ji30 i;j
shows the four essential magnetic interactions including a ferro-
magnetic coupling that results from the combination of exchange where Si can be viewed as an Ising spin of the ith ion, Fig. 1b. The
with a large dipolar interaction. This FM coupling makes Dy2Ti2O7 model includes first, second, and two different third nearest
a canonical spin-ice material, i.e., the spins on each tetra- neighbor interaction strengths, J1, J2, J3, and J3′, respectively.
hedron obey the ice rules that only allow for two-in two-out There is also a dipolar interaction with strength D, which couples
configurations9,10. This divergenc-free condition leads to a spin the ith and the jth spins, according to their displacement vector
liquid with macroscopic degeneracy that features north and south rij. Prior work has determined D = 1.3224 K and J1 = 3.41 K to
charged magnetic monopoles interacting via a 1/r potential at ele- high accuracy17–21. In the present modeling effort, we seek to
vated temperatures3. A full low temperature characterization determine the three unknown parameters J2, J3, and J3′ without
demands the study of a vast number of spins subject to short and any use of prior knowledge. Given a model Hamiltonian H, we
long-range interactions. Spin dynamics occurs through millisecond use Metropolis Monte Carlo to generate a simulated structure
quantum tunneling processes11 and the measured characteristic factor, Ssim(Q), to be compared with the experimental data Sexp
equilibration time τ increases drastically upon lowering the tem- (Q) (Methods: Simulation details).
perature, leading to irreversible behavior below 600 mK12–15. This
slowdown has resulted in major difficulties16,17 in measuring and Optimizing Hamiltonian for diffuse scattering data. In a direct
interpreting experiments such as heat capacity at low temperatures. approach, one might try to minimize the squared distance,
Here we introduce an autoencoder-based approach that can 1X 2
potentially address important modeling challenges, such as a χ 2SðQÞ ¼ mðQÞ Sexp ðQÞ Ssim ðQÞ ð2Þ
N Q
proper background and noise subtraction, more reliable inference
of model Hamiltonians, improved transferability to other physical between the raw experiment and simulation data. We introduce a
systems, and efficiency. We apply our method to neutron scat- factor mðQÞ 2 f0; 1g masking selected Q-points where experi-
tering measurements of Dy2Ti2O7 in order to infer the optimal mental artifacts can be identified (see Supplementary Fig. 4). The
parameters for a dipolar spin ice model description. P
number of non-masked Q-points is N ¼ Q mðQÞ 1:2 ´ 105 .
We initially investigated optimization methods, such as the par-
Results ticle swarm method22, to overcome local barriers and find the
Neutron scattering measurements and simulations. Here we global minimum of χ 2SðQÞ . However, despite nominal success in
use diffuse neutron scattering from time-of-flight techniques (see optimization, we quickly ran into reliability issues stemming from
a b
3′
b
c Dy
2
Ti
O 1
Fig. 1 Crystal structure and the effective magnetic model. a Atomic structure of Dy2Ti2O7 comprised of tetrahedra of magnetic Dy ions (blue) and
nonmagnetic octahedra of oxygen ions (red) surrounding Ti ions (cyan). b The magnetic moments located on Dy ions are constrained by crystal field
interactions to point in or out of the tetrahedra. They form a corner sharing pyrochlore lattice. The pathways of nearest neighbor (1), next-nearest-neighbor
(2) and two inequivalent next-next-nearest neighbor (3 and 3′) interactions are shown as thick colored lines.
a b c
1 1 1
[l, l, l ]
[l, l, l ]
[l, l, l ]
0 0 0
2 2 2
–1 –1 –1
1 0 0] 1 0
0] 1 0 0]
0 h, 0 h, 0 h,
[k, ,– ,– [k, k ,–
k, – –1 –2 [h [k, k
, –2 –1 –2 [h , –1 –2 [h
2k ] k] –2k
]
Fig. 2 Comparison of the experimental and simulated data. a The scattering function Sexp(Q) for the Dy2Ti2O7 crystal at 680 mK. b The same
exp
experimental data after being filtered by the autoencoder, SAE ðQÞ. c Simulated scattering data from the optimal model spin Hamiltonian, Ssim
opt ðQÞ. In going
from (a) to (b), the autoencoder has filtered out experimental artifacts such as the red peaks, the missing data at the dark patches, etc. Using both
scattering and heat capacity data, we determine the optimal spin Hamiltonian couplings to be J2 = 0.00(6) K, J3 = 0.014 (16) K and J30 = 0.096(12) K with
J1 = 3.41 K and D = 1.3224 K having been fixed in prior work17.
errors in the experimental and simulation data. As we will discuss We employ the simplest possible autoencoder architecture: a
below, χ 2SðQÞ is both noisy and effectively flat around its minimum, fully-connected neural network with a single hidden layer. The
such that many distinct model Hamiltonians could achieve simi- hidden space activations P (i.e., the latent space representation) are
larly small values of the χ 2SðQÞ error measure. Thus, even if we defined as SL ¼ f ð Q WL;Q mðQÞSðQÞ þ bL Þ, where S(Q) is the
could find the global minimum of χ 2SðQÞ , it might still be far from input to the autoencoder, and the matrix WL,Q and vector bL are to
be determined from the machine learning training process. Given
the physically correct model for Dy2Ti2O7. simulated structure factor data as input, we interpolate to the
To address the ill-posed nature of this inverse scattering problem, experimental Q-points as necessary. The output of the autoencoder
we present two strategies: (1) We employ machine learning P 0 0
is defined as SAE ðQÞ ¼ f ð 30 L¼1 WQ;L SL þ bQ Þ, where the new
techniques to replace χ 2SðQÞ . with our error measure χ 2SL that is 0 0
matrix WQ;L and vector bQ are also trainable. We employ the
more robust to errors in the experimental and simulation data, and
logistic activation function f ðxÞ ¼ 1=ð1 þ ex Þ at both layers. This
puts more weight on “characteristic features” of the structure factor.
choice guarantees that the output SAE(Q) is non-negative.
(2) Rather than reporting just the single “best” model, we sample
Figure 2 illustrates how the trained autoencoder processes the
from the entire set of Hamiltonian models for which the error
Dy2Ti2O7 scattering data. Figure 2a shows the raw experimental
measure is below some tolerance threshold. In this way, our method
data Sexp(Q) while Fig. 2b shows how the autoencoder filters the
will report not just a model, but also a model uncertainty. exp
We use an autoencoder23 to formulate χ 2SL , our choice of error experimental data to produce SAE ðQÞ. The autoencoder preserves
measure. Autoencoders were originally developed in the context important qualitative features of the data, while being very
of computer vision, where they are known to be effective at image effective at removing experimental artifacts. Figure 2c shows the
compression and denoising tasks. Here we apply them to simulated data Ssim opt ðQÞ for the optimal Hamiltonian model Hopt,
interpret structure factor data, and to disambiguate among many without any autoencoder filtering. We will describe later our
possible solutions of the inverse scattering problem. Our procedure to determine Hopt. Note that the best model, Ssim opt ðQÞ, is
exp
autoencoder is a neural network that takes an S(Q) as input in remarkably good agreement with SAE ðQÞ. This agreement is
(either simulated or experimental), encodes it into a compressed consistent with the fact that the autoencoder was trained
latent space representation SL, and then decodes to an output specifically to reproduce simulated data.
SAE(Q) that captures the essence of the input S(Q), while Figure 3 provides another way to understand how the
removing irrelevant noise and artifacts. autoencoder is processing the S(Q) data. In Fig. 3a we show a
The autoencoder’s latent space SL ¼ ðS1 ; S2 ; ¼ SD Þ provides a cross section of Sexp(Q) in the high symmetry plane ð½h; h; 0
low-dimensional characterization of the S(Q) data. The dimension ½k; k; 2kÞ. Figure 3b shows the corresponding simulated data
opt ðQÞ for the optimal model Hamiltonian. Now we perturb Hopt
D of the latent space should strike a balance between overfitting and Ssim
underfitting. Keeping D relatively small limits the autoencoder’s to a new model Hperturb, which keeps all parameters from Hopt
ability to fit irrelevant noise in the training data. On the other hand, except modifies J2 from 0.16 K to −0.18 K. Despite the relatively
D should be large enough to allow the autoencoder flexibility to significant change ΔJ2 ffi 0:1J1 , there is very little change to
capture physically relevant characteristics in S(Q). We selected D = the structure factor. Indeed, Fig. 3c shows that ΔSsim ¼ Ssim
30 based on the D-dependance of ΔS(Q) (error over the validation opt ðQÞ
a 1.5
b 1.5 c 1.5
3 3
[k, k, –2k]
[k, k, –2k]
[k, k, –2k]
0 1.5 0 1.5 0 0
[k, k, –2k]
0 0 0 0 0 0
Fig. 3 Effect of latent space variables on structure factor. a Experimental scattering data Sexp(Q) in the high symmetry plane. Dark patches indicate zero
signal due to nuclear Bragg peaks. b The corresponding simulated data for the optimal model Hamiltonian. c Change in the simulated structure factor ΔSsim
when the J2 coupling is perturbed from 0.16 K to −0.18 K. The relatively weak response ΔSsim to a perturbation in J2 of ±0.05 J1 illustrates the challenge in
inferring the correct spin Hamiltonian. d–f The change in the autoencoder output when 1, 6, and 12 latent space components, respectively, are updated to
account for the perturbation on J2 (see main text for details).
to the structure factor? To answer this question, we replace 1, 6, the one obtained from the latent space representation of the S(Q)
and 12 latent space components of SL with the corresponding ones data using a linear autoencoder, which is equivalent to the
in S0L . The components selected are those with the largest principal component analysis (PCA).
deviations, jSL S0L j. Figure 3d–f show the change in autoencoder Here we propose an alternative error measure. The
output, after substitution of the latent space components. Panel f 30-dimensional latent space representation SL should, in some
captures some physically important characteristics of ΔSsim while sense, capture the most relevant information in S(Q). This
discarding irrelevant noise. Latent space components beyond 12 suggests that to compare Sexp(Q) to Ssim(Q) we should actually
carry little information about ΔSsim. look at the squared distance of their latent space vectors
Now we show how the autoencoder can assist in solving the
inverse scattering problem, i.e., in finding the optimal model 1 X D
exp 2
χ 2SL ¼ SL Ssim : ð4Þ
Hamiltonian Hopt given the experimental data Sexp(Q). To NL L¼1 L
0.3 a
2
0.2 CS(Q)
2S(Q)
d
0.1 0.4
2S(Q) < CS(Q)
2
0 2
–0.6 –0.4 –0.2 0 0.2 0.4 2S < CSL
L
J2 0.2
2multi < Cmulti
2
0.3 b
Hopt
J2
0
(Q)
0.2 ref.17
AE
ref. 18
2S
0.1 –0.2
ref. 20
0
–0.6 –0.4 –0.2 0 0.2 0.4 –0.4
J2
0.3 c –0.6
–0.4
0.2
L
0
2S
CS2 J3
L
0.4 0.2 0.4
0.1 –0.2 0
–0.6 –0.4
J3′
0
–0.6 –0.4 –0.2 0 0.2 0.4
J2
Fig. 4 Cost functions and their effects. Inferring the spin Hamiltonian for Dy2Ti2O7. a χ 2SðQÞ directly measures the distance between experimental and
simulated S(Q) data. It is relatively flat and noisy around its minimum, thus yielding a large uncertainty in the J2 coupling (any value below the dashed line,
C2SðQÞ , is a reasonable candidate). b χ 2SAE ðQÞ measures the distance between S(Q) data after being filtered through the autoencoder. c χ 2SL measures the
distance between the 30-dimensional latent space representations of the S(Q) data. The Gaussian process model ^χS2L ðHÞ(red curve) accurately
approximates χ 2SL , even in the full space of J2, J3, and J30 . d Once a good model ^χS2L ðHÞ has been constructed, we can rapidly identify the optimal Hamiltonian
model (magenta cross). The autoencoder-based error measure χ 2SL yields much smaller model uncertainty (blue region) than the naïve one χ 2SðQÞ (cyan
region). Model uncertainty is further reduced using a multi-objective error measure χ 2multi that incorporates heat capacity data (dark-blue region). Three
most popular Hamiltonian sets currently used from refs. 17,18,20 have also been marked as green, red and gray crosses respectively.
to χ 2SL . Optimization, as described in methods (Methods: The agreement between Sexp(Q) and Ssim opt ðQÞ is quite good, as
Optimization) gives the optimal parameters as J2 = 0.34(6) K, previously observed in Figs. 2 and 3. Further comparisons are
J3 = −0.134(18) K and J30 = 0.102(32) K. The red curve of Fig. 4c shown in Supplementary Fig. 2. To truly validate the model,
shows a cross section of the final ^χS2L ðHÞ model. The minimum at however, we should compare to experimental data that has not
been used during the model optimization process. For this
J2 = 0 is readily apparent. The dashed line in Fig. 4a indicates our
2 purpose, we use the magnetic field dependence of different
empirically selected error tolerance threshold CSðQÞ . The dashed physical properties shown in Fig. 5. The optimal spin model
2
line in Fig. 4c shows CSL , the corresponding tolerance threshold reproduces the measured field dependence of the magnetiza-
for the latent space error. We calculated CS2L from CSðQÞ
2
under the tion25, the zero-field cooled (ZFC) and field cooled (FC) magnetic
assumption of a fixed amount of uncertainty in the scattering data susceptibility12, and the diffuse scattering at multiple temperature
[Methods: Uncertainty quantification]. Figure 4d shows the and applied field conditions, confirming that we have indeed
three-dimensional regions of uncertainty corresponding to found a model Hamiltonian adequate to describe the magnetic
χ 2SðQÞ < CSðQÞ
2
(cyan) and χ 2SL < CS2L (blue). properties of Dy2Ti2O7 including the onset of irreversibility and
glassiness.
a c 1.5 e 1.5
Data Simulation Data Simulation 3
1
1 1 2.5
0.9
Data 0.48 K
0.5 0.5 2
[k, k, –2k]
[k, k, –2k]
Data 0.99 K
0
M
–1 –1 0.5
0.6 H // [111]
680 mK H = 0.2 T 680 mK H = 0.6 T
–1.5 –1.5 0
0 0.5 1 1.5 2 2.5 3 –2 –1 0 1 2 –2 –1 0 1 2
H (T ) [h, –h, 0] [h, –h, 0] [h, –h, 0] [h, –h, 0]
b 2 d 1.5 f 1.5
Data Simulation Data Simulation
1 1
1.5
0.5 0.5
[k, k, –2k]
[k, k, –2k]
1 ZFC (Data) 0 0
ZFC (Sim.)
FC (Data)
FC (Sim.) –0.5 –0.5
0.5
H = 0.05 T –1 –1
Fig. 5 Validation of the optimal solution over multiple experiments. a Magnetization as a function of magnetic field and temperature, b zero-field cooled
(ZFC) – field cooled (FC) susceptibility, c–f magnetic diffuse scattering measured at different temperature and magnetic field combinations: [680 mK,
0.2 T], [100 mK, 0 T], [680 mK, 0.6 T] and [900 mK, 0.2 T] respectively. All the experiments and simulations shown here are done under magnetic field
along the [1,1,1] direction. The magnetization and the ZFC-FC data are extracted from refs. [5,26, respectively.
is time consuming and error prone, our methodology is fully experimental studies. For example, the application of relatively
automated, and helps overcome difficulties of visualizing 3D or small external fields and pressures or dopings should be enough
4D data. to push Dy2Ti2O7 into new magnetic regimes. For instance, the
Finally, we remark that the autoencoder latent space provides proximity to the ferromagnetic phase (blue regime in Fig. 6a)
an interesting characterization of structure factor data in its own indicates that the saturation field is small, as confirmed by
right. Supplementary Fig. 5 in the supplement illustrates how the magnetization, Fig. 5a.
30-dimensional latent space variables map to S(Q). Supplemen- In summary, a fundamental bottleneck in experimental con-
tary Fig. 6 illustrates the activations of each latent space variable densed matter physics is model optimization and assessment of
at varying points in the space of J3 J30 parameters. confidence levels. We have developed a machine learning-based
Future studies might explore more direct application of auto- approach that addresses both challenges in an automated way.
encoders to the problems of background subtraction and of Applied to Dy2Ti2O7 our method produces a model that accounts
denoising experimental data. Here, we investigate another inter- for the diffuse scattering data as well as the lack of magnetic
esting application of the autoencoder: It can delineate different ordering at low temperature. Our approach readily extends to the
magnetic regimes. To demonstrate this, we will explore the space analysis of dynamical correlations, parametric data sets in e.g.
of J3 and J30 parameters, while keeping J2 = 0 K fixed. Our goal is field and temperature, and other scattering data.
to build a map of regimes with different dominant spatial mag-
netic correlations within this two-dimensional Hamiltonian Methods
space. We caution that the transitions between regimes will Experimental details. To measure the diffuse scattering of Dy2Ti2O7 an iso-
typically not be sharp phase transitions, so our modeling will not topically enriched single crystal sample of Dy2Ti2O7 was grown using an optical
produce a phase diagram in the strict sense. floating-zone method in a 5 atm oxygen atmosphere. Starting material Dy2O3
(94.4% Dy-162) and TiO2 powder were first mixed in proper ratios and then
Figure 6a shows the result of our clustering analysis on the annealed in air at 1400 °C for 40 h before growth in the image furnace as previously
simulated data (Methods: Clustering). The optimal spin Hamil- described26. Then the sample was further annealed in oxygen at 1400 °C for 20 h
tonian for Dy2Ti2O7 is marked as Hopt near the center of this after the floating-zone growth. The lack of a nuclear spin moment in Dy-162 means
map. The corresponding Ssim(Q) data, sliced in the high sym- that nuclear spin relaxation channels for the spins are cut off which is important in
order to study the quench behavior in the material. In addition, the incoherent
metry plane, had previously been shown in Fig. 3b. Figure 6b–i scattering from natural dysprosium is high (54.4 barns) whereas for Dy-162 it is
show the Ssim(Q) data for alternative Hamiltonians, as marked on zero and the absorption cross section is decreased from 994 barns (2200 ms−1
the map. It is clear from these results that the spin Hamiltonian of neutrons) for natural dysprosium to 194 barns for isotope 162. A best growth was
Dy2Ti2O7 is close to the confluence of multiple regimes. This fact achieved with a pulling speed of 3 mm/hour. One piece of crystal with the mass ≈
reveals an additional source of complexity that explains the dif- 200 mg was aligned in the (111) plane for the neutron investigation at the single
crystal diffuse scattering spectrometer CORELLI at the Spallation Neutron Source,
ficulties that were encountered in previous characterizations of Oak Ridge National Laboratory. The crystal was prepared as a sphere to minimize
this material. This analysis suggests a roadmap for further absorption corrections and demagnetization corrections.
a T = 680 mK b 2
1.0
1
0.6 1
[h, –h, 0]
7
5 0
0.4
6 –1
8
0.2
H
× opt –2
J3′ (K) –1 0 1
0 4 0.5 [k, k, –2k ]
c 2
–0.2 2
3 1
[h, –h, 0]
–0.4
0
1 2
–0.6 –1
0.0
–0.4 0 0.4 –2
J3 (K) –1 0 1
[k, k, –2k ]
d 2 e 2 f 2
3 4 5
1 1 1
[h, –h, 0]
[h, –h, 0]
[h, –h, 0]
0 0 0
–1 –1 –1
–2 –2 –2
–1 0 1 –1 0 1 –1 0 1
[k, k, –2k ] [k, k, –2k ] [k, k, –2k ]
g 2 h 2 i 2
6 7 8
1 1 1
[h, –h, 0]
[h, –h, 0]
[h, –h, 0]
0 0 0
–1 –1 –1
–2 –2 –2
–1 0 1 –1 0 1 –1 0 1
[k, k, –2k ] [k, k, –2k ] [k, k, –2k ]
Fig. 6 Map of regimes with different spatial magnetic correlations. The map is generated by varying J3 and J30 , with the remaining Hamiltonian
parameters J1, J2, and D being fixed to their optimal values. The colors in a indicate different groups of S(Q), which correspond to regimes with different
patterns of magnetic correlations. b–i Simulated S(Q) data sliced in the high symmetry plane at specific points indexed on panel (a). The large dark-blue
regime (1) corresponds to long-range ferromagnetic order, as indicated by the Bragg peaks in panel (b). c–i Correspond to different patterns of short-range
correlations arising from subsets of states that still obey ice rules. The pattern (g) falls within the same orange regime as Hopt, and qualitatively captures
the Coulombic correlations expected for spin ice [cf. experimental scattering data in Fig. 3a].
CORELLI is a time-of-fight instrument where the elastic contribution is using 4 × 4 × 4 cubic supercells, giving a total of 1024 spins in the pyrochlore
separated by a pseudo-statistical chopper27. The crystal was rotated through 180 lattice. The magnetic form factor of Dy3+ and the neutron scattering polarization
degrees with the step of 5 degree horizontally with the vertical angular coverage of factor that enter in the calculation of the spin structure factor, S(Q), are accounted
±8 degree (limited by the magnet vertical opening) for survey on the elastic and before comparison to the background corrected experimental data. To correctly
diffuse peaks in reciprocal space. The dilution refrigerator insert and cryomagnet account for the long-range dipolar interactions, we used Ewald summation31,
were used to enable the measurements down 100 mK and fields up to 1.4 T. The implemented with the fast Fourier transform.
data were reduced using Mantid28 and Python script available at Corelli.
Background runs at 1.4 Tesla were made to remove all diffuse signal and the extra
scattering at Bragg peak positions due to the polarized spin contribution was
accounted for by using the zero field intensities. (see Supplementary Fig. 01) Figs. 2a Training details. To train the autoencoder, we require a dataset sufficiently broad
and 3a shows a 3D plot and a slice of the high symmetry plane of the background- to cover all potentially important characteristic features of the Dy2Ti2O7 scattering
subtracted diffuse scattering measurement at 680 mK and 0 T respectively. data. For this purpose, we employ 1000 model Hamiltonians of the form Eq. (1).
Each model has individually randomized coupling strengths J2, J3, and J3′, sampled
uniformly from the range −0.6 K and 0.6 K. For each model, we use simulated
Simulations details. Given a model Hamiltonian H, we use Metropolis Monte annealing to generate equilibrated three-dimensional Ssim(Q) data at the target
Carlo to generate a simulated structure factor, Ssim(Q), to be compared with the temperature of 680 mK. Our training data will thus consist of 1000 model
experimental data Sexp(Q). We use simulated annealing to properly estimate Ssim Hamiltonians, each labeled by simulated data. The autoencoder tries to minimize
(Q)29. Beginning at an initial temperature of 50 K, we iterate through 11 expo- the deviation between its input Ssim(Q). and filtered output, summed over all
nentially spaced intermediate temperatures, until finally reaching the target tem- random models in the dataset.
perature of 680 mK. At each intermediate temperature, 5 × 106 Monte-Carlo Training the autoencoder corresponds to determining the parameters (i.e., W, b,
sweeps were performed. At every sweep, each spin is updated once on average, W′, and b′) that minimize a loss function L. Primarily, we are interested in
according to the Metropolis acceptance criterion30. We perform our simulations minimizing the squared error between the simulated data and autoencoder-filtered
Competing interests This is a U.S. government work and not under copyright protection in the U.S.; foreign
The authors declare no competing interests. copyright protection may apply 2020