Professional Documents
Culture Documents
Evaluating A Quantum-Classical Quantum Monte Carlo Algorithm With Matchgate Shadows
Evaluating A Quantum-Classical Quantum Monte Carlo Algorithm With Matchgate Shadows
shadows
ows scale exponentially with system size, providing a tum hardware systems. These calculations reach agree-
barrier to achieving quantum advantage. ment with the reference values even in the absence of
The inefficiencies of Clifford shadows for QC-QMC error mitigation. In Sec. IV, we discuss the merits and
were recently addressed in Ref. [2] by replacing the Clif- shortcomings of QC-QMC and how it is related to other
ford circuits with Matchgate circuits [27]. To the best quantum-classical hybrid algorithms and purely classi-
of our knowledge, the proposal has not yet been exper- cal QMC methods. Finally, Sec. V concludes our work
imentally evaluated in the literature (although we note with a summary.
recent numerical investigations probing the statistical
properties of Matchgate shadows applied to chemical
systems [26, 28]). In particular, it is unclear whether II. PRELIMINARIES
the Matchgate shadows approach exhibits the noise re-
silience that underpinned the success of the original (un- The workflow of the QC-QMC method is shown in
scalable) Clifford shadows approach [1], as the theoreti- Fig. 1, where the computation on quantum and classical
cal justification for the noise resilience of Clifford shad- hardware is partitioned by classical shadows. In this
ows does not immediately extend to the Matchgate case. section, we first provide a brief overview of the auxiliary-
In addition, while the proposal is formally efficient, its field quantum Monte Carlo (AFQMC) algorithm [33]
practicality for studying realistic systems has yet to be used in the QC-QMC method. Then we introduce the
established. classical shadows formalism and discuss its use in QC-
In this work, we numerically and experimentally QMC. Readers familiar with this background material
study the QC-QMC algorithm, incorporating Match- can skip to the summary of our results in Sec. III.
gate shadows. In particular, we probe the noise re-
silience of the algorithm on quantum hardware. We
observe and theoretically characterize the persistence A. Auxiliary-field quantum Monte Carlo
of the noise resilience that was observed for Clifford
shadows, uncovering a more subtle microscopic origin. Within the electronic structure community, QMC
The Matchgate shadows approach reduces the classical refers to a family of methods that bypass the explicit
post-processing cost from exponential to formally poly- optimization of the many-body wavefunction in an ex-
nomial. However, its use in QC-QMC has high degree ponentially large Hilbert space. Specifically, probabilis-
polynomial scaling in the system size, resulting from tic sampling is applied in a subspace of the full Hilbert
computing the local energy of each QMC walker. The space, resulting in a polynomial scaling of the memory
algorithm also has large constant factors resulting from required for the evaluation of the ground state energy
the number of samples used in the classical shadow post- of a given system. The auxiliary-field quantum Monte
processing, as well as the number of QMC walkers and Carlo (AFQMC) method is one example of projector
the number of QMC timesteps. These final two factors QMC methods where a stochastic imaginary time evo-
are properties of the QMC algorithm itself, rather than lution of the wavefunction is carried out by propagating
aspects of the quantum algorithm. We observe hours of samples on a manifold of nonorthogonal Slater determi-
post-processing runtime for even the smallest chemical nants. Each sample {|ϕl ⟩} is usually called a “walker”.
systems, using thousands of CPUs. This fundamentally The ground state energy is estimated as
challenges the scalability of the approach. We thus vali-
iθl loc
P
date the persistence of one of the major strengths of QC- l wl e El ⟨ΨT |H|ϕl ⟩
E= P iθl
, Elloc = , (1)
QMC, as well as highlight the challenges that must be l w l e ⟨ΨT |ϕl ⟩
overcome if the technique is to provide practical quan-
tum advantage. where Elloc is the local energy of each walker, wl , θl
The rest of the paper is organized as follows. In Sec. II are its weight and phase, H is the electronic structure
we discuss how QC-QMC is performed and the basics of Hamiltonian, and |ΨT ⟩ is a trial state defined as an ap-
classical shadows, which serve as prerequisites for sub- proximation to the true ground state. The walker wave-
sequent discussions. In Sec. III, we first discuss prior function and its weight and phase are updated every
⟨ΨT |ϕi ⟩
work on noise robust classical shadows [29, 30] (and time step according to functions of ⟨Ψ T |ϕj ⟩
that perform
their recently developed Matchgate variants [31, 32]) the imaginary time evolution, see App. A for details.
and give an extension to these methods that mitigate For a generic Hamiltonian, however, similar to other
state preparation noise in QC-QMC. Second, we exper- projector QMC methods, AFQMC suffers from the sign
imentally and analytically confirm the natural noise re- problem, or more precisely, the phase problem where
silience of the Matchgate shadow protocol as used in the phases θl of the walkers are evenly distributed in
QC-QMC. Finally, we apply QC-QMC to simulate the [0, 2π) [33, 34]. It leads to the ground state energy es-
dissociation of hydrogen, and the ground state energy timator, i.e., Eq. 1 (computed from the ensemble av-
of a solid-state spin defect system, on different quan- erage of all the walkers) experiencing an exponentially
3
! "
𝑄! , #𝑏! % !!"#$%&
Figure 1: Workflow for the hybrid quantum-classical quantum Monte Carlo (QC-QMC) algorithm: (a). An equal
superposition of the all-zero state |0⟩, and the quantum trial state |ΨT ⟩ is prepared on a quantum computer,
followed by the twirling of random unitary circuits UQ and measurements; (b). The measurement outcomes {|bi ⟩}
and the random circuit unitary UQ —which together constitute the information required to reconstruct classical
shadows—are communicated to the classical computers; (c). The QMC procedure is carried out on a classical
computer, where the walker states |ϕi ⟩ evolve on the potential energy surface from the initial state |ΨI ⟩ towards
the target ground state |Ψg ⟩.
fast decay of the signal-to-noise ratio and a large sta- to |ΨT ⟩. The former approach suffers a significant over-
tistical error. To remedy the phase problem, a com- head in quantum computation due to the large number
mon solution is the so-called phaseless approximation of walkers typically used in AFQMC. In addition, it
(ph) [33], where θl = 0 is enforced by a modified up- requires iterative communication between the classical
date rule, thus constraining the evolution of the walk- and quantum hardware, since |ΨT ⟩ is queried at every
ers, see App. A for details. Such an approximation, time step during the time evolution. We therefore fol-
while controlling the phase problem, introduces a bias lowed the approach of Huggins et al. in this work, and
to the estimated ground state energy. The magnitude we provide a detailed scaling comparison of the two ap-
of the bias is largely determined by how well the trial proaches in Section IV.
state |ΨT ⟩ represents the exact ground state. On classi-
cal computers, |ΨT ⟩ is usually chosen as either a single
Slater determinant, e.g., the Hartree-Fock state, or a
linear combination of Slater determinants, to ensure a
polynomial scaling in computational time [35].
The insight of Ref. [1] is that on a quantum com-
puter, one may efficiently compute overlaps between B. Classical shadows
Slater determinant walker states and a much wider
range of trial states, e.g., the unitary coupled clus-
ter (UCC) state [12, 36]. We refer to implementing As mentioned in the previous subsection, the need to
AFQMC in this way as QC-AFQMC. Such states are po- evaluate ⟨ΨT |ϕ⟩ for all walkers at each time step is a key
tentially “closer” to the target ground state than those consideration for the practicality of the algorithm. In
adopted as classical trials, and could lead to more accu- Ref. [1] it was shown that the measurements required in
rate ground state energy estimations. It is currently QC-AFQMC can be recast into the following framework.
unclear what are the best AFQMC trial states that Let ρ denote the density matrix of an n-qubit quantum
can be prepared on a quantum computer, in low cir- state that we know how to prepare, and {Oi } denote a
cuit depth. This is a similar issue to that of choosing collection of M observables whose expectation values,
a well-motivated and implementable ansatz circuit in i.e., tr(Oi ρ) we wish to estimate. Classical shadow to-
VQE. We will discuss this problem in Sec. IV, and as- mography [19] provides a way to estimate these quanti-
sume for now that a suitable AFQMC trial state can be ties with a cost that only scales logarithmically with M .
efficiently prepared on a quantum computer. Then, the Specifically, it estimates each tr(Oi ρ) up to some error
central issue for the QC-AFQMC scheme lies in how to ϵ by the following procedure: i) choose a distribution D
evaluate the overlap amplitude ⟨ΨT |ϕ⟩ for each walker, of unitary transformations; ii) sample random unitaries
at each timestep. Efforts in the literature have branched U ∈ D from the chosen distribution and iii) measure the
out in two directions: Xu and Li [20] designed circuits state U ρU † in the computational basis {|b⟩} (we refer to
based on the Hadamard test [37] to efficiently compute the tensor products of {|0⟩, |1⟩} of each qubit as the com-
the overlap between the walker states and the trial state. putational basis, i.e., {|b⟩}b∈{0,1}n ) to obtain the mea-
On the other hand, Huggins et al. [1] computed the am- surement outcome |b⟩⟨b|. Consider the state U † |b⟩⟨b|U ;
plitude by using the classical shadows technique applied in expectation, the mapping from ρ to U † |b⟩⟨b|U defines
4
• Explicitly showed how to efficiently compute the which has more subtle origins than in the Clifford case.
overlap between trial states and Slater determi- Third, we apply the QC-AFQMC algorithm to compute
nant states using Matchgate shadows, including i) the dissociation curve of the hydrogen molecule on a
bounding the worst case variance in the estimate. superconducting qubit quantum computer; and ii) the
We refer the interested reader to Ref. [2, Sec.IIID] for a ground state of a negatively charged nitrogen-vacancy
more detailed comparison between these recent works. (NV) center in diamond on a trapped ion quantum com-
The results of Ref. [2] overcame the post-processing lim- puter.
itations of Clifford shadows, showing that overlap am- We use the following notation in this section: we de-
plitude can be computed as note overlap amplitudes obtained from noiseless shad-
n
ows with an ordinary bracket, ⟨ΨT |ϕ⟩, quantities sub-
h i
†
X
−1 ject to quantum noise with a tilde bracket, ⟨Ψ^
⟨ΨT |ϕ⟩ = 2 f2l E tr |ϕ⟩⟨0|Π2l UQ |b̂⟩⟨b̂|UQ , T |ϕ⟩, and
Q∼B(2n) we use a tilde plus subscript r to denote quantities ob-
l=0
(8) tained from a robust shadow protocol, ⟨Ψ ^ T |ϕ⟩r .
where the random Matchgate circuits UQ are prepared
by sampling random signed permutation matrices Q
(Q ∈ B(2n), where B represents the Borel group). In
A. Robust Matchgate shadow protocol
Eq. 8, Π2l is the projector associated with the l-th even
irrep, and its eigenvalue f2l is
−1 Refs. [29, 30] developed a “robust shadow proto-
2n n col”, showing that when the quantum noise is gate-
f2l = . (9)
2l l independent, time-stationary, and Markovian (GTM),
Importantly, Eq. 8 can be efficiently solved using the and the state preparation of ρ is perfect, a noise-free
matrix Pfaffian, with a scaling of O (n − ζ/2)4 by expectation value can be obtained. The protocol works
polynomial interpolation [2], where ζ denotes the num- by replacing the eigenvalue(s) f of M with modified
ber of electrons. As such, the exponential post- value(s) fe that compensates for the effect of noise. The
processing bottleneck present in the Clifford shadows robust scheme uses additional shadow-like circuits to
approach is eliminated through the use of Matchgate estimate the value(s) of fe. It can be viewed as pas-
shadows. sive error mitigation, as the updates are made purely
Inherent noise resilience, analogous to that observed in the classical post-processing of the shadow expecta-
for overlap ratios computed via Clifford shadows, has tion value. The framework was originally applied to
not yet been established for overlap ratios computed global and local Clifford shadows. Recently, two sepa-
via Matchgate shadows. One sufficient but not neces- rate works extended the robust shadow protocol to the
sary condition, (following from the Clifford case), would Matchgate setting [31, 32]. The latter of these works
be fe2l = αf2l , l ∈ {0, 1, . . . , n}, where α is a proportion- observed a close connection between the circuit used for
ality constant. In Sec. III B 2 we investigate the noise re- the robust Matchgate shadow protocol and prior work
silience of overlap ratios computed via Matchgate shad- on Matchgate randomized benchmarking [44]. We will
ows. While our experimental results do not display a closely follow this approach, presenting derivations for
completeness in App. E.
universal proportionality constant between each fe2l and
The robust Matchgate shadows protocol [32] esti-
f2l , we nevertheless find that the computed ratios are
inherently robust to noise, and provide justification for mates {fe2l } using the following steps: i) prepare the
this observation. all-zero state |0⟩ on a quantum computer; ii) sample
Q ∈ B(2n) and apply the Matchgate circuit UQ to |0⟩;
iii) measure in the computational basis and collect mea-
III. RESULTS surement outcomes |b⟩. The expressions used to calcu-
late {fe2l } from these measurement outcomes are given in
In this section, we present our results, divided into App. E. The sample complexity is asymptotically equiv-
three parts. We first introduce prior work on noise alent to that used in the noiseless case [32].
robust classical shadows, and provide an improvement The established robust shadow scheme focuses on
that mitigates state preparation noise in QC-AFQMC. noise in the shadow unitary (UQ in Fig. 1a), and does
Second, we experimentally test for noise resilience of not mitigate any noise occurring during state prepara-
both overlap amplitudes and ratios of overlap ampli- tion (VT in Fig. 1a). We extend the method to (partly)
tudes, evaluated via the Matchgate shadow technique. account for noise that occurs during the state prepara-
Notably, we observe that the ratios are resilient to the tion unitary. In QC-AFQMC, we evaluate the over-
effects of noise (as was previously observed for Clif- lap amplitudes by measuring classical shadows of an
ford shadows [1]). We provide a theoretical explana- equal superposition state √12 (|ΨT ⟩ + |0⟩). This state
tion for the observed noise resilience and its limitations, is prepared via a circuit that first generates a superpo-
6
sition of the Hartree-Fock state with |0⟩ state, and then their accuracy is limited by the effects of noise. The two
applies a unitary VT , such that VT |ΨHF ⟩ = |ΨT ⟩ and robust shadow approaches (accounting for state prepa-
VT |0⟩ = |0⟩ [45]. As discussed above, to estimate the ration error (SP) and not accounting for it) mitigate
values of {fe2l } required in the robust Matchgate proto- the impact of hardware noise. In particular, compen-
col, we apply a Matchgate shadow circuit to the initial sating for the effects of state preparation noise results in
state |0⟩. We can partially account for the noise in the a much smaller deviation from the ideal noiseless values
state preparation circuit in the following way. We mini- than the standard robust approach. This observation
mally alter the QC-AFQMC circuit in Fig. 1a, such that signifies the importance of considering the state prepa-
it measures {fe2l }, by removing the initial Hadamard ration noise when using robust shadows on quantum
gate. In a noiseless setting, this modified version of VT hardware.
would still result in ρ = |0⟩ ⟨0|, as required. In the
noisy setting, we can treat the noise as originating in
the Matchgate circuit, which is then mitigated by the
robust protocol. While we do not yet have a full math-
ematical characterization of the noise models that can
be mitigated by our enhanced robust Matchgate shad-
ows technique, we demonstrate the efficacy of this ap-
proach through numerical simulations and experiments
on quantum hardware, presented in the following sub-
section.
2. Noise resilience in the evaluation of overlap ratios Proof. A full proof is given in App. D 4, and we sketch
a brief outline here. The operation Π2l (|ϕ⟩⟨0|) can be
re-expressed as 2−n jl ⟨0|γj†l |ϕ⟩γjl where γjl are strings
P
Using the results presented in the previous section
for {⟨ΨT |ϕi ⟩}, we can compute the 120 possible ratios of 2l Majorana operators (defined in App. D). The ac-
of overlaps originating from the 16 Slater determinants tion of the Majorana strings is to create “1’s” in the
sampled. In Fig. 2 (lower panel) we show the MAE of |0⟩ state. Only terms with ζ 1’s created contribute
these overlap ratios with respect to the ideal values (and to the sum. Projecting the resulting sum into the re-
verify in Fig. 9 in App. H that this behavior holds for stricted subspace ensures the ‘eigen-operator’ property.
not only the MAE, but the errors associated with each The prefactor is given by counting arguments for the
overlap ratio). We observe the following behavior number of Majorana strings that can yield ζ 1’s for a
given l value.
⟨ΨT |ϕi ⟩ ^
⟨Ψ T |ϕi ⟩r
^
⟨Ψ T |ϕi ⟩
≈ ≈ , (10) The justification for restricting to the [0, ζ]-electron
⟨ΨT |ϕj ⟩ ^
⟨ΨT |ϕj ⟩r ^
⟨ΨT |ϕj ⟩ subspace is that ρ is an equal superposition of |0⟩ and
when the number of shadow circuits exceeds 100. This is |ΨT ⟩, and therefore the trace of ρ with operators out-
an indication of noise resilience in evaluating the overlap side this subspace will necessarily be zero. This will
ratios, suggesting that the robust shadow protocol may still hold approximately true when ρ is reconstructed
not be required for QC-AFQMC. We provide theoretical from a finite number of classical shadows. This oper-
justification for these observations below. ator only appears in the classical post-processing step
The first (approximate) equality in Eq. 10 will not and is therefore not affected by noise. Therefore, the
hold for all noise models, as the robust Matchgate noisy overlap amplitude can be expanded as
shadow protocol is only guaranteed to correct the ef- n
!
† ˆ ˆ
X hD ED Ei
fects of noise if its assumptions (GTM noise, noiseless −1
^
⟨ΨT |ϕ⟩ ≈ 2 f2l b2l E 0 UQ b̃ b̃ UQ ϕ ,
state preparation) are satisfied. Nevertheless, for cases l=0
Q∼B(2n)
where the robust Matchgate protocol is able to correct (13)
the effects of noise the equality of the ratios follows di- where in deriving this equation, we have used
tr AM−1 (B) = tr M−1 (A)B , where operators A, B
rectly. In our experiments, we found that the robust
Matchgate protocol was able to correct the individual are in the subspace where M is invertible [2]. This
overlaps up to a small residual error attributed to resid- expression can be compared to Eq. 7 for Clifford shad-
ual state preparation noise. This leads to the approxi- ows, noting the common feature of the separation of
mate equality between the ratios. the expression into a sum over the channel index l, and
The second (approximate) equality shows that the ra- the expectation over the shadow unitaries acting on the
tio of any two uncorrected overlap amplitudes (which walker state |ϕ⟩. In order to mitigate the effects of noise
are themselves inaccurate, see Fig. 2 (upper panel)) with the robust Matchgate shadows approach, we could
yields the same results as ratios of overlaps computed replace f −1 with fe−1
2l 2l
via the robust Matchgate protocol. To understand this !
result, we first restate Eq. 8 for computing the overlap n
† ˆ ˆ
X hD ED Ei
^ −1
in the absence of noise ⟨ΨT |ϕ⟩r ≈ 2 fe2l b2l E 0 UQ b̃ b̃ UQ ϕ .
Q∼B(2n)
n l=0
(14)
h i
†
X
−1
⟨ΨT |ϕ⟩ = 2 f2l E tr |ϕ⟩⟨0|Π2l UQ |b̂⟩⟨b̂|UQ .
Q∼B(2n) If the noise satisfies GTM assumptions (and state prepa-
l=0
ration is noise-free), Eq. 14 is able to correct the
EDeffect of
We make use of the following Theorem: ˆ ˆ
P noise that manifests from the deviation of b̃ b̃ from
Theorem 1. For an n qubit state |ϕ⟩ = i ci |i⟩ which ED
is a linear combination of computational basis states |i⟩ b̂ b̂ . If these assumptions on the noise are not satis-
with fixed Hamming weight ζ, fied, robust Matchgate shadows may not be able to per-
fectly recover the noiseless value. We can now observe
P0,ζ Π2l (|ϕ⟩⟨0|) = b2l |ϕ⟩⟨0|, (11)
that regardless of whether the robust procedure is used,
with the terms corresponding to the sum over l in Eq. 13 and
( n−ζ ζ ζ Eq. 14 are both independent of |ϕ⟩. As such, when com-
2ζ−n l− ζ2
, if 2 ≤l ≤n− 2 ^
⟨Ψ T |ϕi ⟩r
^
⟨Ψ T |ϕi ⟩
b2l = (12) puting the ratios and , the prefactor will
0, otherwise ^
⟨ΨT |ϕj ⟩r
^
⟨ΨT |ϕj ⟩
silience in the overall algorithm. We first compute the in energy as a function of imaginary time for the 0.75
dissociation curve of the four-qubit hydrogen molecule Å geometry. There is little observable difference (∼ 0.1
studied in the previous subsections using the QC- mHa) between the noisy experiment and its noiseless
AFQMC algorithm implemented on the IBM Hanoi su- classical emulation.
perconducting qubit quantum processor. We then apply We further note that even when the state prepara-
the QC-AFQMC algorithm to compute the ground state tion suffers from coherent errors, the QC-AFQMC algo-
of an NV center in diamond, using the Aria trapped ion rithm is still able to recover accurate ground state en-
quantum processor from IonQ. In both settings, we emu- ergies, providing the effective quantum trial state |Ψe T ⟩,
late practical applications of QC-AFQMC by assuming reconstructed from the noisy shadows, does not appre-
a trial state generated by an imperfect VQE calcula- ciably deviate from |ΨT ⟩. This is a result of the dissipa-
tion. In both cases, we use a tailored UCCSD ansatz, tive nature of the imaginary time process emulated by
see App. H, which was optimized until the energy was AFQMC, which will drive any initial state towards the
approximately 30 ∼ 80 mHa higher than the FCI refer- ground state. This further enhances the inherent noise
ence. resilience of the QC-AFQMC algorithm.
We emphasize that even for the hydrogen molecule
in a minimal basis set, executing the QC-AFQMC algo-
1. Hydrogen molecule dissociation rithm using Matchgate shadows requires a substantial
amount of resources, with classical computation domi-
nating over the quantum subroutine. It was necessary to
In Fig. 4 (upper panel) we show the ground state en-
perfectly parallelize the classical AFQMC propagation,
ergies obtained from QC-AFQMC at 5 different molec-
with one walker per CPU core2 (this is only possible
ular geometries. If we target an error ϵ = 10−2 on any
when not using population control). For each walker, it
overlap, with ≥ 99.99% confidence, the rigorous sample
took approximately 1 minute to post-process the shad-
complexity bounds presented in Eq. 5 imply the need to
ows from 16000 circuits in each timestep. As each step
use ∼ 1.1 × 105 shadow circuits (with one measurement
of the AFQMC algorithm must be performed sequen-
shot per circuit). As shown in Fig. 2, it was sufficient to
tially, it could take up to hours or even days to com-
use only 16000 circuits for each bond length, suggesting
plete the full evolution, see App. H for details. This
the bounds of Eq. 5 are overly pessimistic for the small
total runtime partly comes from the large number of
system studied here. Given the noise resilience observed
shadows, which plays the role of a pre-factor and could
in evaluating the overlap ratios, the use of robust shad-
be further improved by implementing additional paral-
ows was not necessary. In our AFQMC calculations,
lelization among the shadows, for example, processing
we used 4800 walkers for all numerical data presented
the shadows corresponding to a single walker in paral-
here to ensure a negligible variance, and a time step
lel across a number of cores, rather than sequentially
∆τ = 0.005 H.a.−1 resulting in a negligible Trotter er-
on the same core. The total runtime also stems from
ror. We chose the initial walker state to be the Hartree-
a scaling as high as O(n8 ) to evaluate the local energy
Fock state, for each of the five distances studied here.
of each walker, which is discussed in detail in App. B.
The calculation was parallelized across 4800 CPU cores
This high cost presents a practical concern for scaling
(1 core per AFQMC walker), as discussed in more detail
this algorithm to larger systems of practical interest.
below.
In the interest of conserving computational resources,
only two of the five data points (solid points) were ob-
2. Nitrogen-vacancy center ground state
tained using the scalable Matchgate shadows approach
outlined above. The remaining three data points were
obtained using an exponentially scaling approach used We applied the QC-AFQMC algorithm to study the
in Ref. [1] and detailed in App. H 1 d, that is ultimately electronic states of a point defect in a solid. We chose
more efficient than the scalable Matchgate approach for the NV center in diamond, which is widely considered
small system sizes. We verified that the two schemes a promising candidate for quantum sensing and has re-
give the same results for overlap amplitudes. Thus, the cently been applied to imaging high-pressure phase tran-
two Matchgate-obtained points are used as a complete sitions [47] and superconducting systems [48]. To obtain
evaluation of the practicality of the algorithm, while the multireference states we employed a quantum defect em-
remaining three points are only presented to confirm the bedding theory (QDET) [49], which allows us to de-
accuracy of the algorithm in the presence of hardware rive an effective Hamiltonian Heff , by defining an active
noise. space, see App. G for details. This effective Hamilto-
We observe that all five QC-AFQMC calculations
converge to within chemical accuracy of the FCI ref-
erence value, even without the use of the robust shadow
2 Intel Xeon Platinum Processor.
protocol. In Fig. 4 (lower panel) we show the variation
10
Method for computing ⟨ΨT |ϕ⟩ Vacuum reference [20] Clifford shadows [1] Matchgate shadows [2]
a M in the scaling should be interpreted as the total number of walker states involved in the algorithm. The logarithmic factor
entering the complexities for the shadow-based methods arises from the use of a median-of-means estimator, and is not present in
the heuristic mean estimator used in this work.
b O(NT ) represents the quantum trial state preparation circuit depth complexity. And the |ϕ⟩ preparation, random Clifford, and
Matchgate circuit for the three schemes all scale as O(n) in depth.
c The local energy numerator can be computed by decomposing it into a linear combination of O(n4 ) overlap amplitudes, due to the
complexity of the Hamiltonian. The number of shadow samples required also contributes to the post-processing cost.
would in principle be corrected by imaginary time evo- in the trial, respectively. This can be compared against
lution. In turn, the quantum trial state used in the the complexities of the quantum algorithm, which is
QC-AFQMC algorithm could come from VQE calcula- summarized in Table. I for the three different proposed
tions (in a smaller active space, or for a reduced prob- approaches for measuring the overlap amplitudes. We
lem size). However, one notable difference from the see that the classical algorithms are more efficient in n
VQE ansatz is that the quantum trial preparation cir- than the classical shadows-based quantum algorithms,
cuit needs to satisfy VT |0⟩ = |0⟩, if following the ap- which scale as O(n8.5 log2 n). As such, the quantum trial
proach used in this work. This additional constraint state would need to lead to a much smaller bias than a
prevents the use of some popular ansatz circuits in VQE, classical multi-Slater trial state with a large number of
such as qubit coupled cluster [52], and circuit simplifi- determinants to be practically advantageous. It is cur-
cation techniques [53]. For some ansatz circuits it may rently an open question as to what kind of quantum trial
be possible to circumvent this limitation by introducing state is best suited for QC-AFQMC. The only criterion
another ancilla qubit, and controlling the implementa- adopted so far is the trial state energy ⟨ΨT |H|ΨT ⟩, but
tion of the ansatz (which can often be done by control- there might exist other, better-suited criteria.
ling select gates, rather than every gate), following the
procedure in App. E of Ref. [32]. While Matchgate classical shadows are formally effi-
cient, they introduce a large post-processing cost. This
The possibility of achieving exponential quantum ad- presents a practical challenge for scaling up the QC-
vantage in solving quantum chemistry problems remains AFQMC algorithm to larger system sizes. To high-
controversial [54]. This is arguably not the direct aim light this, in Fig. 6 we present the classical runtime for
of QC-AFQMC, given that phaseless-AFQMC is already post-processing a single timestep of the QC-AFQMC
a polynomially scaling method. Instead, the motivation algorithm, assuming parallelization with 1 CPU core
of ph-QC-AFQMC is that the trial state used may pro- per walker. Numerical results were obtained for hydro-
vide energies with smaller biases than the purely classi- gen and then extrapolated to larger system sizes us-
cal algorithm. A full comparison requires an improved ing the scaling of the QC-AFQMC algorithm. The re-
understanding of the advantages offered by quantum sults show that our implementation of the algorithm is
trial states over classical trial states. As a proxy for only realistic up to eight spin-orbitals (water). Within
this, we could compare the cost of classical methods each CPU, computation associated with postprocess-
using state-of-the-art techniques [55–58], such as Multi- ing the results of each of the shadow circuits can be
Slater trial states. Two popular implementations us- further parallelized. While not investigated in this pa-
ing Wick’s theorem have the following scaling for eval- per, parallelization between shadow circuits can poten-
uating the local energy, O(M Xn3 + M XNc ) [35] and tially speed up the classical runtime. For example, us-
O(M Xn4 + M Nc ) [59], where M, X, Nc are the number ing one million CPU cores [60] would drive the classical
of walkers, Cholesky vectors, and Slater determinants post-processing of benzene toward a realistic regime, of
12
the use of IBM Quantum services for this work and to short-time propagator can be approximated as
advanced services provided by the IBM Quantum Re-
searchers Program. The views expressed are those of the e−∆τ (H−E0 )
authors and do not reflect the official policy or position ∆τ ∆τ
vγ2 − ∆τ
Y
≈ e−∆τ (H0 −E0 ) e− 2 v0
e 2 e 2 v0
of IBM or the IBM Quantum team. γ (A5)
Z
= dxp(x)B(x) + O(∆τ 2 ),
nary time evolution (ITE) process: The constant prefactor is usually left out and p(x) is the
iN multi-variable standard normal distribution. After this
h τ transformation, we have only one-body terms left, and
e−τ (H−E0 ) |ΨI ⟩ = e−∆τ (H−E0 ) |ΨI ⟩, ∆τ = ,
N the original interacting problem is now mapped onto an
(A1) ensemble of non-interacting problems coupled to a set of
where E0 is some approximated ground state energy auxiliary fields. The probability distribution p(x) in the
typically from mean-field calculations. ITE projects out above expression is realized by an ensemble of walkers
the excited state components in the initial state |ΨI ⟩, |ϕl ⟩, which are evolved as
and the true ground state |Ψg ⟩ is ultimately obtained
as long as |ΨI ⟩ is not orthogonal to it. In general, it is |ϕ(i+1) ⟩ = B(x)|ϕ(i) ⟩, (A7)
often as difficult to realize the ITE in Eq. A1 as solving
the time-independent Schrödinger equation. To rem- where the superscript represents the time step. The
edy this challenge, the Hubbard-Stratonovich transfor- walkers in AFQMC are chosen as nonorthogonal Slater
mation [67, 68] can be employed, where the two-body determinants and stay in the manifold of Slater deter-
interaction in the following form is recast into one-body minants S during evolution, due to the Thouless theo-
coupled to an auxiliary field: rem [71, 72]. The ground state is then estimated by a
mixed energy estimator which is exact and can lead to
Z
dx x2
γ
√ considerable simplifications in practice
∆τ
vγ2
e 2 = √ γ e− 2 e ∆τ xγ vγ , (A2)
2π ⟨ΨT |He−N ∆τ (H−E0 ) |ϕ(0) ⟩
E= , (A8)
⟨ΨT |e−N ∆τ (H−E0 ) |ϕ(0) ⟩
where vγ is a matrix that represents a one-body inter-
action, and xγ is the auxiliary field. To achieve that, where |ΨT ⟩, |ϕ(0) ⟩ are the trial state and initial Slater
the Hamiltonian, typically written in second quantized determinant, respectively.
form, has to be rewritten into a Cholesky decomposed So far we haven’t imposed any constraints on the
format evolution of walkers, which is known as free-projection
AFQMC. The only potential systematic error comes
1 X
hij a†i aj + Vijkl a†i a†j al ak
X
H = H0 + from the Trotterization error. However, free-projection
i,j
2 AFQMC is prone to large fluctuations in the estimated
i,j,k,l
(A3) energy because of an asymptotic instability in τ , also
1X 2
= H0 + v 0 − v , known as the phase problem. The phase problem man-
2 γ γ ifests as the phase θl of walkers evolves into the whole
[0, 2π) range during the random walk (e.g., see Fig. 3 in
where H0 is a constant, h, V are the one- and two- Ref. [34] for visualization), and results in the denomina-
tor in Eq. 1, i.e., l wl eiθl approaching 0 exponentially
P
electron integrals, v0 is the modified one-body interac-
tion, vγ = iLγ , and Lγ is the Cholesky vector of the quickly with τ . This problem could be viewed as a gen-
two-body interactions satisfying eralization of the sign problem, see Refs. [33, 34] for a
detailed discussion.
X X Importance sampling is a variance-reduction tech-
Lγ = Lγpq a†p aq , Vijkl = Lγik Lγ∗
lj . (A4)
nique widely used in QMC methods. It’s also employed
pq γ
to control the phase problem in AFQMC. To do that,
we first note that the Eq. A5 still holds up to a complex-
Note that we’ve assumed a mean-field subtraction [33] valued shift x. Therefore it can be modified as:
in the above Hamiltonian. With it, and the second-
order Trotter formula [69, 70]: eA+B ≈ eA/2 eB eA/2 , the p(x) → p(x − x), B(x) → B(x − x). (A9)
14
To find the best x, we first expand Eq. A8 as Algorithm 1 Auxiliary-field quantum Monte Carlo
R QN −1 (N )
1: Input H, E0 , |ΨT ⟩, |ΨI ⟩, number of walkers M , number
k=0 dxk p(xk − xk )⟨ΨT |H|ϕ ⟩ of time steps N , and step size ∆τ . (H will be decom-
E≈ RQ N −1 (N ) posed into L Cholesky vectors.)
k=0 dxk p(xk − xk )⟨ΨT |ϕ ⟩
R QN −1 (N ) 2: for l = 1 to M do
(k) ⟨ΨT |H|ϕ
k=0 dxk p(xk )I(xk , xk , ϕ ) ⟨ΨT |ϕ(N ) ⟩ ⟩ 3:
(0) (0)
|ϕl ⟩ ← |ΨI ⟩, wl ← 1 ▷ Initialize the walker.
= R QN −1 , 4: for k = 1 to N do
(k) )
k=0 dxk p(xk )I(xk , xk , ϕ 5: for γ = 1 to L do
(A10) √ (k−1)
⟨Ψ |v̂ |ϕl ⟩
6: xγ = − ∆τ T γ (k−1)
⟨ΨT |ϕl ⟩
where we’ve defined 7: Sample x according to distribution p(x)
(k) (k−1)
8: |ϕl ⟩ ← B(x − x)|ϕl ⟩. ▷ Update the walker
|ϕ(N ) ⟩ = B(xN −1 − xN −1 ) . . . B(x0 − x0 )|ϕ(0) ⟩, (A11) (k−1)
(k−1) ⟨ΨT |H|ϕl ⟩
9: El ← (k−1) ▷ Compute local energy
⟨ΨT |ϕl ⟩
and an importance function (k)
⟨ΨT |ϕl ⟩
10: θ ← arg (k−1) ▷ Compute the phase
⟨ΨT |B(x − x)|ϕ⟩ x·x− x2 ⟨ΨT |ϕl ⟩
I(x, x, ϕ) = e 2 . (A12) 11:
(k)
wl ← wl
(k−1)
×
(k−1)
I(x, x, ϕl ) ▷ Update weight
⟨ΨT |ϕ⟩ (N )
(N ) ⟨ΨT |H|ϕl ⟩
12: El ← (N ) ▷ Compute the local energy
By choosing ⟨ΨT |ϕl ⟩
PM (N ) (N )
wl El
√ 13: Output the energy E ← l
⟨ΨT |vγ |ϕ⟩ PM
wl
(N )
xγ = − ∆τ ⟨vγ ⟩, ⟨vγ ⟩ = , (A13) l
⟨ΨT |ϕ⟩
fluctuations
√ in the importance function to first order
Appendix B: Local energy evaluation with classical
in ∆τ is cancelled [33], leading to a more stable ran-
shadows
dom walk. The dynamic shift, usually referred to as
a force bias, modifies the sampling of the auxiliary
fields by shifting the center of the Gaussian distribu- The most time-consuming step in QC-AFQMC is the
tion p(x) according to the overlap ⟨ΨT |ϕ⟩. The impor- evaluation of the local energy. Here we discuss how
tance function can this task is performed in our implementation compati-
therefore be further
approximated as
ble with classical shadows. We first note that the local
I(x, x, ϕ) ≈ exp −∆τ (E loc − E0 ) , known as the local
energy formalism. Introducing the force bias into the energy numerator can be rewritten as
importance sampling would solve the instability if B(x) ⟨ΨT |H|ϕ⟩ = ⟨ΨT |Uϕ Uϕ† HUϕ |Φ0 ⟩
were real, which could be true in special cases. For a X X
general phase problem, however, it doesn’t lead to com- = ⟨ΨT |Uϕ |Φrp ⟩⟨Φrp |H̄|Φ0 ⟩ + ⟨ΨT |Uϕ |Φrs rs
pq ⟩⟨Φpq |H̄|Φ0 ⟩
plete control of the phase problem. pr pqsr
To further control the phase problem, the local energy X X
= ⟨ΨT |ϕrp ⟩⟨Φrp |H̄|Φ0 ⟩ + ⟨ΨT |ϕrs rs
pq ⟩⟨Φpq |H̄|Φ0 ⟩,
in the importance function has to be replaced by its real
pr pqsr
part, leading to θl = 0 and wl > 0 in Eq. 1. A phaseless
(B1)
approximation is also adopted as
where H̄ = Uϕ† HUϕ represents the rotated Hamiltonian
I(x, x, ϕl ) ≈ exp −∆τ (Re Elloc − E0 )
and |Φ0 ⟩, |Φrp ⟩(|Φrspq ⟩) denotes the Hartree-Fock state
⟨ΨT |B(x − x)|ϕl ⟩ and its single (double) excited counterpart. We also
× max 0, cos arg ,
⟨ΨT |ϕl ⟩ have |ϕrp ⟩ = Uϕ |Φrp ⟩. The second line in the above equa-
(A14) tion utilizes the resolution of identity and the fact that
Hamiltonian has only up to two-body interactions.
where abrupt phase changes during the random walk The Hamiltonian rotation can be done with O(n5 )
(k)
are now forbidden. The weight is updated as: wl ← operations. In the above equation, the overlap ampli-
(k−1) (k−1)
wl × I(x, x, ϕl ). Combining these techniques tudes ⟨ΨT |ϕrp ⟩, ⟨ΨT |ϕrspq ⟩ can be calculated with Match-
finally resolves the phase problem, and the whole algo- gate shadows each with complexity O(n4 ), as already
rithm is usually referred to as the phaseless AFQMC discussed in the main text. Each matrix element can
(ph-AFQMC). Note that the phaseless approximation be computed using the Slater-Condon rules with com-
also introduces a bias to the mixed energy estimator, plexity O(1). Due to the number of possible excited
which we seek to systematically improve by using quan- Slater determinants, which scales as O(n4 ), the scal-
tum computation in QC-AFQMC. ing of the evaluation of the local energy for each walker
The whole process of the AFQMC algorithm is sum- would therefore scale as O(n8 + n5 + n4 ) = O(n8 ) using
marized in Alg. 1, following the presentation in Ref. [20]. this approach.
15
We also note that there exist other ways to evaluate where g is an element in G, and the random unitaries
the local energy with Matchgate shadows. Sec.V C of U is a unitary representation of G. Applying Schur’s
Ref. [2] proposed an alternative way to evaluate the local lemma, and assuming no multiplicities in U lead to
energy using the Grassmann algebra. In this approach,
the Hamiltonian is first rewritten into the Majorana for-
X tr(MZ Πλ )
M= f λ Πλ , fλ = , (C2)
malism. Then the mixed estimator of each Majorana tr(Πλ )
λ∈RG
operator ⟨ΨT |γS |ϕ⟩ can be estimated in a similar fash-
ion to the overlap amplitude. This approach leads to a where RG are the irreducible representations (irreps) of
scaling between O(n9 ), due to the relevant matrices be- G. The super-operators Πλ ∈ L(L(Hn )) are orthogonal
ing non-invertible, making it slightly less favorable than projectors onto the irreducible subspaces Γλ , and fλ is
the previous approach. the eigenvalue associated with each orthogonal projec-
tor.
is a mixed state. For any computational basis state |b⟩, where Q belongs to the orthogonal group O(2n). Match-
for example, we have gate circuits form a continuous group Mn which is in
one-to-one correspondence with the orthogonal group
n
Y 1 O(2n), up to a global phase:
I − i(−1)bj γ2j−1 γ2j .
|b⟩⟨b| = (D5)
j=1
2 Mn = {UQ : Q ∈ O(2n)}. (D13)
Ref. [2] considered two distributions: i). “uniform” dis-
A Gaussian state can also be defined by its two-point
tribution over Mn , where uniformity is more precisely
correlations tr(ϱγµ γν ), which is also known as its co-
given by the normalized Haar measure µ on O(2n);
variance matrix. An n-qubit Gaussian state ρ has its
ii). uniform distribution over the discrete Borel group
covariance matrix defined as:
B(2n), which is the intersection of Mn and the n-qubit
i Clifford group Cln . The second consists of 2n×2n signed
(Cρ )µν := − tr([γµ , γν ]ρ) (D6) permutation matrices:
2
Mn ∩ Cln = {UQ : Q ∈ B(2n)}. (D14)
for µ, ν ∈ [2n], which is a 2n×2n antisymmetric matrix.
For any computational basis state |b⟩⟨b| as an example, (j) (j)
For j ∈ Z>0 , we use EMn and EMn ∩Cln to denote the
its covariance matrix is: j-fold twirl channels corresponding to the distributions
n over Mn and Mn ∩ Cln , respectively:
M 0 (−1)bj
C|b⟩ := . (D7)
Z
(−1)bj +1 0 (j)
EMn := dµ(Q)UQ⊗j
, (D15)
j=1
O(2n)
3. Matchgate channel
2. Matchgate 3-design
muting) Pauli strings: nected pairs, denote any such choice as [j̄]. The ‘con-
Nl− ζ
nected part’ of the string can be written as k=12 ıZj̄k ,
ζ−1 √
⃗ (ik+1 −1) Oi . where here we have used ı = −1 to avoid confusion
O
pik ,ik+1 Oik Z (ik +1) k+1
(D32)
k=1
with the index i.
k odd
Then explicitly compute the sum over these Majorana
Note that the qubits acted on by Z are not in the set strings (qubits that are not explicitly acted on in the
[i]. Next, consider the part of the string acting on [j]. following expressions are implicitly acted on with the
n−ζ
There are l− ζ (paired) locations to place the con- identity operator):
2
[j̄]∈(
n−ζ Oi1 ∈[Xi1 Yi1 ] k=1 η=1
l−
ζ ) k odd
2 Oi2 ∈[Xi2 ,Yi2 ]
...
Oiζ ∈[Xiζ ,Yiζ ]
ζ−1 l− ζ2
⃗ (ik+1 −1) Oi
O O
× pik ,ik+1 Oik Z ıZj̄η
(ik +1) k+1
k=1 η=1
k odd
l− ζ2
ζ−1 O D E
⃗ (ik+1 −1) Oi Zj̄ 1i 0...01i 0...0j̄
X X O
= 2−n 0ik ...0ik+1 ...0j̄η Oik Z (ik +1) k+1 η k k+1 η
[j̄]∈(
n−ζ
) Oi1 ∈[Xi1 Yi1 ] k=1 η=1
ζ
l−
2 Oi2 ∈[Xi2 ,Yi2 ] k odd
...
Oiζ ∈[Xiζ ,Yiζ ]
⃗ (ik+1 −1) Oi Zj̄
× Oik Z (ik +1) k+1 η
Oζ−1 l− ζ2
⃗ (ik+1 −1) Oi
X X O
= 2−n
0ik 0ik+1 Oik Oik+1 1ik 1ik+1 Oik Z Zj̄η
(ik +1) k+1
k=1 Oik ∈[Xik Yik ] ) η=1
n−ζ
k odd [j̄]∈( ζ
l−
Oik+1 ∈[Xik+1 ,Yik+1 ] 2
(D33)
where in the first term of the final line we have swapped the order of the sum and tensor product, such that we are
essentially considering ζ/2 copies of states with Hamming weight two. Using the results of Lemma 2, the first term
in the final line above evaluates to
ζ−1 ζ−1
⃗ (ik+1 −1) = 2ζ |i⟩ ⟨0| ⃗ (ik+1 −1)
O O
2
2 |1⟩ ⟨0|ik ⊗ |1⟩ ⟨0|ik+1 ⊗Z (ik +1) [i] [i] Z (ik +1) (D34)
k=1 k=1
k odd k odd
where we use |x⟩[i] to denote the qubits of computational basis state |x⟩ corresponding to the set [i]. Hence, denoting
20
Thus for ζ2 ≤ l ≤ n − ζ
2, P0,ζ [Π2l (|i⟩ ⟨0|)] = 1. Robust shadow protocol
n−ζ
2ζ−n l− ζ |i⟩ ⟨0|, as required.
2
In this part, we review the robust shadow estima-
tion [29], which serves as a basis for robust Matchgate
shadows. Eq. C2 is particularly useful when it comes to
Lemma 2. The sum noise, where the channel is modified by noise as
X
⃗ j−1 Oj
X tr(MZ ΛΠλ )
⟨0i 0j | Oi Oj |1i 1j ⟩ Oi Z i+1 (D36) Mf= feλ Πλ , feλ = , (E1)
tr(Πλ )
Oi ∈[Xi Yi ] λ∈RG
Oj ∈[Xj ,Yj ]
where feλ is the noisy eigenvalue, Λ denotes the
evaluates to noise channel, and we’ve assumed the noise is
gate-independent, time-stationary, and Markovian
⃗ j−1 (GTM) [29]. Within this assumption, the noise-free ex-
22 |1⟩ ⟨0|i ⊗ |1⟩ ⟨0|j ⊗ Z i+1 (D37) pectation values of tr(Oi ρ) could be retrieved by invert-
ing this noisy channel, if ρ is perfectly prepared on quan-
⃗ ab := Nb Zk .
where Z tum computers. To achieve that, {feλ } need to be de-
k=a
termined first, and different measurement schemes have
been designed for that according to the specific group
Proof. Explicitly computing the terms in the sum yields
G used. Such a protocol is known as robust shadow
estimation [29].
⃗ j−1 .
(Xi Xj − Yi Yj − ı(Xi Yj + Yi Xj )) Z (D38)
i+1
Using the decomposition X = (|0⟩ ⟨1| + |1⟩ ⟨0|) and Y = 2. Noisy Matchgate channel
(−ı |0⟩ ⟨1| + ı |1⟩ ⟨0|) yields
In this section we provide derivations for the recently
2 ⃗ j−1 introduced robust Matchgate shadows approach [31, 32].
2 |1⟩ ⟨0|i ⊗ |1⟩ ⟨0|j ⊗ Z i+1 (D39)
Our derivations closely follow previous work on Match-
gate randomized benchmarking [44], highlighting the
as required. close connection between these two topics.
The robust shadow protocol assumes the quantum
noise satisfies the GTM assumption, in which case the
measurement channel is only modified in the eigenval-
Appendix E: Robust Matchgate shadows ues associated with each projector. The idea is therefore
to determine those noisy coefficients feλ and apply the
In this section, we formulate the robust Matchgate noisy inverse channel M f−1 when computing the expec-
shadow protocol, providing an alternative derivation tation values. For the noisy case, we can generalize the
from those in Refs. [31, 32]. derivation for eigenvalues in the noiseless case into
21
−1
Tr [MZ ΛΠ2l ] 2n X X
fe2l = = ⟨⟨b|Λ|γS ⟩⟩⟨⟨γS |b⟩⟩
Tr [Π2l ] 2l n
S∈ [2n] b∈{0,1}
2l
−1 2
(−i)|T |
2n X X P
= √ (−1) j∈T bj ⟨⟨γpairs(T ) |Λ|γpairs(T ) ⟩⟩ (E2)
2l 2n
b∈{0,1}n T ∈ [n]
l
−1
2n X
= ⟨⟨γpairs(T ) |Λ|γpairs(T ) ⟩⟩.
2l
T ∈ [n]
l
To get fe2l , we need to determine ⟨⟨γpairs(T ) |Λ|γpairs(T ) ⟩⟩, nally, all qubits are measured in the Z-basis. By averag-
which corresponds to the diagonal elements of the Liou- ing over many random sequences, we obtain an estimate
ville representation of noise channel Λ in the |γS ⟩⟩ basis. ĉ2l (m) of the weighted average
Interestingly, a recent work on randomized bench-
marking [44] proposed an efficient way to character- 1 X X
c2l (m) = α2l (b, Q)p(b|Q, m),
ize the noise in the quantum hardware by defining and |B(2n)|m
{Q}∈B(2n) b∈{0,1}n
computing a set of Majorana fidelities λk using ran-
(E3)
dom Matchgate circuits. Here we reformulate some of
where Q = Qm . . . Q1 and p(b|Q, m) represents the prob-
their derivations using our established conventions and
ability of measurement outcome |b⟩. A correlation func-
get more insights into the interpretation and connection
tion α2l has also been defined as
between λk and fe2l .
The protocol consists of multiple rounds with vary- h
†
i
ing parameters k ∈ [2n] and circuit sequence lengths α2l (b, Q) = tr |b⟩⟨b|Π2l UQ ρ0 UQ . (E4)
m. Due to our focus on fermionic quantum simulations,
we’ll only discuss the case when k is even. Each round We estimate p(b|Q, m) from the measurement outcomes
starts with the preparation of the all-zero state |0⟩. This received from the quantum computer, whereas we com-
is followed by m Matchgate unitaries UQ1 , . . . , UQm , pute α2l (b, Q) analytically. Assuming the state prepa-
chosen uniformly and independently at random. Fi- ration and measurement (SPAM) are perfect, we have
1 X X
c2l (m) = α2l (b, Qm . . . Q1 )⟨⟨b|ΛUQm . . . ΛUQ1 |0⟩⟩
|B(2n)|m
{Q}∈B(2n) b∈{0,1}n
1 X X
⊗2 ⊗2 ⊗2
= ⟨⟨b ⊗ b|(Π2l ⊗ Λ)UQ (I ⊗ Λ)UQ . . . (I ⊗ Λ)UQ |0 ⊗ 0⟩⟩
|B(2n)|m m m−1 1
{Q}∈B(2n) b∈{0,1}n
" 2n
#m−1 2n (E5)
(2) (2) (2) (2)
X X X
⊗2
= ⟨⟨b |(Π2l ⊗ Λ) |Υk′ ⟩⟩⟨⟨Υk′ |(I ⊗ Λ) |Υk′ ⟩⟩⟨⟨Υk′ |0⊗2 ⟩⟩
b∈{0,1}n k′ =0 k′ =0
h im−1
(2) (2) (2) (2)
X
= ⟨⟨b⊗2 |(I ⊗ Λ)|Υk ⟩⟩ ⟨⟨Υk |(I ⊗ Λ)|Υk ⟩⟩ ⟨⟨Υk |0⊗2 ⟩⟩.
b∈{0,1}n
(2) (2)
Ref. [44] defined the term ⟨⟨Υk |(I ⊗ Λ)|Υk ⟩⟩ as the m = 1 and have
Majorana fidelities λ2l . We also note that the first term X (2) (2)
in the last line is exactly what we’ve derived in Eq. E2 c2l (1) = ⟨⟨b⊗2 |(I ⊗ Λ)|Υk ⟩⟩⟨⟨Υk |0⊗2 ⟩⟩
if we assume SPAM error-free. Therefore, we choose b∈{0,1}n
−1
2n X X 2
= ⟨⟨b|γS ⟩⟩⟨⟨b|Λ|γS ⟩⟩⟨⟨γS ′ |0⟩⟩
2l
b∈{0,1}n S,S ′ ∈ [2n]
2l
−1 X
1 2n n
= n ⟨⟨γpairs(T ) |Λ|γpairs(T ) ⟩⟩
2 2l l
T ∈ [n]
l
1 n e
= n f2l .
2 l
22
Table II: IBM Q Hanoi hardware information, with data taken from IBM Quantum Platform.
101
10 1
101
10 1
Mean Absolute Error (MAE)
101
10 1
101
10 1
101
10 1
101
10 1
102 103 104 102 103 104 102 103 104 102 103 104 102 103 104 102 103 104
Exp. RShadow (SP) Noiseless
# shadow circuits
Figure 9: The mean absolute error (MAE) of 36 overlap ratios randomly selected from the tested 120 overlap
ratios’ pool. For each overlap ratio, noisy results from IBM Hanoi, w/wo the robust shadow correction (orange
circle/green triangle) are all in good agreement with the noiseless reference value (blue cross). This verifies the
noise resilience discussed in the main text.
The NV center in diamond is simulated using 4 qubits Since the ground state of NV center is an equal super-
on the IonQ Aria trapped ion quantum computer. position of two Slater determinants [51], we can gener-
25
Table III: Data of the noisy eigenvalues of the 4-qubit with all-to-all qubit connectivity. The average readout
Matchgate channel, with experiments performed on fidelity is 99.55%. Detailed information about the de-
the IBM Q Hanoi device on Oct 1st, 2nd and 14th. vice can be found on the IonQ website.
These noisy eigenvalues are used for the robust shadow
protocol considering the state preparation error (SP).
Specifically, shadows of hydrogen bond distance 0.75 Å
are collected on Oct 14th; shadows for 2.25 Å are
obtained on Oct 1st while the shadows for the rest
three are obtained on Oct 2nd. 3. Noisy simulations of the water molecule
|0⟩ H
a. Quantum trial state
fSWAP(θ)
|0⟩
ÛQ
Since the water molecule has a perfect pairing of elec-
|0⟩
trons in the ground state, we consider a parameterized
|0⟩ quantum circuit employing a 4-qubit double-excitation
gate UDE (θ):
Figure 10: Quantum circuit for the performing
Matchgate shadows of NV center. The first Hadamard
θ
θ
and CNOT gate generates an equal superposition of UDE (θ)|1100⟩ = cos |1100⟩ − sin |0011⟩.
2 2
|0⟩ and |ΨI ⟩. They are followed by the effective
(H1)
UCCSD ansatz, and the Matchgate circuit UQ for
This gate is implemented between qubit indexed
shadow tomography.
[0, 1, 4, 5], [2, 3, 6, 7], [1, 2, 7, 4], [0, 3, 6, 5]. These in-
dices are chosen because they construct those Slater de-
terminants with the largest coefficients from a CASCI
calculation. This ansatz is then fed to a (noiseless) VQE
b. Hardware information calculation to produce a wavefunction with an energy
error of approximately 10 mHa at the equilibrium ge-
The IonQ Aria device has an average of 99.97% single- ometry. This state is then used as the trial state in the
qubit gate fidelity and 98.6% two-qubit gate fidelity, subsequent (noisy) QC-AFQMC simulations.
[1] W. J. Huggins, B. A. O’Gorman, N. C. Rubin, D. R. [3] X. Lin, X. Li, and X. Lin, A review on applications of
Reichman, R. Babbush, and J. Lee, Unbiasing fermionic computational methods in drug screening and design,
quantum monte carlo with a quantum computer, Nature Molecules 25, 1375 (2020).
603, 416 (2022). [4] N. P. King, W. Sheffler, M. R. Sawaya, B. S. Vollmar,
[2] K. Wan, W. J. Huggins, J. Lee, and R. Babbush, Match- J. P. Sumida, I. André, T. Gonen, T. O. Yeates, and
gate shadows for fermionic quantum simulation, Com- D. Baker, Computational design of self-assembling pro-
munications in Mathematical Physics , 1 (2023). tein nanomaterials with atomic level accuracy, Science
336, 1171 (2012).
26
[5] S. Hammes-Schiffer and G. Galli, Integration of theory [22] A. Montanaro and S. Stanisic, Accelerating varia-
and experiment in the modelling of heterogeneous elec- tional quantum monte carlo using the variational
trocatalysis, Nature Energy 6, 700 (2021). quantum eigensolver, arXiv preprint arXiv:2307.07719
[6] W. Kohn, A. D. Becke, and R. G. Parr, Density func- 10.48550/arXiv.2307.07719 (2023).
tional theory of electronic structure, Journal of Physical [23] G. Mazzola, Quantum computing for chemistry and
Chemistry 100, 12974 (1996). physics applications from a monte carlo perspective,
[7] T. Helgaker, P. Jorgensen, and J. Olsen, Molecular The Journal of Chemical Physics 160 (2024).
electronic-structure theory (John Wiley & Sons, 2013). [24] S. Kanno, H. Nakamura, T. Kobayashi, S. Gocho,
[8] W. Foulkes, L. Mitas, R. Needs, and G. Rajagopal, M. Hatanaka, N. Yamamoto, and Q. Gao, Quan-
Quantum monte carlo simulations of solids, Reviews of tum computing quantum monte carlo with hybrid ten-
Modern Physics 73, 33 (2001). sor network toward electronic structure calculations of
[9] B. M. Austin, D. Y. Zubarev, and W. A. Lester Jr, large-scale molecular and solid systems, arXiv preprint
Quantum monte carlo and related approaches, Chemical arXiv:2303.18095 10.48550/arXiv.2303.18095 (2023).
Reviews 112, 263 (2012). [25] M. Amsler, P. Deglmann, M. Degroote, M. P. Kaicher,
[10] M. Troyer and U.-J. Wiese, Computational complex- M. Kiser, M. Kühn, C. Kumar, A. Maier, G. Sam-
ity and fundamental limitations to fermionic quantum sonidze, A. Schroeder, et al., Classical and quan-
monte carlo simulations, Physical Review Letters 94, tum trial wave functions in auxiliary-field quantum
170201 (2005). monte carlo applied to oxygen allotropes and a cubr2
[11] A. Aspuru-Guzik, A. D. Dutoi, P. J. Love, and M. Head- model system, The Journal of Chemical Physics 159,
Gordon, Simulated quantum computation of molecular 10.1063/5.0146934 (2023).
energies, Science 309, 1704 (2005). [26] M. Kiser, A. Schroeder, G.-L. R. Anselmetti, C. Ku-
[12] A. Peruzzo, J. McClean, P. Shadbolt, M.-H. Yung, mar, N. Moll, M. Streif, and D. Vodola, Classical and
X.-Q. Zhou, P. J. Love, A. Aspuru-Guzik, and J. L. quantum cost of measurement strategies for quantum-
O’brien, A variational eigenvalue solver on a photonic enhanced auxiliary field quantum monte carlo, New
quantum processor, Nature Communications 5, 4213 Journal of Physics 26, 033022 (2024).
(2014). [27] A. Zhao, N. C. Rubin, and A. Miyake, Fermionic par-
[13] M. Motta, C. Sun, A. T. Tan, M. J. O’Rourke, E. Ye, tial tomography via classical shadows, Physical Review
A. J. Minnich, F. G. Brandao, and G. K.-L. Chan, De- Letters 127, 110504 (2021).
termining eigenstates and thermal states on a quantum [28] M. Scheurer, G.-L. R. Anselmetti, O. Oumarou,
computer using quantum imaginary time evolution, Na- C. Gogolin, and N. C. Rubin, Tailored and externally
ture Physics 16, 205 (2020). corrected coupled cluster with quantum inputs, arXiv
[14] J. P. Misiewicz and F. A. Evangelista, Implemen- preprint arXiv:2312.08110 10.48550/arXiv.2312.08110
tation of the projective quantum eigensolver on a (2023).
quantum computer, arXiv preprint arXiv:2310.04520 [29] S. Chen, W. Yu, P. Zeng, and S. T. Flammia, Robust
10.48550/arXiv.2310.04520 (2023). shadow estimation, PRX Quantum 2, 030348 (2021).
[15] U. Baek, D. Hait, J. Shee, O. Leimkuhler, W. J. Hug- [30] D. E. Koh and S. Grewal, Classical shadows with noise,
gins, T. F. Stetina, M. Head-Gordon, and K. B. Wha- Quantum 6, 776 (2022).
ley, Say no to optimization: A nonorthogonal quantum [31] A. Zhao and A. Miyake, Group-theoretic error mitiga-
eigensolver, PRX Quantum 4, 030307 (2023). tion enabled by classical shadows and symmetries, arXiv
[16] S. E. Smart and D. A. Mazziotti, Accelerated conver- preprint arXiv:2310.03071 10.48550/arXiv.2310.03071
gence of contracted quantum eigensolvers through a (2023).
quasi-second-order, locally parameterized optimization, [32] B. Wu and D. E. Koh, Error-mitigated fermionic clas-
Journal of Chemical Theory and Computation 18, 5286 sical shadows on noisy quantum devices, npj Quantum
(2022). Information 10, 39 (2024).
[17] H. R. Grimsley, G. S. Barron, E. Barnes, S. E. [33] M. Motta and S. Zhang, Ab initio computations of
Economou, and N. J. Mayhall, Adaptive, problem- molecular systems by the auxiliary-field quantum monte
tailored variational quantum eigensolver mitigates carlo method, Wiley Interdisciplinary Reviews: Compu-
rough parameter landscapes and barren plateaus, npj tational Molecular Science 8, e1364 (2018).
Quantum Information 9, 19 (2023). [34] S. Zhang, 15 auxiliary-field quantum monte carlo for
[18] S. Wang, E. Fontana, M. Cerezo, K. Sharma, A. Sone, correlated electron systems, Emergent Phenomena in
L. Cincio, and P. J. Coles, Noise-induced barren Correlated Matter (2013).
plateaus in variational quantum algorithms, Nature [35] A. Mahajan and S. Sharma, Taming the sign problem
Communications 12, 6961 (2021). in auxiliary-field quantum monte carlo using accurate
[19] H.-Y. Huang, R. Kueng, and J. Preskill, Predicting wave functions, Journal of Chemical Theory and Com-
many properties of a quantum system from very few putation 17, 4786 (2021).
measurements, Nature Physics 16, 1050 (2020). [36] J. R. McClean, J. Romero, R. Babbush, and A. Aspuru-
[20] X. Xu and Y. Li, Quantum-assisted monte carlo algo- Guzik, The theory of variational hybrid quantum-
rithms for fermions, Quantum 7, 1072 (2023). classical algorithms, New Journal of Physics 18, 023023
[21] Y. Zhang, Y. Huang, J. Sun, D. Lv, and X. Yuan, Quan- (2016).
tum computing quantum monte carlo, arXiv preprint [37] A. K. Ekert, C. M. Alves, D. K. Oi, M. Horodecki,
arXiv:2206.10431 10.48550/arXiv.2206.10431 (2022). P. Horodecki, and L. C. Kwek, Direct estimations of lin-
ear and nonlinear functionals of a quantum state, Phys-
27
ical Review Letters 88, 217901 (2002). [55] H. Q. Pham, R. Ouyang, and D. Lv, Scalable
[38] J. Helsen and M. Walter, Thrifty shadow estimation: quantum monte carlo with direct-product trial
Reusing quantum circuits and bounding tails, Physical wave functions, arXiv preprint arXiv:2306.15186
Review Letters 131, 240602 (2023). 10.48550/arXiv.2306.15186 (2023).
[39] S. Aaronson and D. Gottesman, Improved simulation of [56] Y. Chen, L. Zhang, W. E, and R. Car, Hybrid aux-
stabilizer circuits, Physical Review A 70, 052328 (2004). iliary field quantum monte carlo for molecular sys-
[40] E. Tang, A quantum-inspired classical algorithm for rec- tems, Journal of Chemical Theory and Computation
ommendation systems, in Proceedings of the 51st an- 10.1021/acs.jctc.3c00038 (2023).
nual ACM SIGACT symposium on theory of computing [57] J. L. Weber, H. Vuong, R. A. Friesner, and D. R.
(2019) pp. 217–228. Reichman, The design of new practical constraints in
[41] G. H. Low, Classical shadows of fermions with parti- auxiliary-field quantum monte carlo, arXiv preprint
cle number symmetry, arXiv preprint arXiv:2208.08964 arXiv:2306.09207 10.48550/arXiv.2306.09207 (2023).
10.48550/arXiv.2208.08964 (2022). [58] J. Shee, J. L. Weber, D. R. Reichman, R. A. Fries-
[42] B. O’Gorman, Fermionic tomography and learning, ner, and S. Zhang, On the potentially transformative
arXiv preprint arXiv:2207.14787 (2022). role of auxiliary-field quantum monte carlo in quantum
[43] E. Wigner and P. Jordan, Über das paulische äquivalen- chemistry: A highly accurate method for transition met-
zverbot, Z. Phys 47, 46 (1928). als and beyond, The Journal of Chemical Physics 158,
[44] J. Helsen, S. Nezami, M. Reagor, and M. Walter, Match- 10.1063/5.0134009 (2023).
gate benchmarking: Scalable benchmarking of a con- [59] A. Mahajan and S. Sharma, Efficient local energy eval-
tinuous family of many-qubit gates, Quantum 6, 657 uation for multi-slater wave functions in orbital space
(2022). quantum monte carlo, The Journal of Chemical Physics
[45] T. E. O’Brien, S. Polla, N. C. Rubin, W. J. Huggins, 153, https://doi.org/10.1063/5.0025055 (2020).
S. McArdle, S. Boixo, J. R. McClean, and R. Babbush, [60] B. Posey, C. Gropp, B. Wilson, B. McGeachie, S. Padhi,
Error mitigation via verified phase estimation, PRX A. Herzog, and A. Apon, Addressing the challenges of
Quantum 2, 020317 (2021). executing a massive computational cluster in the cloud,
[46] Y. Zhou and Q. Liu, Performance analysis of multi-shot in 2018 18th IEEE/ACM International Symposium on
shadow estimation, Quantum 7, 1044 (2023). Cluster, Cloud and Grid Computing (CCGRID) (2018)
[47] S. Hsieh, C. Zu, T. Mittiga, T. Smart, F. Machado, pp. 253–262.
B. Kobrin, T. Höhn, N. Rui, et al., Imaging stress and [61] J. F. Gonthier, M. D. Radin, C. Buda, E. J. Doskocil,
magnetism at high pressures using a nanoscale quantum C. M. Abuan, and J. Romero, Measurements as a road-
sensor, Science 366, 1349 (2019). block to near-term practical quantum advantage in
[48] P. Bhattacharyya, W. Chen, X. Huang, S. Chatterjee, chemistry: Resource analysis, Phys. Rev. Res. 4, 033154
B. Huang, B. Kobrin, Y. Lyu, T. Smart, M. Block, (2022).
E. Wang, et al., Imaging the meissner effect in hy- [62] S. McArdle, S. Endo, A. Aspuru-Guzik, S. C. Benjamin,
dride superconductors using quantum sensors, Nature and X. Yuan, Quantum computational chemistry, Re-
, 1 (2024). views of Modern Physics 92, 015003 (2020).
[49] N. Sheng, C. Vorwerk, M. Govoni, and G. Galli, Green’s [63] J. Goings, L. Zhao, J. Jakowski, T. Morris, and
function formulation of quantum defect embedding the- R. Pooser, Molecular symmetry in vqe: A dual approach
ory, Journal of Chemical Theory and Computation 18, for trapped-ion simulations of benzene, in 2023 IEEE
3512 (2022). International Conference on Quantum Computing and
[50] B. Huang, N. Sheng, M. Govoni, and G. Galli, Quantum Engineering (QCE), Vol. 2 (IEEE, 2023) pp. 76–82.
simulations of fermionic hamiltonians with efficient en- [64] R. J. Anderson, T. Shiozaki, and G. H. Booth, Effi-
coding and ansatz schemes, Journal of Chemical Theory cient and stochastic multireference perturbation theory
and Computation 19, 1487 (2023). for large active spaces within a full configuration inter-
[51] B. Huang, M. Govoni, and G. Galli, Simulating the elec- action quantum monte carlo framework, The Journal of
tronic structure of spin defects on quantum computers, Chemical Physics 152, 10.1063/1.5140086 (2020).
PRX Quantum 3, 010339 (2022). [65] M. Reiher, N. Wiebe, K. M. Svore, D. Wecker, and
[52] I. G. Ryabinkin, T.-C. Yen, S. N. Genin, and A. F. Iz- M. Troyer, Elucidating reaction mechanisms on quan-
maylov, Qubit coupled cluster method: a systematic ap- tum computers, Proceedings of the National Academy
proach to quantum chemistry on a quantum computer, of Sciences 114, 7555 (2017).
Journal of Chemical Theory and Computation 14, 6317 [66] G. Mazzola and G. Carleo, Exponential challenges
(2018). in unbiasing quantum monte carlo algorithms with
[53] C. Hempel, C. Maier, J. Romero, J. McClean, T. Monz, quantum computers, arXiv preprint arXiv:2205.09203
H. Shen, P. Jurcevic, B. P. Lanyon, P. Love, R. Bab- 10.48550/arXiv.2205.09203 (2022).
bush, et al., Quantum chemistry calculations on a [67] J. Hubbard, Calculation of partition functions, Physical
trapped-ion quantum simulator, Physical Review X 8, Review Letters 3, 77 (1959).
031022 (2018). [68] R. Stratonovich, On a method of calculating quantum
[54] S. Lee, J. Lee, H. Zhai, Y. Tong, A. M. Dalzell, A. Ku- distribution functions, in Soviet Physics Doklady, Vol. 2
mar, P. Helms, J. Gray, Z.-H. Cui, W. Liu, et al., Eval- (1957) p. 416.
uating the evidence for exponential quantum advantage [69] H. F. Trotter, On the product of semi-groups of opera-
in ground-state quantum chemistry, Nature Communi- tors, Proceedings of the American Mathematical Society
cations 14, 1952 (2023). 10, 545 (1959).
28
[70] M. Suzuki, Relationship between d-dimensional quan- scalable quantum computing on a noisy superconduct-
tal spin systems and (d+1)-dimensional ising systems: ing quantum processor, Phys. Rev. X 11, 041039 (2021).
Equivalence, critical exponents and systematic approx- [75] A. Zhao, Fermionic classical shad-
imants of the partition function and spin correlations, ows, https://github.com/zhao-andrew/
Progress of theoretical physics 56, 1454 (1976). symmetry-adjusted-classical-shadows (2023).
[71] D. J. Thouless, Stability conditions and nuclear rota- [76] Q. Sun and G. K.-L. Chan, Quantum embedding theo-
tions in the hartree-fock theory, Nuclear Physics 21, ries, Accounts of Chemical Research 49, 2705 (2016).
225 (1960). [77] H. Ma, N. Sheng, M. Govoni, and G. Galli, Quantum
[72] D. Thouless, Vibrational states of nuclei in the random embedding theory for strongly correlated states in ma-
phase approximation, Nuclear Physics 22, 78 (1961). terials, Journal of Chemical Theory and Computation
[73] H. Zhu, Multiqubit clifford groups are unitary 3-designs, 17, 2116 (2021).
Physical Review A 96, 062336 (2017). [78] V. Bergholm, J. Izaac, M. Schuld, C. Gogolin,
[74] A. Hashim, R. K. Naik, A. Morvan, J.-L. Ville, S. Ahmed, V. Ajith, M. S. Alam, G. Alonso-Linaje,
B. Mitchell, J. M. Kreikebaum, M. Davis, E. Smith, B. AkashNarayanan, A. Asadi, et al., Pennylane: Auto-
C. Iancu, K. P. O’Brien, I. Hincks, J. J. Wallman, matic differentiation of hybrid quantum-classical com-
J. Emerson, and I. Siddiqi, Randomized compiling for putations, arXiv preprint arXiv:1811.04968 (2018).