Generating approximate ground states of strongly correlated quantum many-body systems through quantum imaginary time evolution

Michael Kaicher; Florian Dommert; Christopher Wever; Maximilian Amsler; Michael Kühn; QUTAC Material Science Working Group

doi:10.1088/2399-6528/ade75c

1. Introduction

Finding the ground state of a strongly correlated quantum many body system is one of the most fundamental problems in physics and related fields, such as quantum chemistry. Many approaches exist to tackle the ground state problem for strongly correlated quantum many-body systems on classical computers. However, in many instances of strong correlation, the computational cost becomes too demanding. This can be traced back to the fact that state-of-the art classical methods for tackling quantum chemistry or ground states of lattice systems such as coupled cluster [1] or Density Matrix Renormalization Group (DMRG) theory [2, 3] display a computational cost that scales as a large polynomial of the system size or require a bond dimension that grows exponentially with system size. A natural question to ask is whether a quantum computer could solve for the ground state of a quantum many-body problem Hamiltonian in less computation time and/or higher accuracy. Unfortunately, finding the ground state of a quantum many-body Hamiltonian is considered to be hard even for an envisioned error-free quantum computer. More precisely, the problem has been shown to lie in the complexity class QMA-complete, which means that every algorithm that aims to solve the problem must be either heuristic in nature—and therefore not possess a performance guarantee—, or its cost must scale exponentially in at least one parameter that describes the problem [4]. However, most quantum algorithms designed to probe properties of the ground state only require access to a state ∣Ψ〉 that approximates the ground state ∣Ψ_gs〉 within some error 1 − ∣〈Ψ∣Ψ_gs〉∣ < ε, where 0 < ε ≪ 1 [5]. In literature, the approximate state ∣Ψ〉, which can be generated by a quantum computer, is usually called initial state or trial state. To avoid ambiguity we will use ∣Ψ_init〉 to denote an approximation to the ground state which is obtained from a classical algorithm (for instance, the Hartree–Fock ground state). We will consider two algorithms aimed at iteratively improving the overlap of a given initial state on a quantum computer. The first algorithm is a trotterized version of Imaginary Time Evolution (ITE), while the second algorithm was introduced in [6] and is called Quantum Imaginary Time Evolution (QITE). QITE is designed to approximate the non-unitary operators of trotterized ITE by unitary operators. Both, ITE and QITE require an initial state with a non-zero support of the ground state in order to be able to approximate the latter.

The computational cost of the currently most advanced quantum algorithms to probe ground state properties depend inversely on powers of the ground state overlap of the initial state being used [7]. While this overlap falls off exponentially with increasing system size [8], computations of real systems are carried out on finite system sizes, where sufficiently large overlaps might still be reached for states that can be implemented efficiently on a quantum computer [9]. However, for some systems that are e.g. of interest to the chemical industry, it is still an open question whether a sufficiently large overlap can be reached with existing methods [10, 11].

In this work, we put a lot of emphasis on providing a classical initial state ∣Ψ_init〉 (i.e. a solution from a classical algorithm that can be translated into a wave function) for the QITE and ITE simulations. We show how to obtain an initial state ∣Ψ_init〉 from the family of Fermionic Gaussian States (FGS) [12–14], which can be efficiently generated on a quantum computer [15–17] and efficiently computed on a classical computer for lattice Hamiltonians in arbitrary dimensions. This is achieved by performing an ITE of the respective covariance matrix [18] which fully describes a FGS. Since this classical algorithm can be applied to any spin and fermionic Hamiltonian, this part of our work may be of independent interest to the reader.

In order to be able to study systems where classical methods become unreliable, too expensive and slow, or inaccurate, much effort has been put in finding superior algorithms that can be executed on quantum computers. Various approaches for approximating the ground state have been suggested, such as quantum adiabatic state preparation [19], variational algorithms [20], or algorithms that generate the ground state by a projector method on early fault tolerant quantum computers [21], to name but a few. While the former two approaches are heuristic, the latter is a deterministic approach, provided access to a good initial state. Due to the hardness of the ground state problem, all quantum approaches have their drawbacks. For instance, adiabatic state preparation is heuristic in nature since it requires a protected energy gap, which is not guaranteed for arbitrary initial Hamiltonians and adiabatic paths, and can in some instances display a dependence between the ground state of the initial and final Hamiltonian [22]. On the other hand, the energy landscape of the Variational Quantum Eigensolver (VQE) [23] can be plagued by extremely small gradients for sufficiently deep circuits, a phenomenon known as the appearance of barren plateaus [24]. While fault tolerant algorithms such as [21] are not suffering from the above mentioned problems, they have an intrinsic dependence on the energy gap of the system Hamiltonian and on the overlap γ = ∣〈Ψ∣Ψ_gs〉∣ of a provided initial state ∣Ψ〉. The larger this overlap is, the faster the quantum computation will be at generating an approximate ground state of a given desired accuracy, or extract properties from it [7, 25–27].

In this work, we numerically investigate the capability of QITE to replicate the trotterized ITE algorithm. To this end, we apply QITE and ITE to initial states for various lattice and molecular systems and compare the resulting evolved states. We call this procedure algorithmic benchmark.

We add to the body of QITE literature by formulating a fermionic version of QITE and study if it could serve as a way of generating improved initial states ∣Ψ〉 when applied to a fermionic lattice model, as well as the Active Space (AS) of molecular electronic structure Hamiltonians.

Our focus on obtaining proper initial states for the system Hamiltonians we study helps to be able get a more objective algorithmic performance evaluation of (Q)ITE. More precisely, the choice of a poor initial state will generally lead to a large improvement in the quality of the produced state, but oftentimes the majority of this improvement can be traced back to a non-optimized initial state choice. Therefore, we initialize our systems in an optimized fermionic Gaussian state (which, among others, includes the Hartree–Fock state), that treats the interactions of the system in a mean-field approximation and use this state as a benchmark to see how much QITE can improve over a mean-field solution.

The paper is structured as follows. In section 2 we cover the main aspects of the ITE and QITE algorithms. section 3 introduces the details of the studied lattice and quantum chemistry systems. section 4 describes how we obtain the initial states for the various systems, which is followed by the discussion of our numerical results in section 5, and a summary and outlook in section 6.

2. Methods

This section introduces the QITE algorithm. After establishing the required structure of the Hamiltonian in section 2.1, we discuss the ITE algorithm in section 2.2, followed by a detailed introduction to the QITE algorithm in section 2.3. The section concludes with an introduction of the mutual information in section 2.4, which is used within a subroutine of QITE applied to molecular electronic structure systems.

2.1. Hamiltonian terms

In this work, we consider Hamiltonians which can describe either interacting spins or fermions and can be written as

$\begin{eqnarray}&&\hat{H}=\displaystyle \sum _{j=1}^{\tilde{m}}\hat{h}[j]+\alpha {\bf{1}},\end{eqnarray} \tag{ 1 }$

where 1 denotes the identity operator, $\hat{h}[j]$ is a Hermitian operator which acts non-trivially (i.e. not as the identity) on at most k ≤ L spins or fermionic sites, and $\alpha \in {\mathbb{R}}$ is a constant, while L describes the total number of spins or fermionic sites/orbitals. Examples of physical systems that are modelled by the Hamiltonian in equation (1) include fermionic and spin lattice systems describing e.g. condensed matter systems, as well as molecular electronic structure Hamiltonians. We note, that in principle one can map a fermionic system to a spin system (and vice versa) using various mappings [28, 29], but only a selected few mappings lead to a locality in the Hamiltonian terms $\hat{h}[j]$ that does not scale with the system size [30]. In this work, we assume that equation (1) describes the Hamiltonian that results after a potential fermion-to-qubit or qubit-to-fermion mapping was made, and therefore, for instance when studying a fermionic system, $\hat{h}[j]$ could equally well describe a qubit or a fermionic operator, depending whether a fermion-to-qubit mapping was applied or not. If $\hat{h}[j]$ describes a spin operator, we will apply the spin-QITE algorithm that we introduce in section 2.3.3, whereas if $\hat{h}[j]$ describes a fermionic operator, we will use the fermionic-QITE algorithm that is introduced in section 2.3.4.

2.2. Trotterized ITE

The solution of the imaginary time Schrödinger equation $\frac{d}{d\tau }| {\rm{\Psi }}(\tau )\rangle =-(\hat{H}-\langle {\rm{\Psi }}(\tau )| \hat{H}| {\rm{\Psi }}(\tau )\rangle )| {\rm{\Psi }}(\tau )\rangle$ with the initial condition ∣Ψ(0)〉 = ∣Ψ_init〉 for time-independent Hamiltonians is given by [31]

$\begin{eqnarray}&&| {\rm{\Psi }}(\tau )\rangle =\frac{{e}^{-\tau \hat{H}}| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle }{\sqrt{\langle {{\rm{\Psi }}}_{\,\rm{init}\,}| {e}^{-2\tau \hat{H}}| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle }}.\end{eqnarray} \tag{ 2 }$

Provided an initial state ∣Ψ_init〉 with overlap γ_init = ∣〈Ψ_init∣Ψ_gs〉∣ ≠ 0, ITE will lead to the ground state in the infinite time limit, $| {{\rm{\Psi }}}_{\,\rm{gs}\,}\rangle =\mathop{\mathrm{lim}}\limits_{\tau \to \infty }| {\rm{\Psi }}(\tau )\rangle$ . Since in practice an approximation to the ground state suffices, we only have to evolve up to a finite value

$\begin{eqnarray}&&{\tau }_{\eta }={ \mathcal O }\left(\frac{1}{{\rm{\Delta }}}\mathrm{log}\left(\frac{1}{{\gamma }_{\,\rm{init}\,}\eta }\right)\right)\end{eqnarray} \tag{ 3 }$

in order to assure that the overlap γ of the evolved state is bounded by $1-\gamma \lt \frac{{\eta }^{2}}{2}$ for a desired precision η [25]. Here, Δ ≤ ∣E₁ − E₀∣ is a lower bound on the energy gap of the system Hamiltonian, with E₀ (E₁) denoting the exact ground (first excited) state energy. From a physical point of view, ITE can be viewed as a cooling process where τ → ∞ corresponds to the zero temperature limit, where the system will be found in its ground state. Thus, τ⁻¹ can be interpreted as a temperature, and correspondingly ITE is a process which lowers the overall entropy of the system. In the context of quantum chemistry systems, it turns out that in practice one empirically finds that the properties of the ground state do not differ much from its low temperature (i.e. large, but finite τ) canonical density operator ${e}^{-\tau \hat{H}}$ and that using a large but finite τ which is independent of the system size L is oftentimes sufficient. This observation is consistent with the concept of locality in quantum chemistry Hamiltonians, and locality is also what allows us to get an accurate description of the ground state of the electronic structure Hamiltonian in the full basis by only solving the ground state problem within a small sub-region (the so-called active space) accurately [32].

We can divide the total ITE propagator into n-many small time steps Δτ = τ/n, such that the imaginary time propagator can be expressed as

$\begin{eqnarray}&&{e}^{-\tau \hat{H}}={\left({e}^{-{\rm{\Delta }}\tau \hat{H}}\right)}^{n}.\end{eqnarray} \tag{ 4 }$

We can split the ITE propagator ${e}^{-{\rm{\Delta }}\tau \hat{H}}$ of the Hamiltonian in equation (1) into a product of ITE propagators ${e}^{-\frac{{\rm{\Delta }}\tau }{2}\hat{h}[j]}$ of the individual Hermitian operators $\hat{h}[j]$ using a second-order Trotterization,

$\begin{eqnarray}\begin{array}{rcl}{e}^{-\tau \hat{H}} & \approx & {\left({e}^{-\frac{{\rm{\Delta }}\tau }{2}\hat{h}[1]}\cdots {e}^{-\frac{{\rm{\Delta }}\tau }{2}\hat{h}[\tilde{m}]}{e}^{-\frac{{\rm{\Delta }}\tau }{2}\hat{h}[\tilde{m}]}\cdots {e}^{-\frac{{\rm{\Delta }}\tau }{2}\hat{h}[1]}\right)}^{n}+{ \mathcal O }({\rm{\Delta }}{\tau }^{2})\\ & = & \displaystyle \prod _{l=1}^{mn}{e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}+{ \mathcal O }({\rm{\Delta }}{\tau }^{2}),\end{array}\end{eqnarray} \tag{ 5 }$

where m denotes the number of Hamiltonian terms which arise from the Trotterization of the $\tilde{m}$ terms from the untrotterized Hamiltonian of equation (1). A second-order Trotterization gives $m=2\tilde{m}$ ⁶ [33]. Above, we have introduced an index function f(l) with l ∈ [mn] that ensures the correct prefactor for the step size, as well as the correct Hamiltonian term in equation (1) and it keeps track of the effect of the Trotterization of equation (5). For instance, in the second-order Trotterization of equation (5) we use, $\hat{h}[f(\tilde{m}+1)]=\frac{1}{2}\hat{h}[\tilde{m}]$ . From this point on, we will mostly use $\hat{h}[l]$ instead of $\hat{h}[f(l)]$ for readability.

We define the trotterized ITE as

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{l}\rangle =\frac{{e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}| {{\rm{\Psi }}}_{l-1}\rangle }{\sqrt{\langle {{\rm{\Psi }}}_{l-1}| {e}^{-2{\rm{\Delta }}\tau \hat{h}[f(l)]}| {{\rm{\Psi }}}_{l-1}\rangle }}.\end{eqnarray} \tag{ 6 }$

Since in general the individual terms $\hat{h}[j]$ do not commute with each other, the Trotterization of equation (5) introduces an error in the imaginary time propagator which in our case scales as ${ \mathcal O }({\rm{\Delta }}{\tau }^{2})$ . This leads to the trotterized ITE state in equation (6) for τ ≥ τ_η converging to a state whose energy expectation value will lie above the ground state energy,

$\begin{eqnarray}&&{E}_{\,\rm{ITE}\,}=\mathop{\mathrm{lim}}\limits_{l\to \infty }\langle {{\rm{\Psi }}}_{l}| \hat{H}| {{\rm{\Psi }}}_{l}\rangle \geqslant {E}_{0},\end{eqnarray} \tag{ 7 }$

where either smaller step sizes, or a larger Trotter order at fixed step size will lead to E_ITE getting closer to E₀.

2.3. The QITE algorithm

2.3.1. QITE workflow

We present an overview of the QITE workflow in figure 1. One first considers the formalized problem introduced in section 2.1, a Hamiltonian $\hat{H}$ that is composed of $\tilde{m}$ terms $\hat{h}\left[j\right]$ with $j\in [\tilde{m}]$ . The goal of QITE is to imitate the ITE, which requires the preparation of an initial state ∣Ψ_init〉 which is discussed in section 4. The term $\hat{h}\left[j\right]$ can either describe a linear combinations of Pauli strings (see section 2.3.3) or a linear combination of fermionic operators (see section 2.3.4). Depending on whether the underlying Hamiltonian describes a lattice system or a molecular electronic structure problem, we perform an operator expansion on a set D_j of spins, fermionic sites or orbitals. The choice of D_j is crucial for the success of QITE and is guided by the Manhattan distance when we deal with a lattice system (see Section IIC2 a), and by the mutual information between orbitals when we investigate molecular electronic structure systems (see section 2.4), respectively. Finally, section 2.3.2 gives a short introduction to the main idea of the QITE algorithm, with a more detailed discussion on how this is put to practice in spin and fermionic systems in section 2.3.3 and 2.3.4, respectively.

Figure 1. Refer to the following caption and surrounding text. — **Figure 1.** The scheme for the QITE workflow that is to be read from top to bottom shows the main elements of the QITE algorithm. The color-coding of the elements within the blue boxes highlight their entailment on the steps later on. The arrows in the second box indicate the kind of system that may arise from the composition of the Hamiltonian $\hat{H}$ . Combining these pieces of information guides through the series of choices to be made when implementing the QITE algorithm.
Download figure:
Standard image High-resolution image

Code details We implemented this workflow (see figure 1) using Python. Our calculations rely on state vector simulations of the QITE algorithm on classical hardware. For our initial state preparation, we calculated the Pfaffian of skew-symmetric matrices using the pfapack library[34, 35]. Molecular electronic structure Hamiltonians for different basis sets as well as the studied Fermi-Hubbard system and their corresponding approximate ground states were determined using PySCF 2.2.1. [36–38]. The OpenFermion 1.5.1 [39] library was used for generating the spin and fermionic operators, as well as to translate the obtained covariance matrix of a pure fermionic Gaussian state into a state vector in its Hilbert space representation through the relations we derive in Appendix A.4. The system of linear equations to derive the operator expansion has been solved by a Conjugate Gradient iteration of the SciPy 1.10.1. [40] library. Our calculations of the mutual information were supported by the TeNPy 0.10.0 [41] library, which was used to compute the required reduced density matrices of the ground state more efficiently, by using the Matrix Product State (MPS) representation of the latter for a large bond dimension.

2.3.2. QITE introduction

The main idea of the QITE algorithm is to approximate the trotterized imaginary time propagator ${e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}$ by a unitary operator ${\hat{U}}_{l}$ at each iteration step, without the need of auxiliary qubits. More specifically, for a fixed discretization step size Δτ and step l ∈ [mn], the QITE algorithm tries to find the unitary ${\hat{U}}_{l}$ which approximates

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{l+1}\rangle \equiv \frac{1}{\sqrt{{c}_{l}}}{e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}| {{\rm{\Psi }}}_{l}\rangle \approx {\hat{U}}_{l}| {{\rm{\Psi }}}_{l}\rangle ,\end{eqnarray} \tag{ 8 }$

by minimizing the distance

$\begin{eqnarray}&&\mathop{\min }\limits_{{U}_{l}}\unicode{x02016}\frac{1}{\sqrt{{c}_{l}}}{e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}| {{\rm{\Psi }}}_{l}\rangle -{\hat{U}}_{l}| {{\rm{\Psi }}}_{l}\rangle \unicode{x02016},\end{eqnarray} \tag{ 9 }$

where $\parallel | \varphi \rangle \parallel =\sqrt{\langle \varphi | \varphi \rangle }$ is the Euclidean norm of a vector ∣φ〉 and

$\begin{eqnarray}&&{c}_{l}=\left|\langle {{\rm{\Psi }}}_{l}| {e}^{-2{\rm{\Delta }}\tau \hat{h}[f(l)]}| {{\rm{\Psi }}}_{l}\rangle \right|\end{eqnarray} \tag{ 10 }$

is a normalization constant. The algorithm is initialized in the state ∣Ψ_l=0〉 = ∣Ψ_init〉 which must have non-zero support on the desired ground state, γ_init > 0. In the following, we will describe how the unitary U_l can be found at each iteration step l, following [6].

Any unitary operator can be written as

$\begin{eqnarray}&&{\hat{U}}_{l}={e}^{-i{\rm{\Delta }}\tau \hat{A}[l]},\end{eqnarray} \tag{ 11 }$

where $\hat{A}[l]$ is a Hermitian operator. Therefore, the task of finding ${\hat{U}}_{l}$ is equivalent to that of finding $\hat{A}[l]$ . We will assume sufficiently small time steps, such that ${\hat{U}}_{l}={\bf{1}}-i{\rm{\Delta }}\tau \hat{A}[l]+{ \mathcal O }({\rm{\Delta }}{\tau }^{2})$ . While the computational cost of finding a general unitary operator acting on L qubits or fermionic modes will scale as ${ \mathcal O }(\exp (L))$ , the central finding of [6] is that for lattice systems with a finite correlation length ${ \mathcal C }$ , the cost of finding the unitary ${\hat{U}}_{l}$ as well as approximating this unitary in terms of elementary quantum gates scales only as ${ \mathcal O }(\exp ({ \mathcal C }))$ . Importantly, for systems with a finite correlation length, while still exponential, this is a finite value which does not increase as one increases the size of the system.

a. Support and domain We define the supportS_j of an operator $\hat{h}[j]$ as the set of indices it acts on non-trivially (i.e. not as the identity), where $j\in [\tilde{m}]$ . For spin operators these indices correspond to the spin indices, while for fermionic systems they correspond to fermionic sites or orbitals. The domainD_j similarly defines a set of indices which includes the support S_j, but also includes other indices that are selected based on the specifics of the studied system. In order to describe which indices to include in the domain D_j, one introduces a parameter ν which sets the size of the set (for d-dimensional lattices, D_j ∝ ν^d, while for molecular systems ν will set the number of additional orbitals added to the support). Depending on the studied system, the indices will correspond to spin or fermionic sites, or orbitals.

In figure 2 we give examples of how we choose the support and domain when $\hat{h}[j]$ describes a spin- or fermionic lattice Hamiltonian, or a molecular electronic structure Hamiltonian. Figure 2(a) describes a term $\hat{h}[j]$ which is part of a 18 spins (i.e. 18 qubits) Hamiltonian whose underlying geometry describes a hexagonal spin lattice. For simplicity, we here assume that there are no periodic boundary conditions, contrary to the systems studied later on. We further assume that the interaction strength decreases with increasing distance between two sites, which justifies using the Manhattan distance as a means of picking the domain. In the provided example, the support S_j = {8, 14} (highlighted in yellow) is of size ∣S_j∣ = 2 and the domain is set by the Manhattan distance ν = 1 (ν = 2). In this case, the domain D_j is given by all yellow and purple (and pink) colours, leading to a domain ∣D_j∣ = 7 (∣D_j∣ = 13). Figure 2(b) describes an example where the term $\hat{h}[j]$ is part of a Hamiltonian describing a fermionic lattice of L = 8 sites with short-range interactions, each site containing two possible spin configurations α or β, thus a total of 16 fermionic modes or qubits. Here, the support of $\hat{h}[j]$ contains sites S_j = {3, 4}, which corresponds to 4 fermionic modes or qubits. For the fermionic lattice, ν describes the number of additional sites added, where each site contains two fermionic modes. The domain for ν = 1 contains three sites, but it is not clear whether to pick site 2 or site 5. In this case, we pick one of the two randomly, thus a potential outcome could be D_j = {3, 4, 5}. For ν = 2 (ν = 4), the resulting domain would then include all yellow and purple (and pink) sites.

Figure 2. Refer to the following caption and surrounding text. — **Figure 2.** Examples for the support S_j—highlighted in yellow— and the domain D_j—highlighted in yellow and purple (and pink)—for various ν for a given Hamiltonian in equation (1). The Hamiltonian term $\hat{h}[j]$ describes a hexagonal spin lattice 2(a) (here for simplicity without periodic boundary conditions), a one-dimensional fermionic lattice 2(b), or a molecular electronic structure problem 2(c). The first two examples assume systems where the interaction of the sites decreases with increasing lattice distance. Here, ν corresponds to the Manhattan distance and the number of additional fermionic sites to be included, respectively. For fermionic lattices, which sites are to be included is also chosen by the Manhattan distance of the fermionic lattice in case of short range interactions. In the last example, ν sets the number of additional orbitals (i.e. additional to the support) included in the domain and which we choose based on the largest mutual information value I(p, q) (where large magnitudes are indicated by thicker connecting lines), where p ∈ S_j and q ∈ [L]\S_j, where for large systems I(p, q) needs to be provided by an approximate classical DMRG calculation of the ground state.
Download figure:
Standard image High-resolution image

In figure 2(c), $\hat{h}[j]$ is part of a molecular electronic structure Hamiltonian of L = 6 orbitals, i.e. 12 spin-orbitals or qubits. In this example, the support is given by S_j = {1, 5} and the domain is D_j = {1, 3, 5} (D_j = {0, 1, 3, 5}) for ν = 1 (ν = 2). Here, ν describes the number of additional orbitals besides those included in S_j and we choose the additional orbitals based on those which possess the largest mutual information I(p, q) (indicated by the thickness of the connecting lines), where p ∈ S_j and q ∈ [L]\S_j. Note, that the definition and particular form of the mutual information for the studied systems will be detailed in section 2.4 and Appendix G.

In general, the size of the support can vary and is bounded by the locality of the Hamiltonian. For systems with long-range (or also just beyond nearest-neighbor-) interactions, the size of the domain D_j of a given term $\hat{h}[j]$ will be larger than for systems restricted to only nearest-neighbor interactions. As we will see in the following, the computational cost of QITE will scale exponentially in the size of the domain D_j, and is the main limiting factor for the simulations we were able to run (and not the system size L).

b. Exact QITE It was shown in [6], that for d-dimensional lattice spin systems whose Hamiltonian is k-local, the support of the Hermitian operator $\hat{A}[l]$ in equation (11) can be chosen to be the domain D_l of the support S_l of the Hamiltonian term $\hat{h}[l]$ within a Manhattan distance ν for $l\in [\tilde{m}]$ . More specifically, it was shown that for spin systems with a finite correlation length ${ \mathcal C }$ , in order to approximate the ITE state within precision ε > 0, i.e.

$\begin{eqnarray}&&\unicode{x02016}\displaystyle \prod _{l=1}^{mn}\frac{1}{\sqrt{{c}_{l}}}{e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle -\displaystyle \prod _{l=1}^{mn}{\hat{U}}_{l}| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle \unicode{x02016}\,\leqslant \,\varepsilon ,\end{eqnarray} \tag{ 12 }$

one requires the Manhattan distance to be at least

$\begin{eqnarray}&&{\nu }_{{ \mathcal C }}=2{ \mathcal C }\mathrm{ln}(2\sqrt{2}nm/\varepsilon ).\end{eqnarray} \tag{ 13 }$

The proof of the above is based on Uhlmann’s theorem [42] and will not be subject of this work (a detailed proof is given in [6]). For a d-dimensional lattice spin system, the size of the domain of $\hat{h}[f(l)]$ scales as ∣D_f(l)∣ ≤ kν^d for every Hamiltonian term $\hat{h}[l]$ . Using equation (13), this means that for a finite correlation length ${ \mathcal C }$ , the number of indices in D(l) is independent of the system size L.

Since $\hat{A}[l]$ is unknown, one has to expand it in a basis of operators and determine its components. While in general such a basis would contain a number ${ \mathcal O }(\exp (L))$ of basis operators, a direct consequence of equation (13) is that the basis must only include roughly $\exp (k{\nu }^{d})$ basis operators, which is independent of the system size L. The running time T of the corresponding QITE algorithm for an imaginary time τ = nΔτ is then roughly given by

$\begin{eqnarray}&&T=mn{e}^{k{\nu }^{d}}.\end{eqnarray} \tag{ 14 }$

The above results provide the mathematical basis that legitimizes the Ansatz to approximate the non-unitary QITE by a sequence of unitary operators for lattice spin systems with a finite correlation length. It is then conjectured in [6] that a similar result holds for fermionic systems under similar conditions. However, many systems of interests display long-range interactions, diverging correlation lengths, or do not admit to be formulated as a simple lattice theory, such as molecular electronic structure Hamiltonians. Part of the objective of this work is to study how the QITE algorithm used as a heuristic Ansatz described in the following paragraph performs for such systems.

c. Main approximation to speed up ITE and QITE simulation In order to speed up the trotterized ITE, spin- and fermionic-QITE simulations, we are uniting all Hamiltonian terms $\hat{h}[l]$ which possess the same domain D_l for a given ν. However, if there are two domains that are subsets of each other, e.g. D_i = {1, 3} and D_j = {1, 2, 3}, we treat them separately. This reduces the number of Hamiltonian terms $\tilde{m}$ in equation (1) greatly, but also leads to terms $\hat{h}[l]$ which contain a linear combination of different spin and fermionic excitation operators that will be difficult to realize in practice.

d. Inexact QITE It is also possible to use the QITE algorithm as a heuristic algorithm to find an approximate ground state ∣Ψ〉 using a very small domain size, i.e. ν = 1, 2, … , which in many instances will be much smaller than the true correlation length ${ \mathcal C }$ of the studied system. In this case, QITE loses its rigorous performance guarantees from equations (12)–(13). This means that at some point in time, the unitary approximation ${\hat{U}}_{l}$ of the trotterized imaginary time propagator ${e}^{-{\rm{\Delta }}\tau \hat{h}[f(l)]}$ will start to deviate significantly and the resulting state ∣Ψ_l〉 will start to differ largely from its ITE counterpart. Nevertheless, it is still possible, that for small evolution times the evolved state will lead to an improvement over the classical input state ∣Ψ_init〉. We will examine this claim thoroughly in section 5.

2.3.3. Spin-QITE

If the Hermitian Hamiltonian term $\hat{h}[f(l)]$ describes a spin operator, the Hermitian operator $\hat{A}[l]$ can be determined from an expansion in a spin operator basis, as we explain in the following and refer to as spin-QITE. Let D_l denote the domain of $\hat{h}[l]$ which we assume to be a tensor product of spin operators ${\hat{\sigma }}_{i}^{x}$ , ${\hat{\sigma }}_{i}^{y}$ , and ${\hat{\sigma }}_{i}^{z}$ , and we define ${\hat{\sigma }}_{i}^{0}={\mathbb{1}}$ as the identity on spin index i ∈ [L]. We write ${\{{\hat{\sigma }}^{0},{\hat{\sigma }}^{x},{\hat{\sigma }}^{y},{\hat{\sigma }}^{z}\}}^{\otimes {D}_{l}}$ in order to denote all possible ${4}^{| {D}_{l}| }$ Pauli strings (including the identity) on the indices contained in D_l. A single element of ${\{0,x,y,z\}}^{\otimes {D}_{l}}$ is given by ${\hat{\sigma }}_{{\bf{I}}}={\hat{\sigma }}_{{i}_{1}}^{{\alpha }_{1}}{\hat{\sigma }}_{{i}_{2}}^{{\alpha }_{2}}\cdots {\hat{\sigma }}_{{i}_{| D(l)| }}^{{\alpha }_{| D(l)| }}$ , where I = {(i₁, α₁), (i₂, α₂), …, (i_∣D(l)∣, α_∣D(l)∣)}, 1≤i₁ < i₂ < ⋯ < i_∣D(l)∣<L and α_k ∈ {0, x, y, z}. We then define the expansion of an Hermitian operator $\hat{A}[l]$ in the spin basis as

$\begin{eqnarray}&&\hat{A}[l]=\displaystyle \sum _{{\bf{I}}\in {\{0,x,y,z\}}^{\otimes {D}_{l}}}a{[l]}_{{\bf{I}}}{\hat{\sigma }}_{{\bf{I}}},\end{eqnarray} \tag{ 15 }$

where $a{[l]}_{{\bf{I}}}\in {\mathbb{R}}$ are real-valued coefficients and ${\hat{\sigma }}_{{\bf{I}}}$ are Hermitian operators. Note, that in principle not all possible Pauli operator strings will contribute to the solution, and their number can be reduced by symmetry considerations and properties of the desired wave function [6]. However, in our spin system simulations we included all ${4}^{| {D}_{l}| }$ operators and leave their potential reduction for future work.

As shown in appendix B, the coefficients a[l]_I (and therefore also $\hat{A}[l]$ and ${\hat{U}}_{l}$ ), condensed in the vector a[l], can be obtained from a classical solution of the following system of linear equations,

$\begin{eqnarray}&&{\bf{S}}[l]{\bf{a}}[l]=-{\bf{b}}[l],\end{eqnarray} \tag{ 16 }$

where we defined the Gram matrix S[l] and vector b[l] with real-valued entries as

$\begin{eqnarray}&&S{[l]}_{{\bf{I}},{\bf{J}}}=\langle {{\rm{\Psi }}}_{l-1}| \{{\hat{\sigma }}_{{\bf{I}}},{\hat{\sigma }}_{{\bf{J}}}\}| {{\rm{\Psi }}}_{l-1}\rangle ,\end{eqnarray} \tag{ 17 }$

$\begin{eqnarray}&&b{[l]}_{{\bf{I}}}=i\langle {{\rm{\Psi }}}_{l-1}| [{\hat{\sigma }}_{{\bf{I}}},\hat{h}[l]]| {{\rm{\Psi }}}_{l-1}\rangle ,\end{eqnarray} \tag{ 18 }$

which correspond to the real- and imaginary parts of the expectation values of $2{\hat{\sigma }}_{{\bf{I}}}{\hat{\sigma }}_{{\bf{J}}}$ and $2{\hat{\sigma }}_{{\bf{I}}}\hat{h}[l]$ , respectively. We introduced the notation $[\hat{A},\hat{B}]=\hat{A}\hat{B}-\hat{B}\hat{A}$ for the commutator, and $\{\hat{A},\hat{B}\}=\hat{A}\hat{B}+\hat{B}\hat{A}$ for the anti-commutator of two operators $\hat{A},\hat{B}$ . When considering the Gram matrix S[l], we observe that ${\hat{\sigma }}_{{\bf{I}}}{\hat{\sigma }}_{{\bf{J}}}=\alpha ({\bf{I}},{\bf{J}}){\hat{\sigma }}_{{\bf{K}}}$ , where α(I, J) ∈ { ± 1, ± i} is a phase factor that depends on the Pauli strings described by the index vectors I, J. Thus, even though the Gram matrix is a ${4}^{| {D}_{l}| }\times {4}^{| {D}_{l}| }$ matrix, it only requires the evaluation of ${4}^{| {D}_{l}| }$ expectation values $\langle {{\rm{\Psi }}}_{l-1}| {\hat{\sigma }}_{{\bf{K}}}| {{\rm{\Psi }}}_{l-1}\rangle$ .

At each step of the QITE algorithm, the wave function ∣Ψ_l〉 has to be generated, and the ${4}^{| {D}_{l}| }$ Pauli string operator expectation values in equations (17)–(18) have to be measured on the quantum computer. As previously stated, in practice one still has to decompose each unitary ${\hat{U}}_{l}$ into e.g. one- and two-qubit gate operations, a cost that will also scale as ${ \mathcal O }(\exp | {D}_{l}| )$ [43]. We will ignore this cost (as well as the potential error that comes with its implementation) entirely in this work and will simply assume that U_l can be implemented exactly. We also ignore the error resulting from using a finite number of measurements in order to evaluate equations (17)–(18). This means that the actual performance of the Spin-QITE (as well as the fermionic-QITE formulation introduced in the following section) in an actual quantum computation will most likely perform worse than our algorithmic studies would indicate.

2.3.4. Fermionic-QITE

If the Hermitian Hamiltonian term $\hat{h}[f(l)]$ describes a fermionic operator, the Hermitian operator $\hat{A}[l]$ can be determined from an expansion in a fermionic operator basis, as we explain in the following and refer to as fermionic-QITE. We expand the Hermitian operator $\hat{A}[l]$ in terms of products of Majorana operators ${\hat{\gamma }}_{\mu }$ with $\mu \in [2{L}^{{\prime} }]$ , where ${L}^{{\prime} }$ denotes the actual number of fermionic modes (for instance, for a Fermi-Hubbard lattice with L sites or a molecular electronic structure Hamiltonian described by a basis of L orbitals, ${L}^{{\prime} }=2L$ ). Majorana operators are Hermitian operators that fulfill the anticommutator relation $\{{\hat{\gamma }}_{\mu },{\hat{\gamma }}_{\nu }\}=2{\delta }_{\mu \nu }$ and can be expressed in terms of fermionic creation and annihilation operators through ${\hat{\gamma }}_{2j-1}={\hat{c}}_{j}+{\hat{c}}_{j}^{\dagger }$ and ${\hat{\gamma }}_{2j}=-i({\hat{c}}_{j}-{\hat{c}}_{j}^{\dagger })$ for $j\in [{L}^{{\prime} }]$ . We further define an ordered set of indices μ = {μ₁, μ₂, …, μ_∣μ∣}, where ${\mu }_{k}\in [2{L}^{{\prime} }]$ and $0\leqslant {\mu }_{1}\lt {\mu }_{2}\lt \cdots \lt {\mu }_{| {\boldsymbol{\mu }}| }\leqslant 2{L}^{{\prime} }$ . Since we will only consider systems that conserve parity, we restrict the number of elements in our sets to be even, ∣μ∣ = 2m, where $m\subseteq [{L}^{{\prime} }]$ . We introduce the following short-hand notation for an ordered product of Majorana operators

$\begin{eqnarray}&&{\hat{\gamma }}_{{\boldsymbol{\mu }}}={(-i)}^{\left(\genfrac{}{}{0.0pt}{}{| {\boldsymbol{\mu }}| }{2}\right)}\displaystyle \prod _{\mu \in {\boldsymbol{\mu }}}{\hat{\gamma }}_{\mu },\end{eqnarray} \tag{ 19 }$

where the prefactor ensures that ${\hat{\gamma }}_{{\boldsymbol{\mu }}}$ is hermitian. Similar to equation (15), for fermionic systems we then choose the following basis expansion,

$\begin{eqnarray}&&\hat{A}[l]=\displaystyle \sum _{{\boldsymbol{\mu }}}a{[l]}_{{\boldsymbol{\mu }}}{\hat{\gamma }}_{{\boldsymbol{\mu }}}.\end{eqnarray} \tag{ 20 }$

Here, the sum goes over various sets μ which have yet not been specified (except for them being ordered and containing an even number of elements from $[2{L}^{{\prime} }]$ ). By choosing $a{[l]}_{{\boldsymbol{\mu }}}\in {\mathbb{R}}$ , the operator $\hat{A}[l]$ will be hermitian, and we can use the same linear equation that we derived for the Spin-QITE, equation (B9),

$\begin{eqnarray}&&\displaystyle \sum _{{\boldsymbol{\mu }}}a{[l]}_{{\boldsymbol{\mu }}}\langle {{\rm{\Psi }}}_{l-1}| \{{\hat{\gamma }}_{{\boldsymbol{\mu }}},{\hat{\gamma }}_{{\boldsymbol{\nu }}}\}| {{\rm{\Psi }}}_{l-1}\rangle =-i\langle {{\rm{\Psi }}}_{l-1}| [{\hat{\gamma }}_{{\boldsymbol{\nu }}},\hat{h}[l]]| {{\rm{\Psi }}}_{l-1}\rangle ,\end{eqnarray} \tag{ 21 }$

or equivalently

$\begin{eqnarray}&&{\bf{S}}[l]{\bf{a}}[l]=-{\bf{b}}[l],\end{eqnarray} \tag{ 22 }$

where the minus sign is convention and we defined the following real-valued entries

$\begin{eqnarray}&&S{[l]}_{{\boldsymbol{\mu ,\nu }}}=\langle {{\rm{\Psi }}}_{l-1}| \{{\hat{\gamma }}_{{\boldsymbol{\mu }}},{\hat{\gamma }}_{{\boldsymbol{\nu }}}\}| {{\rm{\Psi }}}_{l-1}\rangle ,\end{eqnarray} \tag{ 23 }$

$\begin{eqnarray}&&b{[l]}_{{\boldsymbol{\nu }}}=i\langle {{\rm{\Psi }}}_{l-1}| [{\hat{\gamma }}_{{\boldsymbol{\nu }}},\hat{h}[l]]| {{\rm{\Psi }}}_{l-1}\rangle .\end{eqnarray} \tag{ 24 }$

Since in QITE we expand the operator $\hat{A}[l]$ in the domain of the current Hermitian Hamiltonian term $\hat{h}[l]$ , we will define a corresponding set of indices ${D}_{l}^{(\gamma )}$ : These are the Majorana indices of the domain D_l. To illustrate, consider the case where the domain contains two indices D_l = {i, j} with $i\lt j\in [{L}^{{\prime} }]$ , then ${D}_{l}^{(\gamma )}=\{2i-1,2i,2j-1,2j\}$ . Therefore, the sum in equation (20) goes over all ${\boldsymbol{\mu }}\subseteq {D}_{l}^{(\gamma )}$ in the most general case. If one were to include the identity operator 1, one could express every Hermitian operator that acts in the Hilbert-subspace containing D_l modes with equation (20). However, it is often possible to truncate the expansion at lower order polynomials ∣μ∣ < 2∣D_l∣ and use basis expansion operators which respect the symmetry of the underlying system Hamiltonian, similar to the procedure in (unitary) coupled cluster approaches [44–46]. In what follows, we restrict ourselves to expansions of up to order ∣μ∣ = 4, but in principle, higher order expansions are also possible.

Number-conserving Ansatz Since for many fermionic systems, the particle number operator $\hat{N}={\sum }_{p=1}^{L}\left({\hat{c}}_{p,\alpha }^{\dagger }{\hat{c}}_{p,\alpha }+{\hat{c}}_{p,\beta }^{\dagger }{\hat{c}}_{p,\beta }\right)$ and other operators, such as the spin projection operator ${\hat{S}}_{z}=\frac{1}{2}{\sum }_{p=1}^{L}\left({\hat{c}}_{p,\alpha }^{\dagger }{\hat{c}}_{p,\alpha }-{\hat{c}}_{p,\beta }^{\dagger }{\hat{c}}_{p,\beta }\right)$ commute with the system Hamiltonian, one can reduce the number of terms in equation (20) by restricting the operator basis ${\hat{\gamma }}_{{\boldsymbol{\mu }}}$ to operators that respect those symmetries. We define the orbital excitation operator

$\begin{eqnarray}&&{\hat{E}}_{pq}={\hat{c}}_{p,\alpha }^{\dagger }{\hat{c}}_{q,\alpha }+{\hat{c}}_{p,\beta }^{\dagger }{\hat{c}}_{q,\beta }.\end{eqnarray} \tag{ 25 }$

This allows us to define the single orbital excitation operator

$\begin{eqnarray}&&i({\hat{E}}_{pq}-{\hat{E}}_{pq}^{\dagger })\end{eqnarray} \tag{ 26 }$

and double excitation operator

$\begin{eqnarray}&&i({\hat{E}}_{pq}{\hat{E}}_{rs}-{\hat{E}}_{rs}^{\dagger }{\hat{E}}_{pq}^{\dagger }),\end{eqnarray} \tag{ 27 }$

where we assume p < r and q < s. Equations (26)–(27) will replace the more general Ansatz γ_μ of equation (19) in our fermionic-QITE algorithm for both, fermionic lattice and molecular electronic structure Hamiltonians.

In principle, one would have to include not just the single and double excitation operators in equations (26)–(27) in γ_μ, but also up to n_el-th order excitation operators, where n_el denotes the number of particles, in order to be able to exactly reproduce the ITE behavior with QITE. Thus the single- and double restriction introduces an additional error, which is independent of the Trotterization error.

Note, that while fermionic-QITE can be formulated in the fermionic picture and then transformed into Pauli operators by means of e.g. the Jordan-Wigner transformation, one could directly implement it on a fermionic quantum processing unit [47], provided sufficient coherent control and the means of decomposing the unitaries ${\hat{U}}_{l}$ into the basic underlying fermionic operations is possible at acceptable cost.

We summarize the main steps of the trotterized ITE (steps 1+4a), exact ITE (steps 1+4b), spin-QITE (steps 1+2+3+5a) and fermionic-QITE (steps 1+2+3+5b), as well as the main sources of approximation errors (highlighted by red boxes) in figure 3.

Figure 3. Refer to the following caption and surrounding text. — **Figure 3.** Sketch of the trotterized ITE and QITE. The blue highlighted numbers indicate the main intermediate steps. Note, that the major step of providing an initial state ∣Ψ_init〉, as discussed in section 4, has been omitted from this flow chart. The white box gives three examples to better understand the connection between the domains D_j, the number of Hamiltonian terms $\tilde{m}$ and the parameter ν that defines the size of the domain for a given Hamiltonian term $\hat{h}[j]$ . Trotterized ITE is given by steps 1+4a and takes a given system Hamiltonian that can be written as a sum of individual tensor products and realized the non-unitary ITE propagation. Exact ITE is described by steps 1+4b and differs from trotterized ITE in that omits the trotterization all together. QITE on the other hand approximates the ITE propagator of Hamiltonian terms for a given ν as explained below step 2, and approximates the second-order Trotterized ITE by a sequence of unitary operators. For spin-QITE, the operator expansion is given by equation (16) and it is realized by following steps 1+2+3+5a. Fermionic-QITE uses a truncated operator expansion, only considering single and double fermionic operator excitations, and is realized through steps 1+2+3+5b. In the above diagram, ’mol. el.’ refers to ’molecular electronic’. Importantly, this sketch highlights the main sources of error (indicated as red boxes) that appear, including errors due to Trotterization (steps 3+4a), using a too small value ν < ν^* (steps 5a+5b), and the error due to using a truncated operator expansion in step 5b. The green (rose) color coding in the left- and right-most columns indicate that the FHM and electronic structure Hamiltonian (spin lattice Hamiltonians) are solved via fermionic (spin)-QITE.
Download figure:
Standard image High-resolution image

**Figure 3.** Sketch of the trotterized ITE and QITE. The blue highlighted numbers indicate the main intermediate steps. Note, that the major step of providing an initial state ∣Ψ_init〉, as discussed in section 4, has been omitted from this flow chart. The white box gives three examples to better understand the connection between the domains D_j, the number of Hamiltonian terms $\tilde{m}$ and the parameter ν that defines the size of the domain for a given Hamiltonian term $\hat{h}[j]$ . Trotterized ITE is given by steps 1+4a and takes a given system Hamiltonian that can be written as a sum of individual tensor products and realized the non-unitary ITE propagation. Exact ITE is described by steps 1+4b and differs from trotterized ITE in that omits the trotterization all together. QITE on the other hand approximates the ITE propagator of Hamiltonian terms for a given ν as explained below step 2, and approximates the second-order Trotterized ITE by a sequence of unitary operators. For spin-QITE, the operator expansion is given by equation (16) and it is realized by following steps 1+2+3+5a. Fermionic-QITE uses a truncated operator expansion, only considering single and double fermionic operator excitations, and is realized through steps 1+2+3+5b. In the above diagram, ’mol. el.’ refers to ’molecular electronic’. Importantly, this sketch highlights the main sources of error (indicated as red boxes) that appear, including errors due to Trotterization (steps 3+4a), using a too small value ν < ν^* (steps 5a+5b), and the error due to using a truncated operator expansion in step 5b. The green (rose) color coding in the left- and right-most columns indicate that the FHM and electronic structure Hamiltonian (spin lattice Hamiltonians) are solved via fermionic (spin)-QITE.
Download figure:
Standard image High-resolution image

2.4. Mutual information for domain selection in molecular electronic structure Hamiltonians

For quantum chemistry systems, the concept of a correlation length that determines how to choose the orbitals for the domain D_j is less natural and needs to be replaced by an appropriate selection criterion. Typically, correlations of two observables ${\hat{M}}_{A},{\hat{M}}_{B}$ of two subsystems A and B are measured by connected correlation functions, ${ \mathcal C }({\hat{M}}_{A},{\hat{M}}_{B})=\langle \hat{{M}_{A}}\otimes {\hat{M}}_{B}\rangle -\langle \hat{{M}_{A}}\rangle \langle \hat{{M}_{B}}\rangle$ (where the tensor product is here included for clarity). Other measures of correlation exist and might be advantageous in certain cases. One such alternative measure is the mutual information I_A,B, which is defined as [48]

$\begin{eqnarray}&&{I}_{A,B}=S(A)+S(B)-S(A,B),\end{eqnarray} \tag{ 28 }$

where $S(C)=-\,\rm{tr}\,({\hat{\rho }}_{C}\mathrm{ln}({\hat{\rho }}_{C}))$ is the reduced von Neumann entropy and ${\hat{\rho }}_{C}={\,\rm{tr}\,}_{L\setminus C}(\hat{\rho })$ is the reduced density matrix of a sub-system C. The mutual information is a measure of the total amount of correlation present in a quantum state $\hat{\rho }$ . At zero temperature it coincides with the entanglement entropy, and it is a measure of the total amount of information that system A possesses about system B. Whether the correlations are due to quantum entanglement or whether they are a result of classical correlations, can however not simply be deduced from equation (28) [48, 49]. One advantage of the mutual information over connected correlation functions is that correlations in the systems are a lot less likely to be overlooked, since ${I}_{A,B}\geqslant \frac{{ \mathcal C }{({\hat{M}}_{A},{\hat{M}}_{B})}^{2}}{2\parallel {\hat{M}}_{A}{\parallel }^{2}\parallel {\hat{M}}_{B}{\parallel }^{2}}$ [50]. It follows that a mutual information that displays exponentially decaying mutual information in the distance between A and B also displays exponentially decaying correlation functions.

In the context of quantum chemistry, the mutual information has been used as a means to reintroduce locality in the second quantized Hamiltonian, a prerequisite for MPS optimized through DMRG approaches to be successful when applied to molecular Hamiltonians [3]. A second quantized molecular Hamiltonian is an artificial quasi one-dimensional lattice representation of the molecular electronic structure Hamiltonian, where each lattice site index corresponds to a molecular orbital. In principle, all sites are coupled to each other by means of one- and two-particle terms that connect up two four orbitals. The mutual information has been used in order to come up with a black-box algorithm that is able to identify the suitable AS that describes the part of the molecule that displays the strongest correlation [51]. This can be viewed as restoring a sort of locality in the Hamiltonian, or alternatively as reducing the average interaction-range of a Hamiltonian, by finding a suitable basis, and is central to the success of DMRG when applied to molecular electronic structure Hamiltonians [3, 48, 51–53].

Naturally, the mutual information can also be used in order to identify the amount of correlation within an AS, i.e. which AS orbital pairs are strongly correlated. In the context of QITE, this represents a physically motivated criterion for picking a suitable domain D_j for the support S_j of a given second quantized fermionic Hamiltonian term $\hat{h}[j]$ . In all our simulations, for a given $\hat{h}[j]$ and corresponding S_j, we let ν denote the number of additional orbitals we want to include in the domain D_j. Which orbitals are included is decided based on which orbitals have the largest mutual information w.r.t. the orbitals contained in the support, ${{\rm{\max }}}_{p\in {S}_{j},q\in [L]\setminus {S}_{j}}I(p,q)$ .

In figure 4 we show the results for the orbital mutual information computed for the singlet molecular system O₃ described in table 1. For this system, the largest orbital correlation can be observed between orbitals 5 and 6, followed by a weaker correlation between orbital 2 and 6. The example of figure 2(c) that showcases how the domain is chosen for a given molecular Hamiltonian was generated by a fictitious 6-orbital system, but would in general be obtained from the mutual information as shown in figure 4.

Table 1. Molecular systems studied in this work. The number of unpaired electrons is written in parentheses after the atomic/molecular symbol. The number of electrons and orbitals in the AS in the respective basis set is given by the tuple (n_el, n_orb). For reference, the last column gives the converged Restricted Open-shell Hartree–Fock (ROHF) energy ${E}_{\rm{ROHF}\,}^{\,\rm{full}}$ (in Hartree [Ha]) in the full molecular orbital basis calculated using the Python package PySCF. We include the superscript in order to distinguish this ROHF energy from the energies obtained from ROHF calculations restricted to the AS Hamiltonian in table 3.

System	Basis	AS	${E}_{\rm{ROHF}\,}^{\,\rm{full}}$ [Ha]
Ne (0)	cc-pVDZ	(8,8)	−128.48878
Fe(III)-NTA (1)	def2-QZVPP	(5,5)	−2149.28577
Fe(III)-NTA (3)	def2-QZVPP	(5,5)	−2149.32012
O₂ (0)	cc-pVQZ	(8,6)	−149.59954
O₂ (2)	cc-pVQZ	(8,6)	−149.66431
O₃ (0)	cc-pVQZ	(12,9)	−224.35855

Figure 4. Refer to the following caption and surrounding text. — **Figure 4.** Mutual information I(i, j) between spatial orbitals i, j of the AS ground state of system singlet O₃ described in table 1.
Download figure:
Standard image High-resolution image

A fact that we have so far neglected is that in order to use the mutual information as a selection criterion, one needs to have access to the density matrix ${\hat{\rho }}_{\rm{gs}\,}=| {{\rm{\Psi }}}_{\,\rm{gs}}\rangle \langle {{\rm{\Psi }}}_{\,\rm{gs}\,}|$ . Clearly one faces a dilemma here, since it is the ground state which we want to approximate in QITE. However, we propose to use an approximate MPS state obtained from a DMRG simulation of the molecular Hamiltonian ground state problem in order to compute the mutual information. However, using the mutual information obtained from an approximate solution could lead in some instances to domains that differ from domains chosen by means of the exact solution, which in turn could lead to a different performance of QITE. A study of this effect is however not part of this work.

3. Studied systems

We consider a variety of lattice and molecular electronic structure Hamiltonians. In particular, we study the Heisenberg Model (HM), the Transverse-Field Ising Model (TFIM), the Fermi-Hubbard Model (FHM), and the molecular electronic structure Hamiltonian AS of Ne, two forms of the Fe(III)-NTA molecule and three forms of molecular oxygen. For the lattice geometries, we consider a one-dimensional ring, a two-dimensional triangular ladder, and a two-dimensional hexagonal lattice with periodic boundary conditions in one direction, see figure 5. For the spin systems, we consider Nearest-Neighbor (NN), Short-Range (SR)—which includes NN and Next-to-NN (NNN) interactions—, and Long-Range (LR) interactions, while the FHM is restricted to NN hopping and on-site interactions. The Hamiltonians we consider all fall into the category of k-local Hamiltonians, which is the class of Hamiltonians that can be written as a sum of terms, where each term acts on at most k qubits or particles.

Figure 5. Refer to the following caption and surrounding text. — **Figure 5.** Lattice systems studied in this work. Periodic boundary conditions are highlighted as red and blue lines.
Download figure:
Standard image High-resolution image

Solving the ground state problem of a given k-local Hamiltonian is QMA-complete [4], which means that there is very likely no classical or quantum algorithm that can solve the hardest instances of the problem in polynomial time. The Hamiltonians considered in our work display in the worst case a computational complexity that lies in the class of QMA-complete (HM [54, 55], FHM [56], molecular electronic structure Hamiltonians [57]), or in the class of StoqMA-complete (TFIM [58]). However, we will in this work not try to determine the hardest instances of a given Hamiltonian. In industrial simulation workflows of material and quantum chemistry systems, the parameters describing a spin- or fermionic Hamiltonian are determined from a cascade of approximations. These approximations aim to keep the model computationally tractable, yet retain the main interaction mechanisms which allow for qualitatively correct predictions. It is not clear beforehand whether or not the resulting Hamiltonians from such a workflow are indeed difficult to study with classical computational methods and, also, whether a quantum computer could provide an advantage (in the sense of either computational speedup, or increased accuracy over classical methods, at justifiable economical cost and ecological footprint).

Two of the systems we study, a J₁J₂ model of CrI₃ and the AS of Fe(III)-NTA, represent prototypical candidates of systems that could originate from such industrial workflows. For instance, when deriving a simplified single-layer model of CrI₃ using the respective interaction terms from density functional theory calculations of J₁ = 3.29 meV and J₂ = −0.07 meV [59], we found that the values we obtain lead to a system which can be exactly solved by Generalized Hartree–Fock (GHF) theory (ferromagnetic ground state at 0 K), in agreement with what was reported in literature [60]. In order to apply the QITE algorithm to a less trivial system, we choose a set of interaction parameters that results in a ground state different from ferromagnetic. We select J₁ = 1 and J₂ = −0.5 at the phase boundary according to the diagram shown in figure 3 of [60] where the classical mean-field method GHF introduced in section 4.3.1 no longer gives an accurate description. The parameters for the other lattice systems we consider are chosen similarly in a way that GHF no longer provides a very good solution. Analogously, in the case of molecular electronic structure Hamiltonians, besides Fe(III)-NTA we include examples of molecular oxygen, which are known to be notoriously difficult for single-reference methods such as Hartree–Fock (HF) due to their large amount of static correlation [61].

The Hamiltonians defined in the two subsections below, together with the lattice geometries depicted in figure 5, and the system specifications provided in tables 1 and 2 for the lattice and molecular systems, respectively, fully characterize the systems studied in this work.

Table 2. Lattice systems studied in this work, including the Heisenberg Model (HM) 3.1.1, the J₁J₂-model (which is a HM restricted to NN and NNN interactions), the Transverse-Field Ising Model (TFIM) 3.1.2, and Fermi-Hubbard Model (FHM) 3.1.3. The number of sites L is identical to the number of qubits for all systems, with the exception of the FHM, where ${L}^{{\prime} }=2L$ qubits are required to describe L sites. The dimension of the lattice is given by D. The type of interaction is indicated by NN, SR, or LR interaction. The parameters coupled with the respective equations and the boundary conditions shown in figure 5 describe the resulting Hamiltonian.

System	equation	D	Lattice	Type	L	Parameters
HM I	(29)	1	Figure 5(a)	NN	10	B = 0, J = 1
HM II	(29)	1	Figure 5(a)	LR	10	B = 0.4, α = 1
HM III	(29)	2	Figure 5(b)	NN	12	B = 0.4, J = 1
TFIM I	(31)	1	Figure 5(a)	LR	10	B = 0.4, α = 0.3
TFIM II	(31)	1	Figure 5(a)	LR	10	B = 0.4, α = 0.1
J₁J₂	(29)	2	Figure 5(c)	SR	12	B = 0.1, J₁ = 1, J₂ = − 0.5
FHM	(32)	1	Figure 5(a)	NN	10	t = 1, U = 1

Table 3. The first column shows the multi-configurational diagnostic Z_s(1) defined in equation (36) of the ground state of the AS Hamiltonians in equation (33) of the various systems defined in table 1. The remaining columns give the AS ROHF energy E_ROHF, the ground state energy E₀ which corresponds to the ground state energy of the AS Hamiltonian, the final energy E_ITE of the trotterized ITE, the squared overlaps of the ROHF wave function ${F}_{\rm{ROHF}\,}={\gamma }_{\,\rm{init}}^{2}$ , and of the ITE wave function F_ITE = γ² of the AS Hamiltonian. Energies are given in units of Hartree ([Ha]), while fidelities are given as a percentage ([%]). Note, that the ITE of O₂ (0) is not converged yet at τ = 10, but converges at a much later point in time as can be seen in figure 26. A similar behavior is observed for Fe(III)-NTA (1).

System	Z_s(1)	E_ROHF	E₀	E_ITE	F_ROHF	F_ITE
Ne (0)	0.0441	−128.4888	−128.6033	−128.6028	97.866	99.983
Fe(III)-NTA (1)	0.1839	−2149.2850	−2149.3024	−2149.3014	89.005	95.925
Fe(III)-NTA (3)	0.0459	−2149.3187	−2149.3310	−2149.3310	97.581	99.996
O₂ (0)	0.2607	−149.5900	−149.7306	−149.7203	46.187	64.759
O₂ (2)	0.1285	−149.6628	−149.7633	−149.7633	93.340	99.992
O₃ (0)	0.1860	−224.3529	−224.5866	−224.586	81.216	99.983

3.1. Lattice Hamiltonians

3.1.1. Heisenberg model

The Heisenberg model describes an effective model for Mott insulators, and follows as a special case of the Fermi-Hubbard Hamiltonian—described below in section 3.1.3—in the large U / t limit. The Heisenberg Hamiltonian is given by [62]

$\begin{eqnarray}&&\hat{H}=\displaystyle \sum _{i\lt j}{J}_{ij}\left({\hat{\sigma }}_{i}^{x}{\hat{\sigma }}_{j}^{x}+{\hat{\sigma }}_{i}^{y}{\hat{\sigma }}_{j}^{y}+{\hat{\sigma }}_{i}^{z}{\hat{\sigma }}_{j}^{z}\right)+B\displaystyle \sum _{i}{\hat{\sigma }}_{i}^{z}.\end{eqnarray} \tag{ 29 }$

Here, J_ij > 0 describes antiferromagnetic (AFM) coupling and B the strength of the external magnetic field. We implicitly assume an enumeration i = 1, 2, …, L of the underlying lattice. Equation (29) is not restricted to only describing one-dimensional, but can also describe higher dimensional lattice systems. For NN (NNN) interactions, J_ij ≠ 0 only when the indices i, j are next to each other in the lattice (two nearest-neighbor hops apart), which is indicated by 〈ij〉 (〈〈ij〉〉). Then, J_ij reduces to J_〈ij〉 = J. The Hamiltonian which describes a Heisenberg model restricted to J₁ = J_〈ij〉 and J₂ = J_{〈〈ij〉〉} interactions will also be studied, and is known as the J₁J₂-model [60]. We also consider long-range interactions, which have been realized in experimental setups [63] and are described by

$\begin{eqnarray}&&{J}_{ij}=\frac{1}{| i-j{| }^{\alpha }},\end{eqnarray} \tag{ 30 }$

where α is a free parameter which allows one to tune the system from a uniform (α = 0) to a NN (α → ∞) interaction. The range 0 < α < 1 in one dimensional systems is considered as the strong-long-range regime. Here, as well as in the TFIM below, we assume that ∣i − j∣ denotes the Manhattan distance of the sites i and j in the lattice (thus the minimal amount of nearest-neighbor hops in the lattice connecting two sites rather than the actual physical distance).

Chromium Triiodide While ferromagnetism in two-dimensional van der Waals (vdW) materials has been known for decades [64, 65], reports of isolated, atomically thin magnetic sheets through successful exfoliation from the bulk have only recently emerged [66], such as Cr₂Ge₂Te₆ [66], FePS₃ [67], VSe₂ [68], and MnSe [69]. One of the earliest and most studied 2D magnets is CrI₃ [70], which exhibits a honeycomb lattice where the magnetic Cr³⁺ ions within each layer are interconnected by bridging iodine ligands [71, 72]. The implication of integrating such magnets in vertical vdW heterostructures are tremendous, allowing controlled tuning of magnetic properties through tailored interlayer interactions or external fields to explore different magnetic order or to investigate new physical phenomena. A successful deployment in novel devices could potentially unlock their applications in spintronics where the electron spin is exploited as a further degree of freedom in addition to their charge, for example to store or transfer data, or for computation. To this end, spintronic devices have been explored extensively, including, e.g., spin valves and spin filters [73, 74].

From a theoretical perspective, while the Mermin-Wagner theorem excludes order in two dimensions at finite temperatures, the introduction of any anisotropy can lead to a gapping of the low-energy modes and allows the formation of long-range ordering. In practice, the source of such anisotropies can stem, e.g., from spin-orbit coupling or lattice distortions, leading to persistent magnetism with finite critical temperatures [68]. By mapping the relevant intersite interactions onto effective lattice models, real 2D magnetic vdW materials thus serve as an ideal platform to theoretically investigate emerging physical behaviors with their respective spin Hamiltonians. Depending on the spin degree of freedom, the isotropic Heisenberg model (in the absence of magnetic anisotropy), the XY-model (easy-plane anisotropy) or the Ising model (easy-axis anisotropy) are appropriate to study the underlying physics.

In addition to empirically fitting the effective model parameters to experimental measurements, electronic structure methods have been employed to extract them from first principles calculation, in particular by computing the exchange coupling J_ij that describe the interactions between the spins on site i and j of the lattice (see 3.1.2). Depending on the specific method employed and whether bulk or monolayer CrI₃ is considered, the exact values of J_ij may somewhat differ [59, 75–77]. In Solovyev et al [77], the intralayer parameters are denoted with ${J}_{1}^{\,\rm{Solovyev}\,}$ , ${J}_{3}^{\,\rm{Solovyev}\,}$ , and ${J}_{6}^{\,\rm{Solovyev}\,}$ , whereas the interlayer interactions are denoted with ${J}_{2}^{\,\rm{Solovyev}\,}$ , ${J}_{4}^{\,\rm{Solovyev}\,}$ , and ${J}_{5}^{\,\rm{Solovyev}\,}$ . In this work, we approximate a single layer CrI₃ with a two-dimensional hexagonal lattice by neglecting all interlayer exchange interactions, and by only taking into account the NN and NNN intralayer parameters ${J}_{1}^{\,\rm{Solovyev}\,}$ and ${J}_{3}^{\,\rm{Solovyev}\,}$ , respectively, which in our notation are renamed to ${J}_{1}={J}_{\langle ij\rangle }={J}_{1}^{\,\rm{Solovyev}\,}$ and ${J}_{2}={J}_{\langle \langle ij\rangle \rangle }={J}_{3}^{\,\rm{Solovyev}\,}$ at the beginning of this section.

The current state-of-the-art classical method for simulating spin systems is based on Matrix Product States (MPS) paired with the Density Matrix Renormalization Group (DMRG) method [78]. The amount of entanglement an MPS can describe is bounded by the logarithm of its bond dimension χ, and therefore it is only efficient in one-dimensional systems, since their ground states follow an area law which can be efficiently represented by an MPS in one dimension [42, 79]. In higher dimensions, the area law conjecture states that constant-gapped and locally interacting lattice quantum systems also satisfy the area law, thus leading to the requirement that the bond dimension of the MPS scales exponentially with system size⁷ [2, 80]. While MPS are one-dimensional representatives of the family of tensor networks, their higher-dimensional extensions, such as projected entangled pair states [81, 82] or states generated by entanglement renormalization [83], can describe two-dimensional systems that satisfy an area law. Unfortunately, unlike for MPS in one dimension, computing expectation values with Projected Entangled Pair States (PEPS) in two dimensions is in general inefficient and approximate methods for the tensor contractions have to be employed [84, 85]. Therefore, spin systems in two or higher dimensions are natural candidates for the application of ground state preparation quantum algorithms such as QITE.

3.1.2. Transverse-field Ising model

We consider the Transverse-Field Ising Model (TFIM), which is a paradigmatic model for the appearance of quantum phase transition at a critical point at zero temperature for short- and long-range interactions [86–88] and has recently been studied at scale with a neutral atom quantum simulator [89]. A similar model on a triangular lattice has been studied using QITE in [90]. Its Hamiltonian is given by

$\begin{eqnarray}&&\hat{H}=\displaystyle \sum _{i\lt j}{J}_{ij}{\hat{\sigma }}_{i}^{x}{\hat{\sigma }}_{j}^{x}+B\displaystyle \sum _{i}{\hat{\sigma }}_{i}^{z},\end{eqnarray} \tag{ 31 }$

where B sets the strength of the transverse magnetic field. The TFIM is an extension of the classical Ising model (that does not include a transversal field) and, unlike its classical version, displays a quantum phase transition at criticality even in one dimension. For weak long-range interactions in equation (30) with α ≥ 1, the one-dimensional TFIM can be described well by a classical GHF method, while in the long-range regime α < 1 the GHF solutions start deviating from DMRG results as α gets closer to the uniform interaction limit [91]. Thus, the TFIM is a model where we can apply QITE to a problem where we can control the quality of the initial state ∣Ψ_init〉 simply by tuning the parameter α.

3.1.3. Fermi-Hubbard model

The Hubbard model was introduced to describe the Mott transition from a conductor to an insulator and is one of the simplest many-particle theories that cannot be reduced to a single particle theory without resorting to some sort of approximation. We consider a minimal one-dimensional Hubbard model with L sites for a band structure in the atomic limit at half filling, n_el = L, whose Hamiltonian is given by [62]

$\begin{eqnarray}&&\hat{H}=-t\displaystyle \sum _{\langle p,q\rangle }{\hat{E}}_{pq}+\frac{U}{2}\displaystyle \sum _{p=1}^{L}\left({\hat{E}}_{pp}^{2}-{\hat{E}}_{pp}\right)\end{eqnarray} \tag{ 32 }$

with periodic boundary conditions, where t describes the hopping and U the onsite interaction parameter, and we introduced the fermionic orbital excitation operators as defined in equation (25). Here, the indices p, q describe the sites of the lattice containing L fermionic sites and 〈p, q〉 describes NN sites in the lattice. As we can see from the definition of the fermionic excitation operators ${\hat{E}}_{pq}$ in equation (25), the spin is implicitly contained in the operators. While the Hamiltonian in equation (32) is exactly solvable in one dimension [92], it serves as first test for fermionic-QITE applied to a local fermionic lattice Hamiltonian, before moving on to the more complicated molecular electronic structure Hamiltonians in section 3.2.

3.2. Molecular electronic structure Hamiltonians

As a rather simple atomic many-body system, that is well described by a single Slater determinant, i.e., a single-reference case, we investigate the neon atom. The selected (8,8) Active Space (AS) comprises the doubly occupied 2s and 2p as well as the unoccupied 3s and 3p atomic orbitals. Dunning’s correlation-consistent basis set of valence double-ζ quality (cc-pVDZ) is employed in our calculations [93].

Significantly more challenging is the electronic structure of the Fe(III)-NTA molecule, where the trianion obtained by deprotonating the chelating agent nitrilo triacetic acid (NTA) [94–96] forms a coordination complex with the iron(III) ion. Chelating agents are produced on an industrial scale due to their significant technical relevance. They are used in many applications, e.g., as water softeners in household applications, for the selective extraction of metals in mining, or as ligands for catalysts [97–102]. In these applications, chelating agents generally function by binding to a metal center, such as a transition metal ion. An example is the chelate complex Fe(III)-NTA mentioned above. In this work, we investigate the low- and intermediate-spin states of this complex with one and three unpaired electrons, respectively, which pose a challenge to many widely used electronic structure methods such as density functional theory (DFT) due to their enhanced multi-reference character [103]. Respective computationally optimized structures are taken from [103] and are given in Appendix F for completeness. The selected (5,5) AS comprises five molecular orbitals with predominantly iron 3d character. Basis sets of quadruple-ζ valence quality with two sets of polarization functions (def2-QZVPP) are employed [104, 105].

We furthermore investigate three molecular forms of oxygen, namely singlet and triplet molecular oxygen as well as ozone. Singlet oxygen, abbreviated as O₂ (0) in this work, is a highly reactive molecule formed by electronic excitation of triplet oxygen, O₂ (2). Another reactive form of oxygen is ozone, O₃, which is typically formed by a photochemical reaction in the atmosphere and is responsible for significant material damage worldwide [106]. In particular singlet oxygen and ozone are known to exhibit significant multi-reference character which poses a challenge to single-reference electronic structure methods [107–114]. Thus, both systems are suitable cases to study in this work. Experimentally determined structures are taken from [115] and [116] and are given in Appendix F for completeness. For all three forms of oxygen, the selected AS comprises all molecular orbitals with predominantly atomic 2p character. This results in a (8,6) AS for singlet and triplet oxygen and a (12,9) AS for ozone. Dunning’s correlation-consistent basis set of valence quadruple-ζ quality (cc-pVQZ) is used [93].

We consider the electronic structure Hamiltonian of the systems described above in a single particle basis of size n_orb,full containing n_{el, full} electrons. As single particle basis, we decided to use the orbitals obtained from a complete active space self-consistent field (CASSCF) calculation employing PySCF. Using the AS characterized by the tuple (n_el, n_orb), one can rewrite the original second quantized Hamiltonian as a Hamiltonian containing only n_orb ≤ n_orb,full orbitals and n_el ≤ n_{el, full} electrons [117],

$\begin{eqnarray}\begin{array}{rcl}\hat{H} & = & \displaystyle \sum _{p,q=1}^{{n}_{\,\rm{orb}\,}}{\tilde{h}}_{pq}{\hat{E}}_{pq}+\frac{1}{2}\displaystyle \sum _{p,q,r,s=1}^{{n}_{\,\rm{orb}\,}}{\tilde{g}}_{pqrs}\left({\hat{E}}_{pq}{\hat{E}}_{rs}-{\hat{E}}_{ps}{\delta }_{qr}\right)\\ & & +{\tilde{h}}_{\,\rm{nuc}\,}{\bf{1}}.\end{array}\end{eqnarray} \tag{ 33 }$

The influence of the frozen core orbitals and its electrons is implicitly contained in the corresponding active-space one- and two-electron integrals ${\tilde{h}}_{pq}$ and ${\tilde{g}}_{pqrs}$ , as well as a modified constant ${\tilde{h}}_{\,\rm{nuc}\,}$ .

In order to get a good initial state for solving for the ground state of the AS Hamiltonian, we perform a Restricted Open-shell Hartree–Fock (ROHF) calculation of equation (33) using PySCF. An internal stability analysis of the ROHF solution is performed. Note, that we are here searching for a Slater determinant of given spin which minimizes the energy expectation value of the AS Hamiltonian in equation (33), and not the Hamiltonian of the full system. This ROHF energy will thus in general be higher than the ROHF energy of the full molecular orbital basis ${E}_{\rm{ROHF}\,}^{\,\rm{full}}$ , presented in table 1, and the two methods become identical when the AS includes all orbitals, n_orb = n_orb,full.

4. Initial state preparation

A prerequisite of QITE and ITE is the ability to provide an initial state ∣Ψ_init〉 that has a non-zero overlap with the desired ground state, γ_init > 0. In this section, we detail how we provide the initial state for the systems introduced in section 3.

4.1. Initial states for molecular electronic structure Hamiltonians

Hartree–Fock (HF) theory provides a good starting point for many molecules, especially those which are well described by a single Slater determinant [118]. There are various flavors to HF theory, such as Restricted HF (RHF) and Unrestricted HF (UHF), which treat the contributions of the spin-up and spin-down orbitals either equally, or independently. Another version of HF is called Restricted Open-shell HF (ROHF), which uses doubly occupied orbitals for paired and singly occupied orbitals for unpaired electrons, and can be applied to open-shell molecules as an alternative to UHF [119]. When applied to a closed shell system, ROHF coincides with RHF. The ROHF orbitals which describe the solution of the method are not unique, which means that different canonical orbitals lead to the same ROHF energy.

We use ROHF in order to obtain our initial states ∣Ψ_init〉 for the AS Hamiltonian in equation (33) and PySCF in order to generate the ROHF solutions. For molecular electronic structure Hamiltonians, applying ROHF to the AS Hamiltonian in equation (33) means that one will obtain a set of ROHF orbitals which are optimized in order to find the Slater determinant that minimizes the energy of the AS Hamiltonian (and not of the original Hamiltonian in the full molecular orbital basis). We will denote the resulting energies as E_ROHF. The reason for us to perform this ROHF calculation is that it leads to overlaps γ_init which we find to be significantly larger than those when directly using the orbitals obtained from a CASSCF calculation.

The solutions of a HF calculation can be expressed as a Slater determinant,

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle ={\hat{d}}_{1}^{\dagger }{\hat{d}}_{2}^{\dagger }\cdots {\hat{d}}_{{n}_{\rm{el}\,}}^{\dagger }| \,\rm{vac}\rangle ,\end{eqnarray} \tag{ 34 }$

where ∣vac〉 denotes the fermionic vacuum state (i.e. the zero eigenstate of the fermionic annihilation operator) and ${\hat{d}}_{k}^{\dagger }$ denotes fermionic creation operators which are rotated into the obtained HF orbitals. Slater determinants in an arbitrary orbital basis can be generated efficiently on a quantum computer by means of Givens rotations [16]. They are representatives of a special class of Fermionic Gaussian States (FGS), which we introduce below in section 4.3.1 and describe in detail in appendix A.1.

Assessing the quality of the initial state for larger molecules Since we require that the initial state possesses a non-vanishing overlap γ_init, it is desirable to have a diagnostic tool at hand that allows one to get a feeling of whether or not a HF solution would be a good starting point for the QITE or ITE evolution. In the context of molecular electronic structure theory, a good indicator is given by the amount of static correlation within a system. A large degree of static correlation indicates that more than one electronic configuration will have a large amplitude in the electronic wave function representing the ground state of the molecule, i.e. the electronic wave function cannot be reliably described by a single Slater determinant. Naturally, since one does not have access to the true ground state, heuristic methods based on approximate solutions have to be considered. Common diagnostic tools for estimating the amount of static correlation are the so-called T₁ and D₁ diagnostics, which rely on a coupled cluster wave function [120, 121]. In this work, we consider a different diagnostic tool, which is closely related to the mutual information introduced in section 2.4 as a means for picking the domains within the QITE algorithm. It can be computed from the single-orbital entropy

$\begin{eqnarray}&&{s}_{i}(1)=-\displaystyle \sum _{\kappa =1}^{4}{\omega }_{\kappa ,i}\mathrm{ln}({\omega }_{\kappa ,i}),\end{eqnarray} \tag{ 35 }$

where ω_κ,i are the eigenvalues of the one-orbital reduced density matrix of orbital i and κ denotes all four possible spin configurations of the orbital. The entanglement based multi-configurational diagnostic we use is then given by [61]

$\begin{eqnarray}&&{Z}_{s(1)}=\frac{1}{L\mathrm{ln}(4)}\displaystyle \sum _{i=1}^{L}{s}_{i}(1),\end{eqnarray} \tag{ 36 }$

where L = n_orb. Since the maximal amount of entanglement a single orbital can have is upper bounded by $\mathrm{ln}(4)$ , the diagnostic in equation (36) gives 0 (1) for a respective wave function displaying zero (maximal) entanglement. Thus, one would expect that the ROHF solution of the AS of systems with a low (high) Z_s(1) value will provide a good (poor) initial state. The values of Z_s(1) for the chemical systems we study are reported in table 3.

In order to be able to evaluate equation (36), one needs access to an approximate wave function of the actual ground state. We will use an MPS representation of the true ground state ∣Ψ_gs〉 for computing equation (36), just as we do for computing the mutual information from equation (28). As before, we can afford this because the system sizes we consider in our examples are small enough. For larger systems, the MPS of the wave function would be replaced by an MPS approximation thereof. Details on the connection between the mutual information and the single- and two-orbital entropy can be found in appendix C.

Note, that like most diagnostics, also equation (36) is not without error, and cases where the diagnostic might suggest that single-reference approaches such as coupled cluster should fail (i.e. when Z_s(1) has a value close to one), are able to resolve the energy within chemical accuracy. Furthermore, the method is most reliable when the number of electrons in the system is even and identical to the number of orbitals, i.e. n_el = L [44]. For cases where n_el ≠ n_orb, the value of Z_s(1) will i.g. be artificially lowered.

4.2. Initial states for fermionic lattice Hamiltonians

In our simulations of the FHM, we use an initial state ∣Ψ_init〉 which is the result of a ROHF calculation carried out with PySCF, and is thus of the form of equation (34). In the limit of U/t ⪅ 1, a mean-field solution will provide a reasonably large overlap with the true ground state for system sizes which are still amenable to exact diagonalization [122], which is why we choose U = t = 1 for our simulations. Note, that HF will provide worse initial states for Fermi-Hubbard models as U gets larger than t. In this regime, it is known that the Gutzwiller wave function would be a desirable initial state [123], however its non-unitary generator makes it challenging to implement on a quantum computer. While approaches exist which suggest how to prepare an appropriate initial state for the FHM, they either are based on variational algorithms to be executed on a quantum device [122, 124], or on a fault tolerant quantum algorithm subroutine that transforms the non-unitary generator of the Gutzwiller wave function into a linear combination of unitaries by means of a Hubbard-Stratonovich transformation [125]—neither of which satisfies both requirements that ∣Ψ_init〉 should be efficiently computable on a classical computer and efficiently implementable on a quantum computer. Assessing if initial states that include non-factorizable correlations beyond fermionic Gaussian provides any advantages is a particularly interesting hypothesis to address [31].

4.3. Initial states for spin lattice Hamiltonians

We introduce the classical mean-field method which we use to produce a classical initial state for both, fermionic and spin Hamiltonians in section 4.3.1 and then describe how this can be applied to a general spin Hamiltonian in section 4.3.2. A detailed description is provided in Appendix A.

4.3.1. Generalized Hartree–Fock theory

Section 4.1 describes a special cases of Generalized Hartree–Fock theory (GHF) [126], which is the subject of this section. GHF describes a procedure for finding the Fermionic Gaussian State (FGS) that possesses the lowest energy expectation value of a given Hamiltonian. A pure FGS of L fermionic modes can be written as

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle ={\hat{U}}_{{\bf{Q}}}| \,\rm{vac}\,\rangle ,\end{eqnarray} \tag{ 37 }$

where Q ∈ O(2L) is a matrix that fully characterizes the FGS, and ${\hat{U}}_{{\bf{Q}}}$ is the generator of the FGS which can be written as the exponential of a quadratic polynomial of Majorana operators whose coefficients depend on Q, as detailed in Appendix A.3. An FGS is fully described by its respective (2L × 2L) covariance matrix Γ which is related to equation (37) through

$\begin{eqnarray}&&{{\rm{\Gamma }}}_{\mu \nu }=\frac{i}{2}\langle {{\rm{\Psi }}}_{\,\rm{FGS}\,}| [{\hat{A}}_{\mu },{\hat{A}}_{\nu }]| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle ,\end{eqnarray} \tag{ 38 }$

where μ, ν ∈ [2L] and $\hat{{\bf{A}}}={({\hat{A}}_{1},\ldots ,\,{\hat{A}}_{2L})}^{T}={({\hat{a}}_{1},{\hat{a}}_{3},\ldots ,\,{\hat{a}}_{2L-1},{\hat{a}}_{2},{\hat{a}}_{4},\ldots ,\,{\hat{a}}_{2L})}^{T}$ is a vector of Majorana operators that are defined as ${\hat{a}}_{2p-1}={\hat{c}}_{p}^{\dagger }+{\hat{c}}_{p}$ and ${\hat{a}}_{2p}=i({\hat{c}}_{p}^{\dagger }-{\hat{c}}_{p})$ for p ∈ [L]. As shown in [18] and detailed in Appendix A.1-A.2, one can find the covariance matrix Γ which describes the state in equation (37) with minimal energy w.r.t. a given fermionic Hamiltonian at a classical computational cost of ${ \mathcal O }({L}^{3})$ (attributed to the evaluation of the Pfaffian of the associated covariance matrix). The iterative method for finding the minimum is based on ITE of the covariance matrix and requires the evaluation of the energy expectation value of the fermionic Hamiltonian w.r.t. a given FGS, as well as its gradient, which both can be computed efficiently on a classical computer through its covariance matrix using Wick’s theorem [127]. Given an optimized covariance matrix Γ describing a pure FGS, we derive the explicit form of ${\hat{U}}_{{\bf{Q}}}$ in equation (37) for both possible cases, Γ describing an odd (even) parity FGS, i.e. Pf(Γ) = + 1 (−1). This then enables one to construct the initial FGS ∣Ψ_init〉 = ∣Ψ_FGS〉 on a quantum computer using ${ \mathcal O }({L}^{2})$ gates and a circuit depth that scales as ${ \mathcal O }(L)$ [16].

4.3.2. GHF applied to a generic spin Hamiltonian

In order to apply the GHF to a general spin Hamiltonian, one has to first map the spin Hamiltonian to a fermionic Hamiltonian through e.g. the Jordan-Wigner transformation [128]. This can lead to fermionic Hamiltonians that are polynomials of Majorana operators of degree 2L. In addition, the resulting fermionic Hamiltonian does necessarily have to conserve the fermionic particle number, which is the case for the TFIM studied in this work. In case the resulting fermionic Hamiltonian does in fact conserve the particle number, one can also restrict equation (37) to covariance matrices that conserve the fermionic particle number. These number-conserving FGS are just the Slater determinants we introduced in equation (34) and GHF reduces to HF theory. We note, that GHF can be applied to lattice systems in arbitrary dimension.

5. Results

All lattice and molecular electronic structure systems we consider are described in tables 1 and 2, respectively. All systems considered in this work are initialized in a proper state ∣Ψ_init〉 following the procedures outlined in section 4. This allows us to compare QITE with a classical mean-field approach (i.e. GHF). For every system we perform a trotterized ITE as described in section 2.2, whose evolution QITE is supposed to replicate.

In all plots that display the energy and fidelity, the trotterized ITE (section 2.2) is shown as a gray line, while a red horizontal dashed line corresponds to the exact ground state energy of the studied Hamiltonian, as well as to a fidelity of 1. All other colored lines with markers display the results obtained with the QITE algorithm (section 2.3). The number of markers varies and is not representative of the number of QITE steps, which in turn is given by τ/Δτ. Different colors indicate different domain sizes which are characterized by ν, while different line styles correspond to different discretization step sizes Δτ for fixed ν. In all plots, the left-most energy (fidelity) value corresponds to the energy (fidelity) of the initial state ∣Ψ_init〉 of the GHF or ROHF solutions described in section 4. All relevant ITE, GHF and ROHF energy and fidelity results for the lattice and molecular electronic structure systems are summarized in tables 3 and 4, respectively. We are only interested in the ability of spin-QITE and fermionic-QITE to follow the trotterized ITE, and not in a quantitative comparison, which is why we do not include specific energy and fidelity values obtained for the QITE simulations in these tables.

Table 4. Classical GHF mean-field energy E_GHF, the exact ground state energy E₀, as well as first excited state energy E₁ (in arb. units) various spin systems described in the main text. E_ITE denotes the energy at the end of the evolution time τ obtained through a second order trotterized ITE with step size Δτ = 0.1, except for TFIM II, where Δτ = 0.01 was chosen for a better resolution. We also provide the fidelities of the GHF solution w.r.t. the exact ground state F_GHF = ∣〈Ψ_GHF∣Ψ_gs〉∣², as well the the fidelity of the final ITE state F_ITE = ∣〈Ψ_ITE∣Ψ_gs〉∣². All energies are given in arbitrary units ([arb. unit]) and all fidelities are given in percentage ([%]).

System	E_GHF	E₀	E₁	E_ITE	F_GHF	F_ITE
HM I	−17.3380	−18.0618	−16.3688	−18.0581	70.003	99.964
HM II	−15.2421	−15.8914	−14.7898	−15.8870	59.735	99.955
HM III	−21.4323	−23.5846	−22.9006	−23.5726	62.146	99.914
TFIM I	−6.8874	−6.9792	−6.6637	−6.9790	96.410	99.997
TFIM II	−6.9138	−7.1630	−6.6780	−7.1630	91.743	99.999
J₁J₂	−32.3776	−34.4029	−32.8834	−34.3681	39.783	99.824
FHM	−10.4443	−10.6144	−9.5989	−10.6144	96.227	99.999

a. Expansion operators Lattice spin systems are studied using the spin-QITE algorithm 2.3.3 and an operator basis ${\hat{\sigma }}_{{\bf{I}}}\in {\{{\mathbb{1}},{\hat{\sigma }}^{x},{\hat{\sigma }}^{y},{\hat{\sigma }}^{z}\}}^{\otimes {D}_{l}}$ is used for a spin Hamiltonian term $\hat{h}[l]$ in the expansion of equation (15). The fermionic lattice and molecular electronic structure models are studied using the fermionic-QITE algorithm 2.3.4 with an operator expansion of equation (20) in a basis ${\hat{\gamma }}_{{\boldsymbol{\mu }}}$ composed of all single- and double excitation operators defined in equations (26)–(27) over all fermionic sites or orbitals contained in the domain D_l of a given fermionic Hamiltonian term $\hat{h}[l]$ .

b. Exact QITE For spin lattice systems the symbol ν is identical to the Manhattan distance in the lattice, while for FHM and molecular electronic structure models it is identical to the number of additional sites and orbitals, respectively. For all models we will use ν^* to denote the regime where ν is large enough to cover the entire system L for every Hamiltonian term $\hat{h}[l]$ . In principle it should then reproduce the trotterized ITE, and only deviate from the latter in the limit Δτ → 0 in case the operator basis is truncated. In fact, if all Hamiltonian term domains are of the same size L as the system, due to the approximation we make by uniting common domain terms mentioned in section 2.3.2, there is effectively no trotterization error anymore since there is only a single Hamiltonian term $\hat{h}[l]$ to consider, i.e. $\tilde{m}=1$ . However, the set of linear equations equations (16) and (22) for spin- and fermionic-QITE were derived assuming a small step size Δτ. Thus, QITE at the critical value ν^* translates to the task of approximating exact ITE ( ${e}^{-{\rm{\Delta }}\tau \hat{H}}$ ) by a single unitary ${\hat{U}}_{1}$ in a (possibly) truncated operator basis ${\hat{\sigma }}_{{\bf{I}}}$ or ${\hat{\gamma }}_{{\boldsymbol{\mu }}}$ , respectively. Note, that for systems with a finite correlation length ${ \mathcal C }$ , each system will in principle possess a value ${\nu }_{{ \mathcal C }}\lt {\nu }^{* }$ for which no improvement will be observed when increasing beyond ${\nu }_{{ \mathcal C }}$ , and the error is limited by the truncation of the operator basis and truncation of the Taylor expansion of the ITE propagator which lead to equations (16) and (22).

c. Finite operator expansion effects While the operator basis we use for spin-QITE is complete (within D_l), the operator expansion basis for fermionic-QITE introduces an additional approximation which will result in exact-QITE results monotonically converging to energies above the trotterized ITE energy, even for very small step sizes Δτ.

d. Long time behavior For readability, we often only include the short-time behavior in the main text and move the plots that show the continued behavior at longer times τ to appendices D and E.

5.1. Lattice systems

5.1.1. Heisenberg model

a. HM I The first system we consider is identical to a system studied in [6] and is used in order to see if the QITE implementation reproduces similar results. As shown in table 2, HM I describes a one-dimensional spin chain with periodic boundary conditions, see figure 5(a), whose Hamiltonian is given by equation (29) with NN interactions J_〈ij〉 = 1 and no magnetic field, B = 0. Figure 6 shows the energies and fidelities of the spin-QITE and trotterized ITE. At the smallest Manhattan distance ν = 1 (i.e. ${\rm{\max }}(| {D}_{l}| )=4$ ), spin-QITE starts deviating from trotterized ITE after only a handful of steps, displaying an oscillatory behavior in both energy and fidelity far from the exact ground state energy. This behavior seems to be independent of the step size, as can be seen by the similar evolution for Δτ = 0.01 and Δτ = 0.001. We find that already at ν = 2 ( ${\rm{\max }}(| {D}_{l}| )=6$ ), spin-QITE is able to reproduce the trotterized ITE evolution and the energy saturates at a value E_QITE > E_ITE which is above the converged trotterized energy value E_ITE for step size Δτ = 0.1. The discrepancy in energy between QITE and trotterized ITE is due to finite value ν, and they should coincide at ν* = 4, where ${\rm{\max }}(| {D}_{l}| )=L$ . One can also see that the trotterized ITE lies slightly above the exact ground state energy, which is due to the trotterization error discussed in equation (5), and can be removed by either using a smaller step size for the evolution, or using a higher order Trotter formula. The results agree well with the results reported in [6].

Figure 6. Refer to the following caption and surrounding text. — **Figure 6.** QITE and trotterized ITE energy (top) and corresponding fidelities (bottom) for imaginary time step sizes Δτ and Manhattan distances ν for system HM I of table 2, which describes a one-dimensional spin system of L = 10 spins with NN interactions and periodic boundary conditions. Here, ν = 1 (2) corresponds to ${\rm{\max }}(| {D}_{l}| )=4$ (6).
Download figure:
Standard image High-resolution image

**Figure 6.** QITE and trotterized ITE energy (top) and corresponding fidelities (bottom) for imaginary time step sizes Δτ and Manhattan distances ν for system HM I of table 2, which describes a one-dimensional spin system of L = 10 spins with NN interactions and periodic boundary conditions. Here, ν = 1 (2) corresponds to ${\rm{\max }}(| {D}_{l}| )=4$ (6).
Download figure:
Standard image High-resolution image

b. HM II In figure 7 we show the results obtained for the one-dimensional long-range Hamiltonian that describes system HM II, where the exponent in equation (30) that sets the strength of the interaction is α = 1. Note, that since the Hamiltonian is long-range, the resulting domain sizes are larger than for NN Hamiltonians for the same value ν. Our numerical simulations were limited to ν = 1 not just for this model, but also for all lattice models that either included beyond-NN interactions or a lattice dimension larger than one, due to the large computational demands that the scaling ${4}^{| {D}_{l}| }$ leads to. We probe the QITE evolution for discretization step sizes Δτ = 0.1 and Δτ = 0.01, respectively. We find that for both realizations, QITE almost immediately deviates strongly from the trotterized ITE, indicating that, just as with the HM I model, ν = 1 is not sufficient. A QITE simulation with ν = 2 would here lead to ${\rm{\max }}(| {D}_{l}| )=10$ , thus a 256-fold increase in computational cost (both on the classical side, and regarding the number of operators that need to be measured by the quantum computer). The long-term behavior shows an oscillatory behavior with a fidelity that quickly drops off to zero, see figure 19.

Figure 7. Refer to the following caption and surrounding text. — **Figure 7.** Results for system HM II of table 2, which describes the one-dimensional spin system of L = 10 spins of figure 5(a) with long-range interactions α = 1 and periodic boundary conditions. Here, ν = 1 corresponds to ${\rm{\max }}(| {D}_{l}| )=6$ .
Download figure:
Standard image High-resolution image

**Figure 7.** Results for system HM II of table 2, which describes the one-dimensional spin system of L = 10 spins of figure 5(a) with long-range interactions α = 1 and periodic boundary conditions. Here, ν = 1 corresponds to ${\rm{\max }}(| {D}_{l}| )=6$ .
Download figure:
Standard image High-resolution image

c. HM III In figure 8 we show the results obtained for the Heisenberg model on a two-dimensional triangular lattice with periodic boundary conditions in one direction and two rows of six spins each, leading to a total of L = 12, as shown in figure 5(b). Triangular lattices are of particular interest when studying antiferromagnetic systems, since they can lead to frustration [89]. Like HM I, the system HM III only considers antiferromagnetic NN interactions. Unlike HM I, QITE is already stable for ν = 1 which corresponds to a domain size of ${\rm{\max }}(| {D}_{l}| )=7$ ⁸ . The discrepancy between the converged QITE energy E_QITE and the converged trotterized ITE energy E_QITE can be explained by the small domain size.

Figure 8. Refer to the following caption and surrounding text. — **Figure 8.** Results for system HM III of table 2, which describes the two-dimensional spin system of figure 5(b) with NN interactions and periodic boundary conditions along the main axis. Here, ν = 1 corresponds to ${\rm{\max }}(| {D}_{l}| )=7$ .
Download figure:
Standard image High-resolution image

**Figure 8.** Results for system HM III of table 2, which describes the two-dimensional spin system of figure 5(b) with NN interactions and periodic boundary conditions along the main axis. Here, ν = 1 corresponds to ${\rm{\max }}(| {D}_{l}| )=7$ .
Download figure:
Standard image High-resolution image

5.1.2. Transverse-field Ising model

a. TFIM I In figure 9 we present results of the simulation of the TFIM I system, which describes the transverse-field Ising model with long-range interactions in the strong-long-range regime α = 0.3 [87]. For the one-dimensional chain of a system displaying long-range interactions, ${\rm{\max }}(| {D}_{l}| )=6$ for ν = 1. Similar to the HM II model which also describes long-range interactions, one can observe that for ν = 1 the QITE quickly deviates from the monotonic behavior of trotterized ITE, independent of the time step size being Δτ = 0.1 or Δτ = 0.01. Thus, as observed in the long-range HM II, a larger ν is required in order to be able to improve the initial state overlap. The long-time behavior up to τ = 10 is presented for completeness in figure 20.

Figure 9. Refer to the following caption and surrounding text. — **Figure 9.** Results for system TFIM I of table 2, which describes the one-dimensional spin system of figure 5(a) with strong long-range interactions α = 0.3 and periodic boundary conditions. Here, ν = 1 corresponds to ${\rm{\max }}(| {D}_{l}| )=6$ .
Download figure:
Standard image High-resolution image

b. TFIM II System TFIM II using the same lattice geometry as TFIM I considers the even stronger long-range regime α = 0.1, reflected in the lower initial state overlap. The step sizes we consider here are Δτ = 0.01 for the trotterized ITE and Δτ = ∈ {0.01, 0.001} for QITE. The smaller step sizes are chosen in order to be able to resolve if an improvement over the initial state overlap can be achieved in the first part of the evolution. The results display in figure 10 show that QITE does indeed improve until τ ≈ 0.1, before quickly dropping off to zero similar to figure 20.

Figure 10. Refer to the following caption and surrounding text. — **Figure 10.** QITE energy and fidelity of TFIM II, similar to figure 9, but for smaller step sizes Δτ and in the even stronger long-range regime α = 0.1.
Download figure:
Standard image High-resolution image

5.1.3. J₁J₂-model

The last spin system we consider is the two-dimensional J₁J₂-model on a hexagonal lattice as depicted in figure 5(c) with NN and NNN interactions. Similar to the previous NN models shown in figures 6 and 8, the results for this system in figure 11 display an initially stable behavior of QITE regarding the energy, until τ ≈ 0.5, even for the smallest non-trivial Manhattan distance ν = 1, which corresponds to ${\rm{\max }}(| {D}_{l}| )=7$ . Interestingly, the fidelity continues to improve well beyond the point where the energy starts growing again, and peaks at around τ ≈ 2.5, before worsening again. We observe that the fidelity can be increased from around 40% to 90% even though we are only able to simulate the smallest Manhattan distance ν = 1. We also note that this presents heuristic evidence, that the behavior of the energy might not be a reliable tool for determining the performance of QITE, whose main purpose is to amplify the overlap γ, and not necessarily the improvement of the energy.

Figure 11. Refer to the following caption and surrounding text. — **Figure 11.** QITE energy and fidelity for a 3x2 honeycomb lattice displayed in figure 5(c) of the J₁J₂ model for ν = 1 and ${\rm{\max }}(| {D}_{l}| )=7$ .
Download figure:
Standard image High-resolution image

Note, that this is only considering a simplified model of the CrI₃ description introduced in section 3.1.1 with periodic boundary in one direction. In order to be able to simulate the larger lattice 5(d) of CrI₃ with periodic boundary conditions in both directions and include a study of Manhattan distance of ν = 2, the code base would have to be improved significantly since this would lead to domain sizes of ${\rm{\max }}(| {D}_{l}| )=14$ .

5.1.4. Fermi-Hubbard model

As a proof of principle, we study the one-dimensional FHM system described in table 2, whose Hamiltonian is given by equation (32) and allows for hopping between NN sites. The ITE and QITE results are displayed in figure 12. Here, ν represents the number of additional sites to be included in the domain of a Hamiltonian term (in addition to the Hamiltonian term’s support). Each site contains two fermionic modes, so the system we consider with L = 10 corresponds to 20 fermionic modes. For the FHM, ν describes the number of additional sites added to the domain, which are picked according to their Manhattan distance from the support sites, as explained in section 2.3.2.

Figure 12. Refer to the following caption and surrounding text. — **Figure 12.** QITE energy for various imaginary time step sizes Δτ and a number of ν additional sites for a 1D Fermi-Hubbard model consisting of L = 10 sites (i.e. 20 qubits) with NN coupling and periodic boundary conditions, with Hamiltonian parameters t = U = 1. The operator expansion basis is $\{i({\hat{E}}_{pq}-{\hat{E}}_{pq}^{\dagger })\}\cup \{i({\hat{E}}_{pq}{\hat{E}}_{rs}-{\hat{E}}_{rs}^{\dagger }{\hat{E}}_{pq}^{\dagger })\}$ , where in this particular case the domain size is identical to the size of the support plus ν, where ν denotes the number of additional sites (closest in Manhattan distance) included.
Download figure:
Standard image High-resolution image

**Figure 12.** QITE energy for various imaginary time step sizes Δτ and a number of ν additional sites for a 1D Fermi-Hubbard model consisting of L = 10 sites (i.e. 20 qubits) with NN coupling and periodic boundary conditions, with Hamiltonian parameters t = U = 1. The operator expansion basis is $\{i({\hat{E}}_{pq}-{\hat{E}}_{pq}^{\dagger })\}\cup \{i({\hat{E}}_{pq}{\hat{E}}_{rs}-{\hat{E}}_{rs}^{\dagger }{\hat{E}}_{pq}^{\dagger })\}$ , where in this particular case the domain size is identical to the size of the support plus ν, where ν denotes the number of additional sites (closest in Manhattan distance) included.
Download figure:
Standard image High-resolution image

For ν = 1, we observe an early deviation of QITE from ITE similar to the HM I system in figure 6. This can be an effect that is due to the fact that ν = 1 considers only one of the two neighboring sites (chosen randomly) of the Hamiltonian term support in figure 2(b). For ν = 2, both the left- and right-hand side sites are included, which corresponds to a Manhattan distance radius of size one. The behavior of ν = 2 already leads to an almost exact replication of the trotterized ITE. Interestingly, as shown in figure 21, we find that the long-time behavior is less stable for larger domain sizes ν = 4 than for ν = 2 and ν = 3.

5.2. Molecular electronic structure systems

We now turn to study the behavior of fermionic-QITE 2.3.4 applied to the AS of the molecular electronic structure Hamiltonians defined in equation (33). The molecular and atomic systems are listed in Table 1 and energies and fidelities of the ROHF and trotterized ITE methods are presented in table 3. This table also contains the multi-configurational diagnostic Z_s(1) defined in equation (36). Similar to the lattice systems, we do not give explicit numbers for the QITE energies and fidelities, since we are only interested in their ability to follow the trotterized ITE, and not in an exact quantitative comparison.

Each molecular electronic structure Hamiltonian is characterized by the size of the AS L = n_orb and the number of electrons n_el within the AS. A molecular electronic structure Hamiltonian of an atom/molecule in a basis of L orbitals contains ${L}^{{\prime} }=2L$ spin-orbitals. The details of the configurations of the molecules we consider can be found in section 3.2 and Appendix F. The mutual information of the ground state of the respective systems, used in order to pick the orbitals in the fermionic-QITE formulation, are given in figure 4 and Appendix G.

5.2.1. Neon

The first electronic structure system we study is the Neon atom described in section 3.2. The AS is characterized by (n_el, n_orb) = (8, 8), and the mutual information of its ground state is shown in figure 30. Figure 13 shows the results of fermionic-QITE for a range of domain sizes which are characterized by the number of additional orbitals ν included to the orbitals of the support. One can see that at step size Δτ = 0.01 all ν initially lead to an improvement of the fidelity and energy, but only the exact fermionic-QITE simulation ν^* = 7, where the domain size is always identical to the number of AS orbitals ∣D_l∣ = L, is stable. One also observes that the discretization step size Δτ = 0.01 leads to an almost exact replication of the trotterized ITE behavior for ν^*, compared with a more coarse grained discretization step size Δτ = 0.1, where the energy is decreasing at a lower rate. Since at ν^* there is no trotterization error anymore, the only difference between QITE and ITE is the error due to the truncated operator expansion and the error from the taylor expansion of the imaginary time propagator. The long-time behavior of the evolution is shown in figure 22. The low diagnostic value Z_s(1) = 0.0441 in table 3 indicates that a single Slater determinant could provide an initial state with a large fidelity, which is confirmed by the large fidelity F_ROHF ≈ 97.9%.

Figure 13. Refer to the following caption and surrounding text. — **Figure 13.** QITE energy and fidelity for various imaginary time step sizes Δτ and domains characterized by ν for the (8, 8) AS of Ne with zero unpaired electrons. The operator expansion basis is $\{i({\hat{E}}_{pq}-{\hat{E}}_{pq}^{\dagger })\}\cup \{i({\hat{E}}_{pq}{\hat{E}}_{rs}-{\hat{E}}_{rs}^{\dagger }{\hat{E}}_{pq}^{\dagger })\}$ , where the domain size is identical to the size of the support plus ν, where ν is denotes the number of additional orbitals included, where the latter are chosen as the largest entries in the p columns of the mutual information matrix I(: , p), where p are the orbitals contained in the respective support of the Hamiltonian terms.
Download figure:
Standard image High-resolution image

Figure 14. Refer to the following caption and surrounding text. — **Figure 14.** QITE energy and fidelity as in figure 13 but for the (5, 5) AS of Fe(III)-NTA with one unpaired electron.
Download figure:
Standard image High-resolution image

5.2.2. Fe(III)-NTA

We now consider the fermionic-QITE results for the (5, 5) AS of the Fe(III)-NTA (0) system described in table 1 and section 3.2. The mutual information used for picking the orbitals of the domain are given in figure 31. The initial state has a fidelity of F_ROHF = 89% and the fidelity only gradually increases. Unlike the results for Ne (see section 5.2.1), all approximate QITE simulations with ν < ν^* are at the beginning following the trotterized ITE evolution and start to deviate from it around τ = 0.5. Interestingly, a finer step size Δτ = 0.01 leads to an earlier deviation of the approximate QITE from trotterized ITE in comparison to Δτ = 0.1. The only stable behavior can be observed for exact QITE at ν^* = 4.

In figure 15 we show the results for Fe(III)-NTA with three unpaired electrons. The mutual information is depicted in figure 32 and shows significantly less overall correlation than in the case of only one unpaired electron of figure 31. One can see that approximate QITE of system Fe(III)-NTA (3) is much more stable than Fe(III)-NTA (1), and only starts to deviate at a time τ ≈ 5 (compared to τ ≈ 0.5 for Fe(III)-NTA (1)), see figure 24. In addition, Z_s(1) = 0.049 and F_ROHF ≈ 97.6% for three unpaired electrons, while for one unpaired electron Z_s(1) = 0.1839 and F_ROHF ≈ 89%. Therefore, the high spin state has significantly less static correlation than the low spin state.

Figure 15. Refer to the following caption and surrounding text. — **Figure 15.** QITE energy and fidelity as in figure 14 but for the (5, 5) AS of Fe(III)-NTA with three unpaired electrons.
Download figure:
Standard image High-resolution image

5.2.3. Oxygen and ozone

We now consider the three molecular forms of oxygen introduced in section 3.2. Figure 16 shows the short-term behavior of singlet oxygen O₂. Similar to the strongly correlated Fe(III)-NTA complex with one unpaired electron, trotterized ITE produces a state whose fidelity increases only at a very slow rate. The only stable QITE realization is exact QITE which includes up to ν^* = 4 orbitals to the support. In order to check if the ITE actually reaches the ground state, we simulate the long-time behavior of QITE and trotterized ITE in figure 25 until an evolution time τ = 10 and perform a simulation of the exact (un-trotterized) ITE up to τ = 100 in figure 26 to confirm that the ground state of O₂(0) is reached in the long-time limit of the ITE. The fidelity of the ROHF solution w.r.t. the ground state F_ROHF ≈ 46% is by far the lowest across all studied molecular instances, see table 3, and the multi-reference diagnostic also gives the highest value Z_s(1) = 0.2607 across all chemical systems, indicating the strong multi-reference character of singlet oxygen. It should be noted, that for this system, we also find that the first excited state of the AS Hamiltonian possesses a fidelity of approximately F = 45% w.r.t. the ROHF solution. Figure 17 shows the results for triplet oxygen, O₂ (2). Even the domain size at ν = 1 is able to improve the fidelity of the evolved state from initially F_ROHF ≈ 93% to F ≈ 98%. The overall mutual information of the ground states of the singlet oxygen (see figure 33) also shows that it is significantly larger than that of triplet oxygen (see figure 34), similar to the behavior of the Fe(III)-NTA systems. We want to stress that we are not implying that the two systems, Fe(III)-NTA and oxygen are in any way similar, only that the qualitative behavior of their (Q)ITE is similar.

Figure 16. Refer to the following caption and surrounding text. — **Figure 16.** QITE energy and fidelity of the (8, 6) AS of O₂ system with zero unpaired electrons.
Download figure:
Standard image High-resolution image

Figure 17. Refer to the following caption and surrounding text. — **Figure 17.** As in figure 16, but now considering two unpaired electrons.
Download figure:
Standard image High-resolution image

The last system we consider is singlet molecular ozone in a (12, 9) AS in figure 18. Its respective long time behavior and mutual information are shown in figures 28 and 4, respectively. The multi-reference diagnostic and ROHF fidelity are Z_s(1) = 0.186 and F_ROHF ≈ 81%, indicative of a strong multi-reference character of the ground state. This is similar to what has been observed for the smaller AS of the Fe(III)-NTA (1) system in section 5.2.2. In the O₃ system, the smaller step size Δτ = 0.01 simulations of QITE were systematically better performing than the more coarse grained size Δτ = 0.1 simulations, and even the smallest domains at ν = 1 initially improves the fidelity until τ ≈ 0.5. The fact that exact QITE, here at ν^* = 8, still displays a gap to the exact ITE is due to the first-order approximation in terms of the step size Δτ when setting up the linear system of equations in equation (22). This can be seen more clearly in figure 29, where we zoom in on the long-time behavior of ITE and exact QITE.

Figure 18. Refer to the following caption and surrounding text. — **Figure 18.** QITE energy and fidelity as in figure 16 but for the (12, 9) AS of O₃ with zero unpaired electrons.
Download figure:
Standard image High-resolution image

6. Summary and outlook

In this work, we perform numerical experiments on the algorithmic performance of the ITE and QITE algorithms [6] applied to spin- and fermionic-lattice, as well as the AS of molecular electronic structure Hamiltonians of varying degree of correlation. The approximations and simplifications we make ignore experimental limitations (e.g. we group Hamiltonian terms of identical domains and assume perfect measurement and state preparation), as well as one of the key steps of the QITE algorithm, namely the cost of decomposing the unitary operators ${\hat{U}}_{l}$ of equation (11) into elementary quantum operations. Therefore, our results correspond to a best-case-scenario study of the performance of QITE when applied to strongly correlated systems. For this reason, only a qualitative and no direct quantitative comparison between QITE and trotterized ITE is made.

We put a lot of emphasize on providing good classical initial states ∣Ψ_init〉 in order to remove biases in the ITE and QITE algorithm performances that can be traced back to a poor initial state choice and establish a classical baseline to compare the obtained results against. We provide a detailed discussion on how to generate a good initial state for generic spin-Hamiltonians based on GHF with FGS. To the best of our knowledge, this also presents the first application of GHF theory to the studied two-dimensional spin lattice systems, more precisely, to a triangular lattice geometry of an anti-ferromagnetic Heisenberg model (HM III) that can lead to frustration, and a hexagonal lattice geometry of an antiferromagnetic J₁J₂-model.

We apply ITE and QITE to one- and two-dimensional lattice spin systems displaying short- and long-range interactions. The one-dimensional short-range Heisenberg model is the only spin system that allows us to go beyond the smallest non-trivial expansion of ν = 1. Due to the exponential growth of the computational cost with the domain size—which grows linearly (quadratically) with ν for one-(two-) dimensional systems—, we could not extend our studies beyond ν = 1 for the long-range one-dimensional and all other two-dimensional systems in this study. Nevertheless, we find that even at the lowest order expansion of ν = 1, QITE can lead to dramatic improvements of the state fidelity for two-dimensional spin systems with nearest-neighbor and next-to-nearest-neighbor interactions, namely in our spin systems HM III and J₁J₂ in comparison to the GHF solution. The latter system can describe a simplified lattice model of CrI₃, which is of interest in the field of material science. The QITE results for the studied one-dimensional long-range transverse-field Ising models TFIM I and TFIM 2, and Heisenberg model HM II are inconclusive. For those three systems, QITE almost immediately deviates from ITE for ν = 1. Whether or not an increased domain ν≥2 will lead to a comparable improvement for these systems as we observed for the one-dimensional NN system HM I should be studied in future work.

We then move on and introduce a fermionic formulation of QITE—fermionic-QITE—which we apply to the one-dimensional Fermi-Hubbard model as a proof-of-principle system. For this model, ν describes the number of additional fermionic sites included in the domain. Here, the case ν = 2 corresponds to including the left- and right-hand side neighbors of a nearest-neighbor fermionic lattice site pair into the domain (comparable to a Manhattan distance 1), and leads to a stable QITE closely following the ITE. A natural question is whether such a behavior can also be seen for more complicated Fermi-Hubbard models, and especially two-dimensional formulations thereof. Another research question is whether applying the Verstraete-Cirac mapping [30] to a fermionic lattice Hamiltonian, and then solving the resulting spin Hamiltonian with the spin formulation of QITE would lead to similar success, which would circumvent the open problem of how to efficiently simulate fermionic-QITE on a quantum computer.

The numerical studies conclude with applying the fermionic formulation of QITE to the AS of the Ne atom, as well as two different spin states of the molecule Fe(III)-NTA, and three molecular forms of oxygen. The Fe(III)-NTA molecule is of particular interest to the chemical industry. Overall, three of the five chemical systems display a strong multi-reference character. The latter is analyzed in terms of the Z_s(1)-diagnostic, while the orbitals to be included are picked based on the mutual information. One of the central observations we make is that a large overall amount of mutual information paired with a large Z_s(1) value indicative of a strong multi-configurational character requires long ITE evolution times to achieve significant improvements in the evolved quantum state. While systems displaying a rather small multi-reference character (here the systems Ne (0), Fe(III)-NTA (3), O₂ (2)) allow for notable improvements in the state fidelity for small evolution times τ ∼ 0.2 − 2, the ITE of molecular systems with strong multi-reference character shows that long evolution times τ ∼ 10 − 100 are required to observe notable improvements in fidelity. However, the fermionic-QITE simulation become unstable well before these time scales are reached, except when all AS orbitals are included (i.e. at ν^*). While the truncation of the fermionic operator expansion of our QITE simulation leads to a low-order polynomial scaling (in the number of orbitals L of the AS) of the number of terms that have to be measured on the quantum device to determine $\hat{A}[j]$ in equation (20), the cost of decomposing the resulting unitary operator ${\hat{U}}_{j}={e}^{-i{\rm{\Delta }}\tau \hat{A}[j]}$ (which is mathematically equivalent to a unitary coupled cluster singles and doubles operator) into elementary quantum operations, as well as the required number of iterations steps τ/Δτ to reach a high fidelity state ∣Ψ〉 might prove prohibitive in practice.

Clearly, a handful of numerical experiments do not allow for general statements of the practicality of fermionic-QITE applied to quantum chemistry systems. In fact, a future research direction for fermionic-QITE applied to molecules could be the choice of molecular orbital basis in which the second quantized Hamiltonian is represented. As we discuss in section 2.4, the success of MPS paired with DMRG to describe strongly correlated molecular electronic structure Hamiltonians is partly due to finding a suitable basis where the second quantized Hamiltonian becomes quasi one-dimensional. In fermionic-QITE, we rotate the AS Hamiltonian into the molecular orbitals corresponding to its AS ROHF solution. It would be interesting to see if strategies similar to those which lead to a successful formulation of DMRG in quantum chemistry could lead to a stable evolution of fermionic-QITE for the small values of ν ≪ ν^*.

Another topic of future research is to consider how both spin- and fermionic-QITE perform when we lift one of our major approximations, by no longer condensing Hamiltonian terms of identical domains into one single expression $\hat{h}[j]$ . This would be a major step towards a more realistic assessment of the algorithmic performance of QITE when applied in an actual experiment. In the same direction, the inclusion of the error due to the approximation of the unitary operators ${\hat{U}}_{l}$ into basic quantum operations, as well as the impact of measurement errors in the central quantities b[j] and S[j] would be important. First steps regarding the latter and its close connection to the correlation length ${ \mathcal C }$ of the studied system have recently been made [129]. A future study regarding the one- and two-qubit gate costs for decomposing ${\hat{U}}_{l}$ for a system of given domain size could provide valuable insights into the practicability of QITE for large systems. Once the above points are included, a natural research question is how QITE compares against other approaches, for instance those algorithms that similar rely on the ITE principle [130, 131]. One of the problems of QITE is the potential ill-behaved matrix S which can in part be attributed to a non-orthogonal basis it is represented in. Recent work has shown that a modified version of QITE can overcome this problem [132].

Our spin-QITE algorithm uses an operator basis ${\hat{\sigma }}_{{\bf{I}}}$ which takes into account all possible ${4}^{| {D}_{j}| }$ Pauli-strings for a given domain D_j. It has been shown in [6] that for real-valued Hamiltonians and wave functions this scaling can be reduced by a small constant factor by only picking certain subsets of Pauli-strings, and—even though the exponential dependence on ∣D_j∣ remains—including such reduced operator expansions could allow to study larger domains ν > 1. Alongside this, a more efficient code base able to simulate the performance of QITE for long-range and higher-dimensional systems needs to be developed.

We conclude with a few final remarks. In all studied systems we were able to study the performance of QITE since the systems were small enough to be solved by means of exact diagonalization. The fidelity is the central quantity that tells us how close a quantum state is to the desired state, but in a real experiment or a large numerical simulation, the fidelity will not be accessible. Therefore, we use the energy of the state ∣Ψ〉 as a diagnostic for the performance of QITE, since we observe that an increase in energy in most cases leads to a worsening of the fidelity at roughly the same time τ (with the exception of the J₁J₂ system), which is in agreement with the behavior observed in literature [9]. Thus in principle, the energy expectation value of the Hamiltonian $\hat{H}$ w.r.t. the evolved state ∣Ψ〉 could serve as a diagnostic for knowing when QITE is starting to deviate from ITE. However, one of the core aspects of the QITE algorithm is that it in principle (ignoring the ν^*-case) only requires to measure expectation values of much simpler operators involving products of ${\hat{\sigma }}_{{\bf{I}}}$ and $\hat{h}[j]$ , and not of the entire Hamiltonian operator. One workaround would be to measure the energy expectation value sporadically every few cycles. However, it would probably be advantageous if one could find an alternative diagnostic which does not require the energy evaluation at all. We hope that this work encourages more studies on refined and improved implementations of the algorithm, especially with the promise it shows towards improving the overlaps of lattice systems.

Acknowledgments

The authors thank the material science working group of QUTAC for support. The authors thank Davide Vodola for providing helpful insights into MPS representation of quantum states and discussions about the manuscript. M. Ka. thanks Simon Balthasar Jäger for early discussions on a fermionic formulation of QITE and Caitlin Isobel Jones for comments on the draft.

Data availability statement

All data that support the findings of this study are included within the article (and any supplementary files).

Appendix A.: Generalized Hartree–Fock theory

In the following, we will describe the mean-field approach we use in order to generate suitable initial states for both, spin and fermionic systems. This mean-field algorithm can be efficiently computed on classical computers and its result can be translated into a fermionic Gaussian state which can be efficiently implemented on a quantum computer.

A.1. Fermionic Gaussian states

We define the column vector of Majorana operators $\hat{{\bf{a}}}={({\hat{a}}_{1},\ldots ,{\hat{a}}_{2L})}^{T}$ . Let G denote a real-valued and skew-symmetric (2L × 2L) matrix, which can be decomposed by means of a Schur decomposition into

$\begin{eqnarray}&&{\bf{G}}={{\bf{Q}}}^{T}\displaystyle \underset{p=1}{\overset{L}{\oplus }}\left(\begin{array}{c}0\,\,{\beta }_{p}\\ -{\beta }_{p}\,\,0\end{array}\right){\bf{Q}},\end{eqnarray} \tag{ A1 }$

where β_p ≠ 0 is real and Q ∈ O(2L). A general Fermionic Gaussian State (FGS) has the form

$\begin{eqnarray}&&\hat{\varrho }({\bf{G}})=\frac{{e}^{-\frac{i}{4}{\hat{{\bf{a}}}}^{T}{\bf{G}}\hat{{\bf{a}}}}}{{ \mathcal Z }},\end{eqnarray} \tag{ A2 }$

where ${ \mathcal Z }={2}^{L}{\prod }_{p=1}^{L}\cosh \left(\frac{{\beta }_{p}}{2}\right)$ is a normalization constant. FGS can be fully characterized by their respective covariance matrix Γ, which is a real-valued and skew-symmetric (2L × 2L) matrix whose elements are given by

$\begin{eqnarray}&&{{\rm{\Gamma }}}_{kl}=\frac{i}{2}\,\rm{tr}\,\left(\hat{\varrho }[{\hat{a}}_{k},{\hat{a}}_{l}]\right),\end{eqnarray} \tag{ A3 }$

where $[\hat{A},\hat{B}]=\hat{A}\hat{B}-\hat{B}\hat{A}$ denotes the commutator of two operators $\hat{A},\hat{B}$ . The covariance matrix Γ can be obtained directly from G in equation (A1) through

$\begin{eqnarray}&&{\boldsymbol{\Gamma }}={{\bf{Q}}}^{T}\displaystyle \underset{p=1}{\overset{L}{\oplus }}\left(\begin{array}{c}0\,\,{\lambda }_{p}\\ -{\lambda }_{p}\,\,0\end{array}\right){\bf{Q}},\end{eqnarray} \tag{ A4 }$

where

$\begin{eqnarray}&&{\lambda }_{p}=\tanh \left(\frac{{\beta }_{p}}{2}\right).\end{eqnarray} \tag{ A5 }$

In general, λ_p ∈ [−1,1]. If all λ_j ∈ { − 1, 1}, then ϱ describes a pure FGS, otherwise, $\hat{\varrho }$ describes a mixed FGS.

The expectation value of a single tensor product of Majorana or fermionic operators can be computed efficiently through Wick’s theorem [12, 127],

$\begin{eqnarray}&&\,\rm{tr}\,\left(\hat{\varrho }{\hat{a}}_{{i}_{1}}{\hat{a}}_{{i}_{2}}\cdots {\hat{a}}_{{i}_{2m}}\right)={(-i)}^{m}\,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1}{i}_{2}\ldots {i}_{2m}}\right),\end{eqnarray} \tag{ A6 }$

where i₁ ≠ i₂ ≠ … ≠ i_2m for i_k ∈ {1, …, 2L} and k = 1, …, 2L. The matrix ${\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1}{i}_{2}\ldots {i}_{2m}}$ denotes a (2m × 2m)-submatrix of Γ with the corresponding rows and columns i₁, i₂, …, i_2m, and Pf(A) denotes the Pfaffian of a skew-symmetric matrix A. We will be making extensive use of equation (A6).

A.2. Minimizing the energy of a fermionic Gaussian state

The aim of this section is to find the covariance matrix of a FGS which possesses the lowest energy expectation value w.r.t. the Hamiltonian of interest within the family of FGS. As shown in e.g. [18, 31, 91], this can be realized by performing an ITE of $\hat{\varrho }$ under the assumption that Wick’s theorem holds throughout the evolution, leading to the following equation of motion for the corresponding covariance matrix,

$\begin{eqnarray}&&\frac{d{\boldsymbol{\Gamma }}}{d\tau }=\frac{1}{2}[{\boldsymbol{\Gamma }},[{\boldsymbol{\Gamma }},{\bf{F}}]],\end{eqnarray} \tag{ A7 }$

where $\tau \in {\mathbb{R}}$ denotes the imaginary time and F = F(Γ) is the mean-field matrix, which is defined as

$\begin{eqnarray}&&{\bf{F}}=4\frac{\partial E({\boldsymbol{\Gamma }})}{\partial {\boldsymbol{\Gamma }}}.\end{eqnarray} \tag{ A8 }$

Here,

$\begin{eqnarray}&&E({\boldsymbol{\Gamma }})=\,\rm{tr}\,\left(\hat{\varrho }\hat{H}\right)\end{eqnarray} \tag{ A9 }$

is the mean-field energy, which can be evaluated efficiently using equation (A6). The procedure outlined in this section is commonly referred to as Generalized Hartree–Fock (GHF) theory [126].

While the evolution through equation (A7) has many desirable properties, a heuristically much faster approach when one is interested in pure FGS solutions is to diagonalize iF(Γ) = UDU^† and then solve the following equation for the covariance matrix [15, 18],

$\begin{eqnarray}&&{\boldsymbol{\Gamma }}=i{\bf{U}}\,\rm{sgn}\,({\bf{D}}){{\bf{U}}}^{\dagger },\end{eqnarray} \tag{ A10 }$

where U is a unitary matrix, D is a real-valued diagonal matrix, and sgn() is the sign function. It is easy to verify that equation (A10) only allows for pure state solutions. This approach typically converges after a handful of iterations, whereas equation (A7) often requires significantly more iterations. Unlike in equation (A7), the Ansatz of equation (A10) does not conserve the parity of the FGS, i.e. it will result in a covariance matrix which is associated with the overall lowest energy expectation value, independent of the parity of the initial covariance matrix. It can be easily verified that pure FGS are characterized by Γ² = − 1 and that equation (A10) is a stationary solution of equation (A7). A very efficient and stable way of obtaining the FGS ground state approximate solution is to use equation (A10) as a warm-up phase and then use the obtained covariance matrix as an initial state for the ITE through equation (A7) which then runs until some convergence criteria is met.

In order to compute equation (A8), we will make extensive use of the following identity for integers p ≥ 2,

$\begin{eqnarray}&&\frac{\partial \,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1},\ldots ,{i}_{2p}}\right)}{\partial {\boldsymbol{\Gamma }}}=-\frac{1}{2}\,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1},\ldots ,{i}_{2p}}\right){\left({\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1},\ldots ,{i}_{2p}}\right)}^{-1}.\end{eqnarray} \tag{ A11 }$

Here, the matrix inverse of the submatrix is embedded in the (2L × 2L) space of the covariance matrix. By embedding the submatrix ${\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1},\ldots ,{i}_{2p}}$ in the (2N × 2N) space of the covariance matrix, we mean that all matrix entries where at least one of the row or column indices does not belong to i₁, …, i_2p are set to zero. Note, that in general ${\left({\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1},\ldots ,{i}_{2p}}\right)}^{-1}\ne -{\left.{\boldsymbol{\Gamma }}\right|}_{{i}_{1},\ldots ,{i}_{2p}}$ . This is opposed to what one finds for the full covariance matrix of a pure fermionic Gaussian state, where Γ⁻¹ = − Γ.

The case p = 1 in equation (A11) reduces to

$\begin{eqnarray}&&\frac{\partial \,\rm{Pf}\,({\boldsymbol{\Gamma }}{| }_{k,l})}{\partial {{\rm{\Gamma }}}_{mn}}=\frac{\partial {{\rm{\Gamma }}}_{kl}}{\partial {{\rm{\Gamma }}}_{mn}}=\frac{1}{2}\left({\delta }_{km}{\delta }_{ln}-{\delta }_{kn}{\delta }_{lm}\right).\end{eqnarray} \tag{ A12 }$

In the GHF method, we will often make use of equations (A11)–(A12).

A.3. Generating a pure fermionic Gaussian state from a covariance matrix

Pure FGS can either possess an even (Pf(Γ) = 1) or odd (Pf(Γ) = − 1) parity and their respective covariance matrices satisfies Γ² = − 1. The corresponding FGS can be generated by means of a real orthogonal matrix Q ∈ O(2L),

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle ={\hat{U}}_{{\bf{Q}}}| \,\rm{vac}\,\rangle ,\end{eqnarray} \tag{ A13 }$

where ${\hat{U}}_{{\bf{Q}}}$ is the unitary generator of the FGS, to be defined later. In what follows, we will decompose Q ∈ O(2L) into a matrix R ∈ SO(2L) (for the even parity case) and a reflection ${\hat{X}}_{n}$ and matrix $\bar{{\bf{R}}}\in \,\rm{SO}\,(2L)$ (for the odd parity case) and make extensive use that any ${{\bf{R}}}^{{\prime} }\in \,\rm{SO}\,(2L)$ can be written as ${{\bf{R}}}^{{\prime} }={e}^{{{\bf{A}}}^{{\prime} }}$ , where ${{\bf{A}}}^{{\prime} }$ is a real skew-symmetric matrix.

A.3.1. Even parity sector

The even case has been described e.g. in [31], and is included for completeness. For even parity sectors, we have

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle ={\hat{U}}_{{\bf{R}}}| \,\rm{vac}\,\rangle ={\hat{U}}_{{\bf{R}}}| {0}_{1},\ldots ,{0}_{L}\rangle \end{eqnarray} \tag{ A14 }$

where Q = R ∈ SO(2L), i.e. $\det ({\bf{R}})=1$ and

$\begin{eqnarray}&&{\hat{U}}_{{\bf{R}}}={e}^{\frac{1}{4}{\hat{{\bf{A}}}}^{T}\mathrm{log}({\bf{R}})\hat{{\bf{A}}}}.\end{eqnarray} \tag{ A15 }$

Here, $\mathrm{log}({\bf{R}})$ denotes the matrix logarithm of R and we introduced the p-q ordering of the covariance matrix (meaning it is ordered as $\hat{{\bf{A}}}={({\hat{a}}_{1},{\hat{a}}_{3},\ldots ,\,{\hat{a}}_{2L-1},{\hat{a}}_{2},{\hat{a}}_{4},\ldots ,\,{\hat{a}}_{2L})}^{T}$ , which we will indicate by a tilde over the covariance matrix, $\tilde{{\rm{\Gamma }}}$ ). Using the relation

$\begin{eqnarray}&&{\hat{U}}_{{\bf{R}}}^{\dagger }{\hat{A}}_{\mu }{\hat{U}}_{{\bf{R}}}=\displaystyle \sum _{\nu =1}^{2L}{R}_{\mu \nu }{\hat{A}}_{\nu },\end{eqnarray} \tag{ A16 }$

we can compute the following relation

$\begin{eqnarray}\begin{array}{rcl}{\tilde{{\rm{\Gamma }}}}_{\mu \nu } & = & \frac{i}{2}\langle {{\rm{\Psi }}}_{\,\rm{FGS}\,}| [{\hat{A}}_{\mu },{\hat{A}}_{\nu }]| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle \\ & = & \frac{i}{2}\langle \,\rm{vac}\,| {\hat{U}}_{{\bf{Q}}}^{\dagger }[{\hat{A}}_{\mu },{\hat{A}}_{\nu }]{\hat{U}}_{{\bf{Q}}}| \,\rm{vac}\,\rangle \\ & = & i\displaystyle \sum _{\lambda ,\sigma =1}^{2L}{Q}_{\mu \sigma }{Q}_{\nu \lambda }\langle \,\rm{vac}\,| {\hat{A}}_{\sigma }{\hat{A}}_{\lambda }| \,\rm{vac}\,\rangle .\end{array}\end{eqnarray} \tag{ A17 }$

We will now divide the possible values for σ, λ ∈ [2L] into four sectors and analyze them separately. Note, that ${\tilde{{\rm{\Gamma }}}}_{\sigma \sigma }=0$ and we therefore assume σ ≠ λ in the following.

1.
Sector σ, λ ∈ [1, …, L]: Here, due to the p-q-ordering, we have both ${\hat{A}}_{\sigma }$ and ${\hat{A}}_{\lambda }$ of the form $({\hat{c}}_{p}^{\dagger }+{\hat{c}}_{p})$ , where p ∈ [L]. We therefore have to evaluate terms such as
$\begin{eqnarray}\begin{array}{rcl}\langle \,\rm{vac}\,| ({\hat{c}}_{p}^{\dagger }+{\hat{c}}_{p})({\hat{c}}_{q}^{\dagger }+{\hat{c}}_{q})| \,\rm{vac}\,\rangle & = & \langle \,\rm{vac}\,| {\hat{c}}_{p}^{\dagger }{\hat{c}}_{q}| \,\rm{vac}\,\rangle \\ & & +\langle \,\rm{vac}\,| {\hat{c}}_{p}{\hat{c}}_{q}^{\dagger }| \,\rm{vac}\,\rangle \\ & = & 0.\end{array}\end{eqnarray} \tag{ A18 }$
since we assumed that σ ≠ λ which here translated to p ≠ q. Thus, $\langle \,\rm{vac}\,| {\hat{A}}_{\sigma }{\hat{A}}_{\lambda }| \,\rm{vac}\,\rangle =0$ for σ, λ ∈ [L].
2.
Sector σ, λ ∈ [L + 1, …, 2L]:Similar arguments apply and we see that $\langle \,\rm{vac}\,| {\hat{A}}_{\sigma }{\hat{A}}_{\lambda }| \,\rm{vac}\,\rangle =0$ for σ, λ ∈ [L + 1, …, 2L].
3.
Sector σ ∈ [1, …, L] and λ ∈ [L + 1, …, 2L]:Here we get terms of the form
$\begin{eqnarray}&&\langle \,\rm{vac}\,| ({\hat{c}}_{p}^{\dagger }+{\hat{c}}_{p})i({\hat{c}}_{q}^{\dagger }-{\hat{c}}_{q})| \,\rm{vac}\,\rangle =i{\delta }_{pq}.\end{eqnarray} \tag{ A19 }$
4.
Sector σ ∈ [L + 1, …, 2L] and λ ∈ [1, …, L]:Here we get terms of the form
$\begin{eqnarray}&&\langle \,\rm{vac}\,| i({\hat{c}}_{p}^{\dagger }-{\hat{c}}_{p})({\hat{c}}_{q}^{\dagger }+{\hat{c}}_{q})| \,\rm{vac}\,\rangle =-i{\delta }_{pq}.\end{eqnarray} \tag{ A20 }$

By defining the even-parity vacuum covariance matrix in the p-q-ordering,

$\begin{eqnarray}{\tilde{{\boldsymbol{\Upsilon }}}}^{(+1)}=\left(\begin{array}{cc}{{\bf{0}}}_{L} & {{\bf{1}}}_{L}\\ -{{\bf{1}}}_{L} & {{\bf{0}}}_{L}\end{array}\right),\end{eqnarray} \tag{ A21 }$

we can write equation (A17) as

$\begin{eqnarray}&&\tilde{{\boldsymbol{\Gamma }}}=-{\bf{Q}}{\tilde{{\boldsymbol{\Upsilon }}}}^{(+1)}{{\bf{Q}}}^{T}\end{eqnarray} \tag{ A22 }$

$\begin{eqnarray}&&=-{e}^{i{\boldsymbol{\xi }}}{\tilde{{\boldsymbol{\Upsilon }}}}^{(+1)}{e}^{-i{\boldsymbol{\xi }}},\end{eqnarray} \tag{ A23 }$

where ${\boldsymbol{\xi }}=-i\mathrm{log}({\bf{R}})$ is a purely imaginary and skew-symmetric matrix.

A.3.2. Odd parity sector

Let us now consider the case of odd parity. Here, Q ∈ O(2L) with $\det ({\bf{Q}})=-1$ . In this case we have

$\begin{eqnarray}\begin{array}{rcl}| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle & = & {\hat{U}}_{{\bf{Q}}}| \,\rm{vac}\,\rangle \\ & = & {\hat{U}}_{\bar{{\bf{R}}}}{X}_{L}| \,\rm{vac}\,\rangle \\ & = & {\hat{U}}_{\bar{{\bf{R}}}}| {1}_{L}\rangle ,\end{array}\end{eqnarray} \tag{ A24 }$

where we use the simplified notation ∣1_L〉 = ∣0₁, 0₂, …, 0_L−1, 1_L〉. Here, $\bar{{\bf{R}}}\in \,\rm{SO}\,(2L)$ , where

$\begin{eqnarray}&&{\bar{R}}_{\mu \nu }=\left\{\begin{array}{l}{Q}_{\mu \nu },\,\,\rm{if}\,\,\nu \ne 2L\\ -{Q}_{\mu \nu },\,\,\rm{if}\,\,\nu =2L.\end{array}\right.\end{eqnarray} \tag{ A25 }$

We know from equation (A16) that for special orthogonal matrices we have

$\begin{eqnarray}&&{\hat{U}}_{\bar{{\bf{R}}}}^{\dagger }{\hat{A}}_{\mu }{\hat{U}}_{\bar{{\bf{R}}}}=\displaystyle \sum _{\nu =1}^{2L}{\bar{R}}_{\mu \nu }{\hat{A}}_{\nu }.\end{eqnarray} \tag{ A26 }$

Similar to the even parity sector calculation, we now compute

$\begin{eqnarray}\begin{array}{rcl}{\tilde{{\rm{\Gamma }}}}_{\mu \nu } & = & \frac{i}{2}\langle {{\rm{\Psi }}}_{\,\rm{FGS}\,}| [{\hat{A}}_{\mu },{\hat{A}}_{\nu }]| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle \\ & = & \frac{i}{2}\langle {1}_{L}| {\hat{U}}_{\bar{{\bf{R}}}}^{\dagger }[{\hat{A}}_{\mu },{\hat{A}}_{\nu }]{\hat{U}}_{\bar{{\bf{R}}}}| {1}_{L}\rangle \\ & = & i\displaystyle \sum _{\lambda \ne \sigma }^{2L}{\bar{R}}_{\mu \sigma }{\bar{R}}_{\nu \lambda }\langle {1}_{L}| {\hat{A}}_{\sigma }{\hat{A}}_{\lambda }| {1}_{L}\rangle .\end{array}\end{eqnarray} \tag{ A27 }$

We again consider four different sectors for the values of σ, λ ∈ [2L]. Following the same approach as before, we find that by defining the (L × L)-diagonal matrix

$\begin{eqnarray}{\bar{{\bf{1}}}}_{L}=\left(\begin{array}{ccccc}1 & 0 & \cdots \, & 0 & 0\\ 0 & 1 & \cdots \, & 0 & 0\\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots \, & 1 & 0\\ 0 & 0 & \cdots \, & 0 & -1\end{array}\right)\end{eqnarray} \tag{ A28 }$

which is identical to the identity matrix except for the flipped sign in the last row, and defining the odd-parity vacuum covariance matrix in the p-q-ordering

$\begin{eqnarray}{\tilde{{\boldsymbol{\Upsilon }}}}^{(-1)}=\left(\begin{array}{cc}{{\bf{0}}}_{L} & {\bar{{\bf{1}}}}_{L}\\ -{\bar{{\bf{1}}}}_{L} & {{\bf{0}}}_{L}\end{array}\right)\end{eqnarray} \tag{ A29 }$

we can write

$\begin{eqnarray}\begin{array}{rcl}\tilde{{\boldsymbol{\Gamma }}} & = & -\bar{{\boldsymbol{R}}}{\tilde{{\boldsymbol{\Upsilon }}}}^{(-1)}{\bar{{\boldsymbol{R}}}}^{T}\\ & = & -{e}^{i\bar{{\boldsymbol{\xi }}}}{\tilde{{\boldsymbol{\Upsilon }}}}^{(-1)}{e}^{-i\bar{{\boldsymbol{\xi }}}},\end{array}\end{eqnarray} \tag{ A30 }$

where $\bar{{\boldsymbol{\xi }}}=-i\mathrm{log}(\bar{{\bf{R}}})$ is a purely imaginary and skew-symmetric matrix.

A.4. Generating a fermionic Gaussian state in `OpenFermion`

FGS can be efficiently implemented on a quantum circuit using ${ \mathcal O }({N}^{2})$ quantum gates and ${ \mathcal O }(N)$ circuit depth [16], and has been included in quantum software packages such as OpenFermion[39]. Let us consider a R ∈ SO(2L) which describes a fermionic Gaussian unitary transformation as in equation (A16),

$\begin{eqnarray}&&{\hat{U}}_{{\bf{R}}}={e}^{i\hat{{ \mathcal H }}({\bf{R}})},\end{eqnarray} \tag{ A31 }$

where

$\begin{eqnarray}&&\hat{{ \mathcal H }}({\bf{R}})=-\frac{i}{4}{\hat{{\bf{A}}}}^{T}\mathrm{log}({\bf{R}})\hat{{\bf{A}}}.\end{eqnarray} \tag{ A32 }$

A general quadratic fermionic Hamiltonian can be written as

$\begin{eqnarray}&&{{ \mathcal H }}_{q}=\displaystyle \sum _{j,k=1}^{L}{M}_{jk}{\hat{c}}_{j}^{\dagger }{\hat{c}}_{k}+\frac{1}{2}\displaystyle \sum _{j,k=1}^{L}\left({{\rm{\Delta }}}_{jk}{\hat{c}}_{j}^{\dagger }{\hat{c}}_{k}^{\dagger }+\,\rm{H.c.}\,\right),\end{eqnarray} \tag{ A33 }$

where M = M^† and Δ^T = − Δ are complex matrices. We define

$\begin{eqnarray}&&\hat{{\bf{C}}}={({\hat{c}}_{1},\ldots ,\,{\hat{c}}_{L},{\hat{c}}_{1}^{\dagger },\ldots ,\,{\hat{c}}_{L}^{\dagger })}^{T},\end{eqnarray} \tag{ A34 }$

and the transformation matrix

$\begin{eqnarray}{{\bf{W}}}_{m}=\left(\begin{array}{cc}{{\bf{1}}}_{L} & {{\bf{1}}}_{L}\\ -i{{\bf{1}}}_{L} & i{{\bf{1}}}_{L}\end{array}\right),\end{eqnarray} \tag{ A35 }$

which transforms between the creation and annihilation operators and Majorana operators through $\hat{{\bf{A}}}={{\bf{W}}}_{m}\hat{{\bf{C}}}$ . Then, the quadratic Hamiltonian in equation (A33) can be written as

$\begin{eqnarray}{{ \mathcal H }}_{q}=\frac{1}{2}{\hat{{\bf{C}}}}^{T}\left(\begin{array}{cc}-{{\boldsymbol{\Delta }}}^{* } & -{{\bf{M}}}^{* }\\ {\bf{M}} & {\boldsymbol{\Delta }}\end{array}\right)\hat{{\bf{C}}}.\end{eqnarray} \tag{ A36 }$

therefore we can obtain M and Δ from R through

$\begin{eqnarray}\left(\begin{array}{cc}-{{\boldsymbol{\Delta }}}^{* } & -{{\bf{M}}}^{* }\\ {\bf{M}} & {\boldsymbol{\Delta }}\end{array}\right)=-\frac{i}{2}{{\bf{W}}}_{m}^{T}\mathrm{log}({\bf{R}}){{\bf{W}}}_{m}.\end{eqnarray} \tag{ A37 }$

The function that generates a fermionic Gaussian state in OpenFermion (called gaussian_state_preparation_circuit) uses a slightly different operator convention for the quadratic operator in equation (A36), namely it uses ${\hat{{\bf{C}}}}^{{\prime} }={({\hat{c}}_{1}^{\dagger },\ldots ,\,{\hat{c}}_{L}^{\dagger },{\hat{c}}_{1},\ldots ,\,{\hat{c}}_{L})}^{T}$ instead of equation (A34), which leads to

$\begin{eqnarray}{{ \mathcal H }}_{q}=\frac{1}{2}{\hat{{\bf{C}}}{}^{{\prime} }}^{T}\left(\begin{array}{cc}{{\boldsymbol{\Delta }}}^{{\prime} } & {{\bf{M}}}^{{\prime} }\\ -{{{\bf{M}}}^{{\prime} }}^{* } & -{{{\boldsymbol{\Delta }}}^{{\prime} }}^{* }\end{array}\right){\hat{{\bf{C}}}}^{{\prime} }.\end{eqnarray} \tag{ A38 }$

By identifying ${{\bf{M}}}^{{\prime} }=-{{\bf{M}}}^{* }$ and ${{\boldsymbol{\Delta }}}^{{\prime} }=-{{\boldsymbol{\Delta }}}^{* }$ we can generate the fermionic Gaussian unitary in equation (A31) in OpenFermion by setting equation (A32) identical to equation (A33), ${ \mathcal H }({\bf{R}})={{ \mathcal H }}_{q}$ , and using the matrices ${{\bf{M}}}^{{\prime} }$ and ${{\boldsymbol{\Delta }}}^{{\prime} }$ as input for gaussian_state_preparation_circuit.

We have discussed above the case of a fermionic Gaussian unitary generated via a R ∈ SO(2L) following section A.3.1. For a general Q ∈ O(2L), a similar result follows from section A.3.2 for matrices with $\det ({\bf{Q}})=-1$ . Thus, a general pure FGS is generated via the fermionic Gaussian unitary through

$\begin{eqnarray}&&| {{\rm{\Psi }}}_{\,\rm{FGS}\,}\rangle =\left\{\begin{array}{l}{\hat{U}}_{{\bf{Q}}}| \,\rm{vac}\,\rangle ={\hat{U}}_{{\bf{R}}}| \,\rm{vac}\,\rangle ={e}^{i{ \mathcal H }({\bf{R}})}| \,\rm{vac}\,\rangle ,\quad \\ \,\,\rm{if}\,\,\det ({\bf{Q}})=1,\quad \\ {\hat{U}}_{{\bf{Q}}}| \,\rm{vac}\,\rangle ={\hat{U}}_{\bar{{\bf{R}}}}| {1}_{L}\rangle ={e}^{i{ \mathcal H }(\bar{{\bf{R}}})}| {1}_{L}\rangle ,\quad \\ \,\,\rm{if}\,\,\det ({\bf{Q}})=-1.\quad \end{array}\right.\end{eqnarray} \tag{ A39 }$

A.5. Energy expectation values and mean-field terms

From the considerations of Appendix A.2 it is clear that in order to apply GHF theory to approximate the ground state of a (spin or fermionic) Hamiltonian of interest with a FGS, one has to compute the energy expectation value, equation (A9), and mean-field matrix, equation (A8), using Wick’s theorem. In the following, we will provide explicit formulas for these expressions for two of the systems studied in this work, the Heisenberg- and transverse-field Ising model.

A.5.1. Heisenberg model

By similar considerations as in [91], the energy expectation value of the Heisenberg Hamiltonian in equation (29) of a fermionic Gaussian state described by the covariance matrix Γ is given by

$\begin{eqnarray}\begin{array}{rcl}E({\boldsymbol{\Gamma }}) & = & \displaystyle \sum _{i\lt j}{J}_{ij}\left[{(-1)}^{j-i}\left\{\,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{2i,2i+1,\ldots ,\,2j-2,2j-1}\right)\right.\right.\\ & & \left.-\,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{2i-1,2i+1,2i+2,\ldots ,2j-3,2j-2,2j}\right)\right\}\\ & & \left.+\,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{2i-1,2i,2j-1,2j}\right)\right]\\ & & -\frac{B}{2}\displaystyle \sum _{i}\left({{\rm{\Gamma }}}_{2i-1,2i}-{{\rm{\Gamma }}}_{2i,2i-1}\right).\end{array}\end{eqnarray} \tag{ A40 }$

We can compute the mean-field matrix ${\bf{F}}=4\frac{\partial E({\boldsymbol{\Gamma }})}{\partial {\boldsymbol{\Gamma }}}$ of equation (A40) using a Pfaffian gradient identity from equation (A11).

A.5.2. Transverse-field Ising model

The energy expectation value of the transverse field Ising model Hamiltonian in equation (31) of a fermionic Gaussian state is given by [91]

$\begin{eqnarray}\begin{array}{rcl}E({\boldsymbol{\Gamma }}) & = & \displaystyle \sum _{i\lt j}{J}_{ij}{(-1)}^{j-i}\,\rm{Pf}\,\left({\left.{\boldsymbol{\Gamma }}\right|}_{2i,2i+1,\ldots ,\,2j-2,2j-1}\right)\\ & & -\frac{B}{2}\displaystyle \sum _{i}\left({{\rm{\Gamma }}}_{2i-1,2i}-{{\rm{\Gamma }}}_{2i,2i-1}\right).\end{array}\end{eqnarray} \tag{ A41 }$

The mean-field matrix F can be derived from equation (A11).

Appendix B.: Derivation of the linear system of equations for QITE

In this appendix, we derive the linear system of equations for spin-QITE. The parameter values that determine the linear system of equations in both, spin- and fermionic-QITE, have to be determined from measurements on the quantum system and then solved by means of a classical algorithm.

We first derive the linear system of equations for operators $\hat{A}[l]$ in equation (11) whenever $\hat{h}[j]$ describes a term from a spin Hamiltonian in equation (1). The derivation will be based on the observation that the normalized state

$\begin{eqnarray}&&| {\bar{{\rm{\Psi }}}}_{l}^{{\prime} }\rangle =\frac{1}{\sqrt{c[l]}}{e}^{-{\rm{\Delta }}\tau \hat{h}[l]}| {{\rm{\Psi }}}_{l-1}\rangle ,\end{eqnarray} \tag{ B1 }$

where c[l] is the normalization constant defined in equation (B3), should be close to its unitary approximation

$\begin{eqnarray}&&| {{\rm{\Phi }}}_{l}\rangle ={e}^{-i{\rm{\Delta }}\tau \hat{A}[l]}| {{\rm{\Psi }}}_{l-1}\rangle \end{eqnarray} \tag{ B2 }$

to first order Δτ. Here, closeness refers to the state vector norm, defined as $\parallel | \varphi \rangle \parallel =\sqrt{\langle \varphi | \varphi \rangle }$ for some quantum state ∣φ〉. To first order Δτ and dropping the lower index "l" for simplicity,

$\begin{eqnarray}&&c[l]=1-2{\rm{\Delta }}\tau \langle {\rm{\Psi }}| \hat{h}[l]| {\rm{\Psi }}\rangle +{ \mathcal O }({\rm{\Delta }}{\tau }^{2}),\end{eqnarray} \tag{ B3 }$

therefore

$\begin{eqnarray}&&\frac{1}{\sqrt{c[l]}}=1+{\rm{\Delta }}\tau \langle {\rm{\Psi }}| \hat{h}[l]| {\rm{\Psi }}\rangle +{ \mathcal O }({\rm{\Delta }}{\tau }^{2}),\end{eqnarray} \tag{ B4 }$

which leads to

$\begin{eqnarray}&&| {\bar{{\rm{\Psi }}}}^{{\prime} }\rangle =\left({\bf{1}}+{\rm{\Delta }}\tau \langle {\rm{\Psi }}| \hat{h}[l]| {\rm{\Psi }}\rangle {\bf{1}}-{\rm{\Delta }}\tau \hat{h}[l]\right)| {\rm{\Psi }}\rangle +{ \mathcal O }({\rm{\Delta }}{\tau }^{2}).\end{eqnarray} \tag{ B5 }$

Then, to first order in Δτ, we have

$\begin{eqnarray}&&\begin{array}{l}\frac{1}{{\rm{\Delta }}\tau }\parallel | {\bar{{\rm{\Psi }}}}^{{\prime} }\rangle -{e}^{-i{\rm{\Delta }}\tau \hat{A}[l]}| {\rm{\Psi }}\rangle \parallel \\ \approx \parallel \left(\langle {\rm{\Psi }}| \hat{h}[l]| {\rm{\Psi }}\rangle {\bf{1}}-\hat{h}[l]+i\hat{A}[l]\right)| {\rm{\Psi }}\rangle \parallel .\end{array}\end{eqnarray} \tag{ B6 }$

We want to find the minimum of the above expression, therefore we set the derivative of the above expression w.r.t. $\hat{A}[l]$ identical to zero. Since $a{[l]}_{I}\in {\mathbb{R}}$ and all ${\hat{\sigma }}_{{\bf{I}}}$ are hermitian, the non-vanishing terms that need to be considered when minimizing equation (B6) w.r.t. $\hat{A}[l]$ are given by

$\begin{eqnarray}&&\frac{d}{d\hat{A}[l]}\parallel \left(\langle {\rm{\Psi }}| \hat{h}[l]| {\rm{\Psi }}\rangle {\bf{1}}-\hat{h}[l]+i\hat{A}[l]\right)| {\rm{\Psi }}\rangle \parallel =0.\end{eqnarray} \tag{ B7 }$

Since the operator $\hat{A}[l]$ is determined by the real-valued coefficients a[l]_I, we can reformulate the above condition for a minimum in terms of the real-valued coefficients a[l]_I,

$\begin{eqnarray}&&\begin{array}{l}\frac{d}{da{[l]}_{{\bf{K}}}}\langle {\rm{\Psi }}| \left(-i\hat{h}[l]\displaystyle \sum _{{\bf{I}}}a{[l]}_{{\bf{I}}}{\hat{\sigma }}_{{\bf{I}}}+i\displaystyle \sum _{{\bf{I}}}a{[l]}_{{\bf{I}}}{\hat{\sigma }}_{{\bf{I}}}\hat{h}[l]\right.\\ \left.+\displaystyle \sum _{{\bf{I}},{\bf{J}}}a{[l]}_{{\bf{I}}}a{[l]}_{{\bf{J}}}{\hat{\sigma }}_{{\bf{I}}}{\hat{\sigma }}_{{\bf{J}}}\right)| {\rm{\Psi }}\rangle =0,\end{array}\end{eqnarray} \tag{ B8 }$

where we neglected terms that are independent of a[l]_K. Carrying out the derivative leads to

$\begin{eqnarray}&&\displaystyle \sum _{{\bf{I}}}a{[l]}_{{\bf{I}}}\langle {{\rm{\Psi }}}_{l-1}| \{{\hat{\sigma }}_{{\bf{I}}},{\hat{\sigma }}_{{\bf{K}}}\}| {{\rm{\Psi }}}_{l-1}\rangle =-i\langle {{\rm{\Psi }}}_{l-1}| [{\hat{\sigma }}_{{\bf{K}}},\hat{h}[l]]| {{\rm{\Psi }}}_{l-1}\rangle ,\end{eqnarray} \tag{ B9 }$

which leads to equation (16).

Appendix C.: Connection between mutual information and single orbital entropy

The mutual information in equation (28) between two different orbitals i, j ∈ [L] can be written as [51, 53]

$\begin{eqnarray}&&I(i,j)=-\frac{1}{2}\left({s}_{ij}(2)-{s}_{i}(1)-{s}_{j}(1)\right)(1-{\delta }_{ij}),\end{eqnarray} \tag{ C1 }$

where δ_ij is the Kronecker delta and we introduced the two-orbital entropy

$\begin{eqnarray}&&{s}_{ij}(2)=-\displaystyle \sum _{\kappa =1}^{16}{\omega }_{\kappa ,i,j}\mathrm{ln}({\omega }_{\kappa ,i,j}),\end{eqnarray} \tag{ C2 }$

where ω_κ,i,j are the eigenvalues of the two-orbital reduced density matrix of orbitals i, j, and κ goes over all 16 possible spin configurations two orbitals can have.

Appendix D.: Long-time behavior lattice systems

Figure 19 displays the long-time behavior of figure 7. The energy quickly increases to a very large value before displaying a random oscillatory behavior. The fidelity quickly deteriorates and falls off to zero around the same time the oscillations in energy are observed.

Figure 19. Refer to the following caption and surrounding text. — **Figure 19.** Long-time behavior of figure 7 for QITE energy and fidelity of HM II system.
Download figure:
Standard image High-resolution image

Figure 20. Refer to the following caption and surrounding text. — **Figure 20.** Long-time behavior of figure 9 for QITE energy and fidelity of TFIM I system.
Download figure:
Standard image High-resolution image

Figure 21. Refer to the following caption and surrounding text. — **Figure 21.** Long-time behavior of figure 12 for QITE energy and fidelity of FHM system.
Download figure:
Standard image High-resolution image

Figure 20 shows the long-term behavior of system TFIM I, which corresponds to a one-dimensional long-range TFIM with periodic boundary conditions. Similar to figure 19, a quick fall of the fidelity can be observed, with random oscillations in the energy well above the ground state energy. Figure 21 shows the long-time behavior of the FHM system. While the smallest ν = 1 is diverging from the trotterized ITE quickly, already ν = 2 (which here corresponds to a Manhattan distance of 1 in figure 2(b), since both, the left- and right-hand side neighboring sites of the two NN sites of the support are included) shows a stable behavior - in fact just ast stable as ν = 3. Interestingly, the largest domain we consider, ν = 4, does not display the most stable behavior and starts deviating more strongly than the smaller domain sizes as τ increases.

Appendix E.: Long-time behavior molecular electronic structure systems

Figure 22 shows the long-term behavior of the Ne (0) system of table 1. The only stable fermionic-QITE realisation is given by ν^* = 7 which describes the case where the domain size always covers the entire active space. One can see that both, the coarse grained Δτ = 0.1 and the smaller time step Δτ = 0.01 both converge to the trotterized ITE, with the former trailing the latter.

Figure 22. Refer to the following caption and surrounding text. — **Figure 22.** Long-time behavior of figure 13 for QITE energy and fidelity of the Ne (0) system.
Download figure:
Standard image High-resolution image

Figures 23 and 24 show the long-term behavior of Fe(III)-NTA for the one and three unpaired electrons, respectively. Note, that in figure 23 at the final time τ = 10, the ITE fidelity is still only at roughly 96% but the ITE energy has reached chemical accuracy as it is within 0.5 mHa from the ground state energy, see table 3.

Figure 23. Refer to the following caption and surrounding text. — **Figure 23.** Long-time behavior of figure 23 for QITE energy and fidelity of Fe(III)-NTA (1) system.
Download figure:
Standard image High-resolution image

Figure 24. Refer to the following caption and surrounding text. — **Figure 24.** QITE energy and fidelity as in figure 16 but for the (5, 5) AS of Fe(III)-NTA (3) system.
Download figure:
Standard image High-resolution image

Figure 25 shows the long-time behavior of O₂ (0). Even more pronounced than Fe(III)-NTA (1) is the relatively slow increase in fidelity for QITE at ν^* = 5 and ITE. In order to ensure that the ground state is reached in the long-term limit τ = 100, we perform a simulation of the exact ITE, i.e. $| {\rm{\Psi }}\rangle \propto {e}^{-\tau \hat{H}}| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle$ in figure 26. We see that the trotterized ITE and exact ITE evolutions coincide (we stopped the simulation of trotterized ITE at τ = 10) and that indeed the ground state of O₂ (0) is reached eventually.

Figure 25. Refer to the following caption and surrounding text. — **Figure 25.** Long-time behavior of figure 17 for QITE energy and fidelity of O₂ system with zero unpaired electrons.
Download figure:
Standard image High-resolution image

Figure 26. Refer to the following caption and surrounding text. — **Figure 26.** Comparison of trotterized ITE with step size Δτ = 0.1 and exact ITE for O₂ (0).
Download figure:
Standard image High-resolution image

Figure 27 shows the long-time behavior of O₂ (2). In comparison to O₂ (0) in figure 25, one can observe that the ITE and QITE evolution at ν^* = 5 quickly converge to the ground state.

Figure 27. Refer to the following caption and surrounding text. — **Figure 27.** Long-time behavior of figure 17 for QITE energy and fidelity of O₂ with two unpaired electrons.
Download figure:
Standard image High-resolution image

Figure 28 displays the long-time behavior of O₃ (0) until τ = 10. In order to show the difference between QITE and trotterized ITE still present even at exact QITE, we show the behavior of QITE at ν^* = 8 for two discretization step sizes Δτ = 0.1 and Δτ = 0.01, as well as the results of trotterized ITE with step size Δτ = 0.1 in figure 29. Note, that we only display the results from time τ ∈ [4, 10]. One can see the effect of the error due to the expansion error made when deriving equation (22), by comparing QITE and ITE with the same step size Δτ = 0.1, and how this error can be reduced by moving to a smaller step size Δτ = 0.01.

Figure 28. Refer to the following caption and surrounding text. — **Figure 28.** Long-time behavior of figure 18 for QITE energy and fidelity of O₃ with zero unpaired electrons.
Download figure:
Standard image High-resolution image

Figure 29. Refer to the following caption and surrounding text. — **Figure 29.** Difference between trotterized ITE and exact QITE (i.e. ν^*) at finite discretization step sizes Δτ of figure 18 for O₃ with zero unpaired electrons.
Download figure:
Standard image High-resolution image

Appendix F.: Molecular structures

The molecular structures of the studied systems in section 5.2 are presented in tables 5, 6, 7, 8, 9, respectively.

Table 5. Cartesian coordinates of Fe(III)-NTA (1) in Å taken from [103] (optimized at the density functional theory level).

C	−1.0290441	−0.9471117	−1.9865887
C	−2.2505436	−0.7922495	0.1582250
C	−0.2593185	−2.2544373	0.0162178
O	−1.3454085	0.8783278	−3.5342318
O	−1.1347371	1.1483476	1.0151618
O	0.8453748	−2.7999130	2.0922098
O	−0.2780587	1.3040005	−1.6070564
O	−3.2498211	0.8000765	1.6534429
C	−0.9188318	0.5115628	−2.4417824
C	−2.2565356	0.4618854	1.0159078
C	0.5342286	−1.9274094	1.2848894
N	−0.9221422	−1.0235290	−0.5013883
Fe	0.2526900	0.4497192	0.0039380
O	1.8221685	−0.2815654	−1.0936915
H	−1.9555949	−1.4065777	−2.3473606
H	−0.1824638	−1.4867161	−2.4264081
H	−2.5130292	−1.6526686	0.7833244
H	−3.0320380	−0.6959329	−0.6040366
H	0.4617231	−2.6044401	−0.7316186
H	−0.9840969	−3.0551564	0.1969690
H	2.1227503	0.3853924	−1.7373941
H	2.5891208	−0.4447721	−0.5150575
O	0.9052095	−0.6681693	1.3941789
O	1.4573655	1.9796179	0.5273073
H	1.0868407	2.8424014	0.2685815
H	1.5840897	2.0352695	1.4913642

Table 6. Cartesian coordinates of Fe(III)-NTA (3) in Å taken from [103] (optimized at the density functional theory level).

C	−1.0631696	−0.8986273	−1.9564065
C	−2.2726265	−0.8715147	0.2176117
C	−0.2213187	−2.2369457	0.0042160
O	−1.3060002	0.9464995	−3.4925138
O	−1.4204678	1.2219942	1.0540575
O	0.8952750	−2.7897046	2.0745880
O	−0.2503233	1.3405105	−1.5583148
O	−3.5805554	0.7176234	1.4194742
C	−0.9043439	0.5588651	−2.4011021
C	−2.4669757	0.4632095	0.9502270
C	0.5498120	−1.9156119	1.2884213
N	−0.9483701	−1.0225278	−0.4731344
Fe	0.2137470	0.5250676	0.0955542
O	1.9886683	−0.4000745	−1.1969343
H	−2.0097221	−1.3160535	−2.3146150
H	−0.2431632	−1.4604326	−2.4182724
H	−2.3697581	−1.6649618	0.9671875
H	−3.0803148	−1.0078491	−0.5091128
H	0.5200916	−2.5218690	−0.7507292
H	−0.9090669	−3.0759629	0.1513051
H	2.2755494	0.2246208	−1.8844300
H	2.7845954	−0.5673481	−0.6640381
O	0.8682823	−0.6398588	1.4466564
O	1.4404474	2.0311775	0.6137125
H	1.3158602	2.8791116	0.1523929
H	1.5437445	2.2366153	1.5593020

Table 7. Cartesian coordinates of O₂ (0) in Å taken from [115] (experimental).

O	0.0	0.0	0.607800
O	0.0	0.0	−0.607800

Table 8. Cartesian coordinates of O₂ (2) in Å taken from [115] (experimental).

O	0.0	0.0	0.603760
O	0.0	0.0	−0.603760

Table 9. Cartesian coordinates of O₃ (0) in Å taken from [116] (experimental).

O	0.0	0.0
O	0.0	1.2717000
O	1.1383850	1.8385340

Appendix G.: Mutual information for the studied molecular electronic structure Hamiltonians

Figures 30, 31, 32, 33, 34 display the mutual information of the MPS representation of the ground state of the AS Hamiltonian describing the various systems of table 1.

Figure 30. Refer to the following caption and surrounding text. — **Figure 30.** Mutual information I(i, j) of Ne (0).
Download figure:
Standard image High-resolution image

Figure 31. Refer to the following caption and surrounding text. — **Figure 31.** Mutual information I(i, j) of Fe(III)-NTA (1).
Download figure:
Standard image High-resolution image

Figure 32. Refer to the following caption and surrounding text. — **Figure 32.** Mutual information I(i, j) of Fe(III)-NTA (3).
Download figure:
Standard image High-resolution image

Figure 33. Refer to the following caption and surrounding text. — **Figure 33.** Mutual information I(i, j) of O₂ (0).
Download figure:
Standard image High-resolution image

Figure 34. Refer to the following caption and surrounding text. — **Figure 34.** Mutual information I(i, j) of O₂ (2).
Download figure:
Standard image High-resolution image

Figure 32 displays the mutual information of the MPS representation of the ground state of the (5, 5) AS Hamiltonian describing system Fe(III)-NTA (3) system in table 1. The amount of mutual information in figure 31 is significantly larger than that in figure 32. This is accompanied by a stark contrast in the Z_s(1)-diagnostic (Z_s(1) = 0.1839 and Z_s(1) = 0.0459) and the fidelities of the respective ROHF states (F_ROHF = 0.89 and F_ROHF = 0.98) 4. Figures 33 and 34 display the mutual information of the MPS representation of the ground states of the (8, 6) AS Hamiltonians describing the singlet and triplet oxygen systems described in table 1. One can again observe a stark contrast in the total amount of mutual information when comparing the system whose ITE requires a long time evolution, O₂ (0) with the system that converges rather quickly to the ground state, O₂ (2), accompanied with a factor two difference in the diagnostics Z_s(1) = 0.2607 and Z_s(1) = 0.1285, respectively.

Appendix H.: Variational QITE

In this appendix we try to make a heuristic argument as to why we believe that QITE will not be an algorithm that can run on noisy quantum computers (QC), but will require some level of error correction. Our strategy is to show that a related algorithm which requires significantly less circuit depth, but essentially requires similar type of function evaluations on a QC, already struggles at beating the results provided by a simple classical mean-field theory (namely, GHF). In order to show this, we use the variational QITE (vQITE) algorithm[130], which is implemented in Qiskit [133] and thus allows for the use of Qiskit’s noise and error mitigation simulation tools.

The vQITE algorithm is an algorithm designed to approximate the ground state of a quantum many-body Hamiltonian. Unlike QITE, in its basic formulation it constructs an Ansatz for the ground state wave function that can be described by a sequence of unitary operators ${\hat{U}}_{j}({\theta }_{j})$ of fixed length K, $| {\rm{\Psi }}({\boldsymbol{\theta }})\rangle ={\prod }_{j=1}^{K}{\hat{U}}_{j}({\theta }_{j})| {{\rm{\Psi }}}_{\,\rm{init}\,}\rangle$ , where ${\boldsymbol{\theta }}={({\theta }_{1},\ldots ,{\theta }_{K})}^{T}$ . This is similar to the variational quantum eigensolver [23], with the exception that the parameters θ need not be optimized classically, but are instead chosen based on the evaluation of the Fubiny-Study metric tensor on a QC. This is equivalent to an evolution of the state vector where the parameters θ are evolved following an imaginary time evolution of the state Ansatz ∣Ψ(θ)〉 [134].

In principle, vQITE could serve as a candidate to compare the performance of QITE against. By design, unlike QITE whose quantum circuit depth grows with the time τ, the gate depth of vQITE is fixed by the Ansatz used for generating ∣Ψ(θ)〉. Still, it also requires a similar discretization of imaginary time τ into small step sizes Δτ, and at each time step a complete evaluation of the Fubiny-Study metric tensor in order to perform the update θ_j(τ) → θ_j(τ + Δτ). While it is important to compare the performance of quantum algorithms against each other, the amount of approximations and assumptions we make in order to speed up the numerical calculations would make for an extremely biased and unfair comparison of the two fundamentally different approaches QITE and vQITE. Thus, instead of trying to compare these two algorithms directly, we tried to get a feeling of how much effort would be realistically needed for a fixed circuit depth Ansatz as vQITE to be able to obtain energies which lie below those generated by a simple classical method such as GHF introduced in Appendix 4.3.1. Due to the fact that QITE will result in much deeper circuits than vQITE in practice, this will provide a rough idea of why we believe that QITE might only be executable for large system sizes on an error-corrected QC. Since this analysis is completely decoupled from the rest of the paper, we have only included it in the Appendix.

For this purpose, we study here the vQITE method with simulated hardware noise, including error mitigation (EM) for a variational Hamiltonian Ansatz (HVA) [135]. The circuit of HVA is made up of products of exponentials of the terms ${\hat{H}}_{j}$ , that appear in the system Hamiltonian $\hat{H}={\sum }_{j}{\hat{H}}_{j}$ , multiplied by variational parameters θ_j(τ), applied to some initial state. We trotterize the exponentials using one Trotter step. To the best of our knowledge, the vQITE method has so far only been studied on noisy hardware using hardware-efficient ansätze, e.g. in [136, 137]. Since the HVA is often used as a physics-motivated Ansatz in many variational algorithms [138, 139], a numerical analysis of its use in vQITE with simulated noise, may therefore benefit future studies on the topic. Of particular interest in this appendix, is to investigate the amount of necessary shots and required hardware fidelity, in order for the vQITE algorithm, using HVA, to drop below the energy of the GHF solution for a simple model.

We consider a simple 1D XXZ spin model with periodic boundary conditions (PBC):

$\begin{eqnarray*}&&{\hat{H}}_{\rm{XXZ}\,}^{\,\rm{PBC}}=-J\displaystyle \sum _{k=1}^{L=4}\left({\hat{\sigma }}_{k}^{x}{\hat{\sigma }}_{k+1}^{x}+{\hat{\sigma }}_{k}^{y}{\hat{\sigma }}_{k+1}^{y}+{\rm{\Delta }}\left[{\hat{\sigma }}_{k}^{z}{\hat{\sigma }}_{k+1}^{z}-{\bf{1}}\right]\right)\end{eqnarray*}$

with J = 1, Δ = − 0.2 on L = 4 sites and ${\hat{\sigma }}_{L+1}={\hat{\sigma }}_{1}$ due to PBC. For these parameter values, the system cannot be exactly solved with the GHF solution and therefore is an interesting test case for the vQITE method. The XXZ Hamiltonian is usually studied as a deformation that arises due to anisotropy in the z-direction of the Heisenberg model [140], that may be viewed as an effective model for generalized Hubbard models with a broken symmetry [141]. The Hamiltonian terms of the HVA are taken to be ${\hat{\sigma }}_{k}^{x}{\hat{\sigma }}_{k+1}^{x}+{\hat{\sigma }}_{k}^{y}{\hat{\sigma }}_{k+1}^{y}$ and ${\hat{\sigma }}_{k}^{z}{\hat{\sigma }}_{k+1}^{z}$ where k is even or odd. Following appendix B of [138], we use L/2 = 2 layers of the HVA, resulting in a total of 8 parameters for the vQITE evolution. The initial state is chosen to be the ground state of the terms ${\hat{\sigma }}_{k}^{x}{\hat{\sigma }}_{k+1}^{x}+{\hat{\sigma }}_{k}^{y}{\hat{\sigma }}_{k+1}^{y}+{\hat{\sigma }}_{k}^{z}{\hat{\sigma }}_{k+1}^{z}$ for even sites k, which is an easy to prepare Bell-state (cf figure 2 in [138] for the respective quantum circuit).

We execute the vQITE algorithm with a time step Δτ = 0.1 and include three types of noise: shot noise, readout-, and gate errors. The circuits for a vQITE step and energy evaluation are executed with M_evol = 8192 and M_H = 10⁶ shots respectively. For the readout error, we introduce a parameter p_0∣1, for the probability that a single qubit is falsely measured to be 0, instead of the correct value 1. The parameter for falsely measuring a 1, given the correct value 0, is fixed for simplicity to ${p}_{1| 0}=\frac{{p}_{0| 1}}{2}$ . The gate error is simulated as a depolarization channel with parameter p_depol and $\frac{{p}_{depol}}{100}$ for the two-qubit and single-qubit gates respectively. In order to mitigate the readout and gate errors we use Twirled Readout Error Extinction (TREX) [142] and Zero-Noise Extrapolation (ZNE) [143–145] respectively. For ZNE we perform a linear extrapolation with noise factors (1, 1.5, 2). To include gate errors, we transpile the 4-qubit circuit into a 1D chain of nearest-neighbour coupled qubits with native gates of IBM’s Falcon processors.

In figure 35 we show our numerical results including only shot and readout error, for two values of the readout error p_0∣1. The exact vQITE result (solid gray) converges to the exact ground state energy (dashed red), i.e. two layers of HVA with no noise suffices to reach the exact ground state. For a readout error p_0∣1 = 0.001, vQITE with EM (solid blue, square) improves upon the GHF solution (dotdashed violet). We also see that readout EM is crucial to achieve a good result. A readout error of p_0∣1 = 0.01, roughly the measurement errors of current hardware, has however too large a variation to distinguish the exact result from the GHF solution, while its result without TREX lies outside the chosen y-window. Figure 36 shows the numerical result with shot error, a fixed readout error of p_0∣1 = 0.01 and varying gate error p_depol, including TREX and ZNE. For a gate error p_depol = 0.001, vQITE (solid blue) slightly beats the GHF solution, though struggles to distinguish it from the exact result due to large variations. The results of p_depol = 0.01, 0.005 lie above and outside of the chosen y-interval. Therefore, a maximal hardware gate error of p_depol = 0.001 can be allowed to have some slight improvement on the GHF solution. At the same time however, the time evolutions with p_depol = 0.01, 0.001, including TREX and ZNE find variational parameters that improve upon the GHF solution, as seen from the dashed orange and dotdashed green plots ’exact H’. This observation, that a vQITE evolution including errors may still converge to variational parameters close to the global minima, was also seen in [137].

Figure 35. Refer to the following caption and surrounding text. — **Figure 35.** Energy evaluation of vQITE simulations with shot and varying readout error, for 20 time steps, including TREX EM. The error bars are the standard deviation of the energy using M_H shots. The exact ground state (dashed red) and GHF (dotdashed violet) energy equal −6.871 and −6.857 respectively. The exact vQITE plot (solid gray) is the noiseless vQITE result. The dashed blue, square vQITE plot contains no TREX EM.
Download figure:
Standard image High-resolution image

Figure 36. Refer to the following caption and surrounding text. — **Figure 36.** Same as in figure 35, but including a varying gate error p_depol and ZNE EM. The readout error is fixed to p_0∣1 = 0.01. For the dotdashed green and dashed orange plots, a noisy vQITE evolution is performed, but the final energy is evaluated *exactly* for the Ansatz circuit with the vQITE parameters found from the noisy evolution.
Download figure:
Standard image High-resolution image

We conclude that even for this simple case with only 4 qubits, vQITE with HVA struggles to beat the GHF solution. A slight improvement on GHF for this model may be expected, for hardware errors of p_0∣1, p_depol ∼ 0.001, which is however at the extreme limit of today’s best QC hardwares. Larger system sizes will further increase the noise, making vQITE with the HVA struggle disproportionately more. The QITE algorithm, having vastly deeper circuits, will therefore likely not be applicable to large system sizes without some level of error correction.

Dates

Peer review information

2.3.1. QITE workflow

2.3.2. QITE introduction

2.3.3. Spin-QITE

2.3.4. Fermionic-QITE

3.1.1. Heisenberg model

3.1.2. Transverse-field Ising model

3.1.3. Fermi-Hubbard model

4.3.1. Generalized Hartree–Fock theory

4.3.2. GHF applied to a generic spin Hamiltonian

5.1.1. Heisenberg model

5.1.2. Transverse-field Ising model

5.1.3. J₁J₂-model

5.1.4. Fermi-Hubbard model

5.2.1. Neon

5.2.2. Fe(III)-NTA

5.2.3. Oxygen and ozone

A.3.1. Even parity sector

A.3.2. Odd parity sector

A.5.1. Heisenberg model

A.5.2. Transverse-field Ising model

Generating approximate ground states of strongly correlated quantum many-body systems through quantum imaginary time evolution

Author notes

Author notes

Notes

Article metrics

Submit

Share this article

Dates

Peer review information

Abstract

1. Introduction

2. Methods

2.1. Hamiltonian terms

2.2. Trotterized ITE

2.3. The QITE algorithm

2.3.1. QITE workflow

2.3.2. QITE introduction

2.3.3. Spin-QITE

2.3.4. Fermionic-QITE

2.4. Mutual information for domain selection in molecular electronic structure Hamiltonians

3. Studied systems

3.1. Lattice Hamiltonians

3.1.1. Heisenberg model

3.1.2. Transverse-field Ising model

3.1.3. Fermi-Hubbard model

3.2. Molecular electronic structure Hamiltonians

4. Initial state preparation

4.1. Initial states for molecular electronic structure Hamiltonians

4.2. Initial states for fermionic lattice Hamiltonians

4.3. Initial states for spin lattice Hamiltonians

4.3.1. Generalized Hartree–Fock theory

4.3.2. GHF applied to a generic spin Hamiltonian

5. Results

5.1. Lattice systems

5.1.1. Heisenberg model

5.1.2. Transverse-field Ising model

5.1.3. J1J2-model

5.1.4. Fermi-Hubbard model

5.2. Molecular electronic structure systems

5.2.1. Neon

5.2.2. Fe(III)-NTA

5.2.3. Oxygen and ozone

6. Summary and outlook

Acknowledgments

Data availability statement

Appendix A.: Generalized Hartree–Fock theory

A.1. Fermionic Gaussian states

A.2. Minimizing the energy of a fermionic Gaussian state

A.3. Generating a pure fermionic Gaussian state from a covariance matrix

A.3.1. Even parity sector

A.3.2. Odd parity sector

A.4. Generating a fermionic Gaussian state in OpenFermion

A.5. Energy expectation values and mean-field terms

A.5.1. Heisenberg model

A.5.2. Transverse-field Ising model

Appendix B.: Derivation of the linear system of equations for QITE

Appendix C.: Connection between mutual information and single orbital entropy

Appendix D.: Long-time behavior lattice systems

Appendix E.: Long-time behavior molecular electronic structure systems

Appendix F.: Molecular structures

Appendix G.: Mutual information for the studied molecular electronic structure Hamiltonians

Appendix H.: Variational QITE

Footnotes

You may also like

5.1.3. J₁J₂-model

A.4. Generating a fermionic Gaussian state in `OpenFermion`