• Tidak ada hasil yang ditemukan

Classical Simulability of Level- 1 RQAOA for Ising Models

Dalam dokumen Quantum Information at High and Low Energies (Halaman 143-149)

OBSTACLES TO STATE PREPARATION AND VARIATIONAL OPTIMIZATION FROM SYMMETRY PROTECTION

C.4 Classical Simulability of Level- 1 RQAOA for Ising Models

Suppose 𝐽 is a real symmetric matrix of size 𝑛. Here we consider Ising-like cost functions such that the corresponding Hamiltonian is

𝐻 = βˆ‘οΈ

1≀𝑝 <π‘žβ‰€π‘›

𝐽𝑝,π‘žπ‘π‘π‘π‘ž .

The mean values of a Pauli operatorπ‘π‘π‘π‘žon the level-1 QAOA state

|Ψ𝐻(𝛽, 𝛾)i =𝑒𝑖 𝛽 𝐡𝑒𝑖 𝛾 𝐻|+𝑛i

can be computed in time 𝑂(𝑛) using an explicit analytic formula. Such a formula was derived for the Max-Cut cost function by Wang et al. [24, Theorem 1]. Below we provide a generalization to general Ising Hamiltonians. Since the total number of terms in the cost function is𝑂(𝑛2), simulating each step ofRQAOAtakes time at most𝑂(𝑛3). Assuming that𝑛𝑐=𝑂(1), the number of steps is roughly𝑛so that the full simulation cost is𝑂(𝑛4). Crucially, the simulation cost of this method does not depend on the depth of the variational circuit. This is important because RQAOA may potentially increase the depth from𝑂(1)to𝑂(𝑛)since it adds many new terms to the cost function.

Lemma 3.C.1. Fix a pair of qubits 1 ≀ 𝑒 < 𝑣 ≀ 𝑛. Let 𝑐 = cos(2𝛽) and

𝑠 =sin(2𝛽). Then

hΨ𝐻(𝛽,1) |𝑍𝑒𝑍𝑣|Ψ𝐻(𝛽,1)i = (𝑠2/2) Γ–

𝑝≠𝑒,𝑣

cos[2𝐽𝑒, π‘βˆ’2𝐽𝑣 , 𝑝] βˆ’ (𝑠2/2) Γ–

𝑝≠𝑒,𝑣

cos[2𝐽𝑒, 𝑝+2𝐽𝑣 , 𝑝] +𝑐 𝑠·sin(2𝐽𝑒,𝑣)

"

Γ–

𝑝≠𝑒,𝑣

cos(2𝐽𝑒, 𝑝) + Γ–

𝑝≠𝑒,𝑣

cos(2𝐽𝑣 , 𝑝)

# .

Here we only consider the case𝛾 =1 since𝛾can be absorbed into the definition of 𝐽.

Proof. Given a 2-qubit observable𝑂, define the mean value πœ‡(𝑂) =hΨ𝐻(𝛽,1) |𝑂𝑒,𝑣|Ψ𝐻(𝛽,1)i. We are interested in the observable𝑂 =𝑍 𝑍 ≑ 𝑍 βŠ— 𝑍.

We note that all terms in 𝐻 and 𝐡 that act trivially on {𝑒, 𝑣} do not contribute to πœ‡(𝑂). Such terms can be set to zero. Given a 2-qubit observable𝑂, define a mean value

πœ‡0(𝑂)= h+𝑛|𝑒𝑖 𝐻

0

𝑂𝑒,π‘£π‘’βˆ’π‘– 𝐻

0|+𝑛i, where 𝐻0= βˆ‘οΈ

𝑝≠𝑒,𝑣

(𝐽𝑒, 𝑝𝑍𝑒+𝐽𝑣 , 𝑝𝑍𝑣)𝑍𝑝. Using the identities

𝑒𝑖 𝛽(𝑋𝑒+𝑋𝑣)π‘π‘’π‘π‘£π‘’βˆ’π‘– 𝛽(𝑋𝑒+𝑋𝑣) = 𝑐2𝑍𝑒𝑍𝑣+𝑠2π‘Œπ‘’π‘Œπ‘£+𝑐 𝑠(π‘π‘’π‘Œπ‘£+π‘Œπ‘’π‘π‘£), 𝑒𝑖 𝐽𝑒 , π‘£π‘π‘’π‘π‘£π‘π‘’π‘π‘£π‘’βˆ’π‘– 𝐽𝑒 , 𝑣𝑍𝑒𝑍𝑣 = 𝑍𝑒𝑍𝑣,

𝑒𝑖 𝐽𝑒 , π‘£π‘π‘’π‘π‘£π‘Œπ‘’π‘Œπ‘£π‘’βˆ’π‘– 𝐽𝑒 , 𝑣𝑍𝑒𝑍𝑣 = π‘Œπ‘’π‘Œπ‘£

𝑒𝑖 𝐽𝑒 , π‘£π‘π‘’π‘π‘£π‘π‘’π‘Œπ‘£π‘’βˆ’π‘– 𝐽𝑒 , 𝑣𝑍𝑒𝑍𝑣 = cos(2𝐽𝑒,𝑣)π‘π‘’π‘Œπ‘£+sin(2𝐽𝑒,𝑣)𝑋𝑣, 𝑒𝑖 𝐽𝑒 , π‘£π‘π‘’π‘π‘£π‘Œπ‘’π‘π‘£π‘’βˆ’π‘– 𝐽𝑒 , 𝑣𝑍𝑒𝑍𝑣 = cos(2𝐽𝑒,𝑣)π‘Œπ‘’π‘π‘£+sin(2𝐽𝑒,𝑣)𝑋𝑒, and noting thatπœ‡0(𝑍 𝑍)=0, one easily gets

πœ‡(𝑍 𝑍) =𝑠2Β·πœ‡0(π‘Œ π‘Œ) +𝑐 𝑠·cos(2𝐽𝑒,𝑣) [πœ‡0(π‘π‘Œ) +πœ‡0(π‘Œ 𝑍)] +𝑐 𝑠·sin(2𝐽𝑒,𝑣) [πœ‡0(𝑋 𝐼) +πœ‡0(𝐼 𝑋)]. Using the explicit form of𝐻0, one gets

π‘’βˆ’π‘– 𝐻

0|+𝑛i= 1 2

βˆ‘οΈ

π‘Ž,𝑏=0,1

|π‘Ž, 𝑏i𝑒,𝑣 βŠ— |Ξ¦(π‘Ž, 𝑏)ielse, where|Ξ¦(π‘Ž, 𝑏)i is a tensor product state ofπ‘›βˆ’2 qubits defined by

|Ξ¦(π‘Ž, 𝑏)i = Ì

𝑝≠𝑒,𝑣

|𝐽𝑒, 𝑝(βˆ’1)π‘Ž+𝐽𝑣 , 𝑝(βˆ’1)𝑏i𝑝 where |πœƒi β‰‘π‘’βˆ’π‘– πœƒ 𝑍|+i.

Combining Eqs. (3.25) and (3.28), one gets πœ‡0(𝑂) = (1/4) βˆ‘οΈ

π‘Ž,𝑏,π‘Ž0,𝑏0=0,1

hπ‘Ž0, 𝑏0|𝑂|π‘Ž, 𝑏i Β· hΞ¦(π‘Ž0, 𝑏0) |Ξ¦(π‘Ž, 𝑏)i.

Using the tensor product form of the states |Ξ¦(π‘Ž, 𝑏)i and the identity hπœƒ0|πœƒi = cos(πœƒβˆ’πœƒ0)gives

hΞ¦(π‘Ž0, 𝑏0) |Ξ¦(π‘Ž, 𝑏)i = Γ–

𝑝≠𝑒,𝑣

cos[𝐽𝑒, 𝑝(βˆ’1)π‘Žβˆ’π½π‘’, 𝑝(βˆ’1)π‘Ž0+𝐽𝑣 , 𝑝(βˆ’1)π‘βˆ’π½π‘£ , 𝑝(βˆ’1)𝑏0]. From Eqs. (3.30) and (3.31), one can easily compute the mean valueπœ‡0(𝑂) for any 2-qubit observable.

Consider first the case𝑂 =π‘Œ π‘Œ. Then the only terms contributing to Eq. (3.30) are those withπ‘Ž0=π‘ŽβŠ•1 and𝑏0=π‘βŠ•1. The identityhπ‘ŽβŠ•1|π‘Œ|π‘Ži=βˆ’π‘–(βˆ’1)π‘Žgives

πœ‡0(π‘Œ π‘Œ) =βˆ’(1/4) βˆ‘οΈ

π‘Ž,𝑏=0,1

(βˆ’1)π‘Ž+𝑏 Γ–

𝑝≠𝑒,𝑣

cos[2𝐽𝑒, 𝑝(βˆ’1)π‘Ž+2𝐽𝑣 , 𝑝(βˆ’1)𝑏], that is,

πœ‡0(π‘Œ π‘Œ)= (1/2) Γ–

𝑝≠𝑒,𝑣

cos[2𝐽𝑒, π‘βˆ’2𝐽𝑣 , 𝑝] βˆ’ (1/2) Γ–

𝑝≠𝑒,𝑣

cos[2𝐽𝑒, 𝑝+2𝐽𝑣 , 𝑝].

Next, consider the case𝑂 =π‘Œ 𝑍. Note that the matrix elementshπ‘Ž0, 𝑏0|𝑂|π‘Ž, 𝑏ihave zero real part. From Eqs. (3.30) and (3.31), one infers that πœ‡0(π‘Œ 𝑍) has zero real part. This implies

πœ‡0(π‘Œ 𝑍)= πœ‡0(π‘π‘Œ) =0.

Finally, consider the case𝑂 =𝑋 𝐼. Then the only terms that contribute to Eq. (3.30) are those withπ‘Ž0=π‘Ž βŠ•1 and 𝑏0=𝑏. We get

πœ‡0(𝑋 𝐼) = Γ–

𝑝≠𝑒,𝑣

cos(2𝐽𝑒, 𝑝).

Here we noted that the inner product Eq. (3.31) withπ‘Ž0=π‘ŽβŠ•1 and𝑏0=𝑏does not depend onπ‘Ž, 𝑏. By the same argument,

πœ‡0(𝐼 𝑋) = Γ–

𝑝≠𝑒,𝑣

cos(2𝐽𝑣 , 𝑝).

Combining Eq. (3.27) and Eqs. (3.33),(3.34),(3.35),(3.36), one arrives at Eq. (3.23).

For more general cost functions that include interactions among three or more variables, there are two complications: First, unlike in the Ising case, the variable elimination process will typically increase the degree of non-locality of interactions.

Second, mean values of Pauli operators on the QAOA stateΨ𝐻(𝛽, 𝛾) lack a simple analytic formula (as far as we know). However, one can approximately compute the mean values using the Monte Carlo method due to Van den Nest [30]. A specialization of this method to simulation of the level-1 QAOA is described in [31].

The Monte Carlo simulator has runtime scaling polynomially with the number of qubits, number of terms in the cost function, and the inverse error tolerance, see [31]

for details. This method also requires no restrictions on the depth of the variational circuit.

An important distinction between QAOA and RQAOA lies in the measurement step.

QAOA requires few-qubit measurements to estimate the variational energy as well as the final𝑛-qubit measurement that assigns a value to each individual variable. This last step is what makes QAOA hard to simulate classically and may lead to a quantum advantage [32]. In contrast, RQAOA only needs few-qubit measurements to estimate mean values of individual terms in the cost function. The𝑛-qubit measurement step is replaced by the correlation rounding that eliminates variables one by one. One may ask whether the lack of multi-qubit measurements also precludes a quantum advantage. Indeed, in the special case of level-1 variational circuits and the Ising-like cost function RQAOA can be efficiently simulated classically, see above. However, level-𝑝 RQAOA with 𝑝 > 1 as well as level-1 RQAOA with more general cost functions are not known to be classically simulable in polynomial time, leaving room for a quantum advantage.

3.D Comparison of QAOA, RQAOA, and Classical Algorithms D.1 QAOA versus Classical Local Algorithms

In this section, we discuss another limitation of QAOA which results from its locality and the covariance condition discussed in Lemma 3.A.2: we compare QAOA to a certain very simple classical local algorithm (see Lemma 3.D.1 below). We show that there is an exponential number of problem instances for which the classical local algorithm outperforms QAOA.

Let us briefly sketch the notion of a local classical algorithm. We envision that the tuple(𝐽𝑒)π‘’βˆˆπΈ is given as input. Here we are interested in algorithms which are local

with respect to the underlying graph𝐺. Forπ‘Ÿ ∈Nand𝑣 βˆˆπ‘‰, define πΈπ‘Ÿ(𝑣)=

π‘Ÿ

Ø

β„“=1

Ø

(𝑒

1,...,𝑒ℓ) path withπ‘£βˆˆπ‘’

1

{𝑒

1, . . . , 𝑒ℓ}

to be the set of edges that belong to a path starting at 𝑣 of length bounded by π‘Ÿ. Consider a classical algorithmAwhich on input{𝐽𝑒}π‘’βˆˆπΈoutputsπ‘₯ = (π‘₯

1, . . . , π‘₯𝑛) ∈ {0,1}𝑛. We say that A isπ‘Ÿ-local if there is a family of functions {𝑔𝑣 : RπΈπ‘Ÿ(𝑣) β†’ {0,1}}π‘£βˆˆπ‘‰ such that the following holds for every problem instance (𝐽𝑒)π‘’βˆˆπΈ ∈ R𝐸: We have

π‘₯𝑣 =𝑔𝑣 {𝐽𝑒}π‘’βˆˆπΈπ‘Ÿ(𝑣)

for every𝑣 βˆˆπ‘‰ .

In other words, in anπ‘Ÿ-local classical algorithm, every output bitπ‘₯𝑣only depends on edge weights𝐽𝑒belonging to paths of length bounded byπ‘Ÿstarting at𝑣. We note that this definition can easily be generalized to the probabilistic case (e.g., by including local random bits). For the purposes of this section, deterministic functions turn out to be sufficient.

The (choice of) family{𝑔𝑣}π‘£βˆˆπ‘‰ can be considered as a set of variational parameters for the classical algorithm. To keep the number of variational parameters constant, we consider vertex-transitive graphs 𝐺. Fix π‘£βˆ— ∈ 𝑉. For every 𝑣 ∈ 𝑉, fix an automorphism πœ‹π‘£ of 𝐺 such that πœ‹π‘£(π‘£βˆ—) = 𝑣. Then the sets πΈπ‘Ÿ(𝑣) for different 𝑣 ∈ 𝑉 can be identified via πΈπ‘Ÿ(𝑣) = πœ‹π‘£(πΈπ‘Ÿ(π‘£βˆ—)). We say that anπ‘Ÿ-local classical algorithm is uniformif (after this identification)𝑔𝑣 ≑ 𝑔 for all 𝑣 βˆˆπ‘‰, i.e., if there is a single function𝑔 : RπΈπ‘Ÿ(π‘£βˆ—) β†’ {0,1} specifying the behavior of the algorithm.

To obtain general-purpose algorithms (applicable to any instance), the function 𝑔 : RπΈπ‘Ÿ(π‘£βˆ—) β†’ {0,1} should be chosen adapatively (i.e., potentially depending on the instance). The definition of local classical algorithm sketched here includes e.g., the algorithms considered in Ref. [20], though it is slightly more general as the local functions can be arbitrary.

Let𝑛=6π‘Ÿ be a multiple of 6. Consider𝑛-qubit Hamiltonians (cf. (3.A)) of the form 𝐻(𝐽) = βˆ‘οΈ

π‘˜βˆˆZ𝑛

π½π‘˜π‘π‘˜π‘π‘˜+

1 where 𝐽 = (𝐽

0, . . . , π½π‘›βˆ’

1) ∈ {1,βˆ’1}𝑛.

To define locality and uniformity for the cycle graphZ𝑛, letπœ‹π‘£(𝑀) =𝑣+𝑀 (mod 𝑛) be chosen as translation modulo𝑛for𝑣 ∈Z𝑛. We show the following:

Lemma 3.D.1. There is a subsetS βŠ‚ {1,βˆ’1}𝑛of2𝑛/3problem instances such that the following holds:

(i) QAOA𝑝(𝐻(𝐽)) ≀ 𝑝/(𝑝+1) for every𝑝 ∈Nand every𝐽 ∈ S.

(ii) There is a1-local uniform classical algorithm such that for every 𝐽 ∈ S, the algorithm outputsπ‘₯ ∈ {0,1}𝑛such thathπ‘₯|𝐻(𝐽) |π‘₯i=1.

(iii) Level-1RQAOAachieves the approximation ratio1. Proof. For every𝑠= (𝑠

0, . . . , 𝑠

2π‘Ÿβˆ’1) ∈ {0,1}2π‘Ÿ, define𝐽 =𝐽(𝑠) ∈ {1,βˆ’1}𝑛by 𝐽3π‘Ž =𝐽

3π‘Ž+1 =(βˆ’1)π‘ π‘Ž, and 𝐽

3π‘Ž+2 =1,

for allπ‘Ž =0,1, . . . ,2π‘Ÿβˆ’1. We claim that the setS ={𝐽(𝑠) | 𝑠 ∈ {0,1}2π‘Ÿ}has the required properties. Consider an instance𝐻(𝐽(𝑠))with𝑠 ∈ S. Define

𝑋(𝑠) =

2π‘Ÿβˆ’1

Γ–

π‘Ž=0

𝑋3π‘Ž+1 . Then𝐻(𝐽(𝑠)) is related to𝐻

Z𝑛 =Í

π‘—βˆˆZ𝑛𝑍𝑗𝑍𝑗+

1by the gauge transformation 𝐻(𝐽(𝑠))= 𝑋(𝑠)𝐻

Z𝑛𝑋(𝑠)βˆ’1.

Since the QAOA algorithm is invariant under such gauge transformation (see Lemma 3.A.2), we obtain

QAOA𝑝(𝐻(𝐽(𝑠)))=QAOA𝑝(𝐻

Z𝑛) ≀ 𝑝 𝑝+1 where we use the bound

QAOA𝑝(𝐻MaxCut

Z𝑛 ) ≀ 2𝑝+1 2𝑝+2 ,

proven in [22] for even𝑛, in combination with Lemma 3.A.2. This shows (i).

For the proof of (ii), consider the classical algorithm A which on input 𝐽 = (𝐽

0, . . . , π½π‘›βˆ’

1)outputs

π‘₯𝑣 =𝑔(π½π‘£βˆ’

1, 𝐽𝑣) for every𝑣 ∈Z𝑛, where

𝑔(𝐽 , 𝐽0) =





ο£²



ο£³

1 if(𝐽 , 𝐽0)= (βˆ’1,βˆ’1) 0 otherwise.

Clearly, the algorithm A is uniform and 1-local, and it is easy to check that the output satisfieshπ‘₯|𝐻(𝐽) |π‘₯i=1.

The proof of (iii) is given as a part of Lemma 3.D.2.

Dalam dokumen Quantum Information at High and Low Energies (Halaman 143-149)