Chapter IV: A simplified and improved approach to tensor network operators
4.3 Matrix Product Operators (MPOs)
Since many detailed and comprehensive presentations of MPOs already exist [22, 45, 46, 94, 107], this section will simply contain a brief overview in order to establish notation, as well as some simple examples which we will call upon in later sections.
Overview
Consider a 1D system which has been discretized into ๐ฟ localized sites, each with a local Hilbert space H๐ of dimension ๐๐. A general operator ห๐ acting on such a system can be written as,
ห ๐=โ๏ธ
{๐ห๐}
๐๐1๐2...๐๐ฟ๐ห1๐ห2...๐ห๐ฟ, (4.1) where {๐ห๐} is the set of local operators acting on H๐ and ๐ is a rank-๐ฟ tensor with indices๐๐whose dimensions are equal to the cardinality of their respective set {๐ห๐}. ๐contains the weights associated with all possible configurations of the local operators ห๐๐.
By fixing the indices, a specific element ๐๐1๐2...๐๐ฟ of the tensor ๐ can then be decomposed into a product of matrices๐[๐],
๐๐1๐2...๐๐ฟ =โ๏ธ
{๐ผ}
๐๐1
๐ผ1[1]๐๐2
๐ผ1๐ผ2[2]... ๐๐๐ฟ
๐ผ๐ฟโ1[๐ฟ], (4.2) where๐ผindexes the so-called โvirtualโ or โauxiliaryโ indices which are introduced to perform the matrix multiplication. In Eq. (4.2) the๐๐are simply labels, intended to indicate that each matrix๐[๐] is chosen specifically so that their product repro- duces the element ๐๐1๐2...๐๐ฟ. However, if the labels are all reinterpreted as their corresponding indices from Eq. (4.1), then we see that the full tensor ๐ can be reconstructed as the contraction over rank-3 tensors๐[๐].
Similar to MPS, this decomposition of a rank-๐ฟ tensor into ๐ฟ rank-3 tensors is motivated by the fact that most operators of interest do not contain general๐ฟ-body interactions, but instead are usually limited to few-body terms. This means that, while in general this decomposition could be exponentially expensive, often the๐ tensor is quite sparse and such a transformation can be a highly efficient way to represent the full tensor.
Figure 4.1: Tensor network diagrams of (a) an MPO, (b) a gMPO, and (c) a PEPO.
It is common and frequently useful to associate the operators ห๐๐ with their corre- sponding coefficient tensor๐[๐]according to,
ห ๐๐ผ
๐โ1๐ผ๐[๐] =โ๏ธ
๐๐
๐๐๐
๐ผ๐โ1๐ผ๐[๐] ๐ห๐. (4.3) This yields matrices ห๐[๐] in which every element is a๐๐ร๐๐ local operator acting onH๐. The full operator ห๐ is thus reconstructed via simple matrix multiplication,
ห ๐ =โ๏ธ
{๐ผ}
ห ๐๐ผ
1[1]๐ห๐ผ
1๐ผ2[2] ...๐ห๐ผ
๐ฟโ1[๐ฟ], (4.4)
and the set of matrices{๐ห[๐]}are referred to as the MPO representation of ห๐. This form of an MPO is commonly used throughout the literature, and will be heavily utilized in the remainder of this work.
We will now relate the MPO form in Eq. (4.4) to the common diagrammatic repre- sentation, as seen in Fig. 4.1. Since every element of ห๐๐ผ
๐โ1๐ผ๐[๐] is itself a ๐๐ ร๐๐ matrix, each individual numerical element can be exposed by introducing two new indices ๐๐ and ๐โฒ
๐, each of dimension ๐๐. By fixing each of ๐ผ๐โ1, ๐ผ๐, ๐๐, and ๐โฒ
๐, the expression (๐ห๐ผ
๐โ1๐ผ๐[๐])๐๐๐โฒ
๐ yields a single number. More commonly written as ห๐
๐๐๐โฒ
๐
๐ผ๐โ1๐ผ๐[๐], the correspondence to the rank-4 tensors shown in MPO diagrams becomes apparent. The new indices ๐๐ and ๐โฒ
๐ are the so-called โphysicalโ indices, which map the action of the local operators onto the corresponding site tensors of an MPS.
Examples
Frequently the operator that one wants to encode as an MPO is a Hamiltonian ห๐ป, so that the DMRG algorithm can be used to find its ground state in the form of an MPS.
Here we will explicitly write out the well-known matrices ห๐[๐] which make up the MPO representations of several common Hamiltonians consisting of 1- and 2-body terms. There are multiple techniques that can be used to derive these matrices, each with their own conventions and notation, but in this work we will remain agnostic to these different languages in an attempt to make the presentation in the following sections as conceptually simple and widely accessible as possible. To do so, we will simply refer back to these explicit examples. In lieu of derivations we will point to helpful references for readers who do not already have a preferred technique for understanding the form of MPO matrices.
Nearest-neighbor interactions
Consider a system of ๐ฟ sites, which are indexed by๐ โ {1,2, ..., ๐ฟ}, and a Hamil- tonian consisting of local terms and nearest-neighbor interactions of the form
ห
๐ป = ร๐ฟ
๐=1๐ถห๐ +ร๐ฟโ1
๐=1 ๐ดห๐๐ตห๐+1. In the MPO literature this Hamiltonian is usually written with ห๐ต= ๐ดหso that the interaction is symmetric and ห๐ปis Hermitian, however in this paper we will always keep the operators distinct for purposes of notational clarity, even though this means that some Hamiltonians under consideration will be non-Hermitian when ห๐ต โ ๐ดห. The MPO matrices for this Hamiltonian, denoted
ห
๐๐ ๐, are given by,
๐ห๐ ๐[1] = ห
๐ถ ๐ดห ๐ผห
, ๐ห๐ ๐[๐ฟ] = ห
๐ผ ๐ตห ๐ถห ๐
, ห
๐๐ ๐[๐] =ยฉ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0 ห
๐ต ห0 ห0 ๐ถห ๐ดห ๐ผห
ยช
ยฎ
ยฎ
ยฌ
, (4.5)
where ห๐ผ is the identity operator and ห0 is the zero operator.
If instead the interaction is symmetric so that ห๐ป =ร๐ฟ
๐=1๐ถห๐+ร๐ฟโ1
๐=1 (๐ดห๐๐ตห๐+1+๐ตห๐๐ดห๐+1), then the MPO matrices (๐ห๐ ๐โ๐ ๐ฆ ๐) are given by,
ห
๐๐ ๐โ๐ ๐ฆ ๐[1] =
๐ถห ๐ดห ๐ตห ๐ผห
, ห
๐๐ ๐โ๐ ๐ฆ ๐[๐ฟ] = ห
๐ผ ๐ตห ๐ดห ๐ถห ๐
,
ห
๐๐ ๐โ๐ ๐ฆ ๐[๐] =
ยฉ
ยญ
ยญ
ยญ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0 ห0 ห
๐ต ห0 ห0 ห0 ๐ดห ห0 ห0 ห0 ๐ถห ๐ดห ๐ตห ๐ผห
ยช
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฌ
. (4.6)
In general, for an exact MPO representation of a Hamiltonian ห๐ป, the required bond dimension of the MPO matrices is๐ท =2+๐ยท๐, where๐ is the maximum distance over which interactions occur and๐is the number of unique operators that act โfirstโ
in the interactions. This is reflected in Eq. (4.5) where ๐ = 1 and ๐ = 1, and in Eq. (4.6) where๐ =1 and๐ =2. To understand these patterns, as well as the form of the MPO matrices in this section, we recommend Ref. [46].
Exponentially decaying interactions
One important exception to the above result is the MPO representation of a Hamil- tonian which has long-range interactions that decay exponentially, such as ห๐ป = ร
๐๐ถห๐+ร
๐ < ๐๐โ๐(๐โ๐)๐ดห๐๐ตห๐. Here we have introduced a second index ๐ which runs from๐ +1 to ๐ฟ. Despite the fact that๐ = ๐ฟ in this case, the Hamiltonian has an exact, compact representation with๐ท =3 MPO matrices(๐ห๐๐ฅ ๐) of the form,
ห
๐๐๐ฅ ๐[1] =
๐ถห ๐ดห ๐ผห
, ๐ห๐๐ฅ ๐[๐ฟ] =
๐ผห ๐โ๐๐ตห ๐ถห ๐
, ห
๐๐๐ฅ ๐[๐] =ยฉ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0
๐โ๐๐ตห ๐โ๐๐ผห ห0 ๐ถห ๐ดห ๐ผห
ยช
ยฎ
ยฎ
ยฌ
. (4.7)
Refs. [45, 107, 108, 117] provide insight into why this is possible for the unique case of exponential interactions.
A special case of this representation, which will prove useful in later sections, is when๐ =0. The Hamiltonian then has long-range interactions between every pair of sites but the strength of the interactions are all the same, ห๐ป =ร
๐๐ถห๐+ร
๐ < ๐ ๐ดห๐๐ตห๐. We will denote this special case with its own MPO notation: ห๐๐ข๐๐ ๐ ๐๐ ๐.
Much like before, if the interactions are symmetric so that ห๐ป =ร
๐๐ถห๐+ร
๐โ ๐๐โ๐|๐โ๐|๐ดห๐๐ตห๐ (where now both๐, ๐ โ {1, ..., ๐ฟ}), the MPO matrices become,
ห
๐๐๐ฅ ๐โ๐ ๐ฆ ๐[1] =
๐ถห ๐ดห ๐ตห ๐ผห
, ห
๐๐๐ฅ ๐โ๐ ๐ฆ ๐[๐ฟ] =
๐ผห ๐โ๐๐ตห ๐โ๐๐ดห ๐ถห ๐
,
ห
๐๐๐ฅ ๐โ๐ ๐ฆ ๐[๐] =
ยฉ
ยญ
ยญ
ยญ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0 ห0
๐โ๐๐ตห ๐โ๐๐ผห ห0 ห0 ๐โ๐๐ดห ห0 ๐โ๐๐ผห ห0
ห
๐ถ ๐ดห ๐ตห ๐ผห ยช
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฌ
. (4.8)
Again, we will give the special case of๐=0 its own notation, ห๐๐ข๐๐ ๐ ๐๐ ๐โ๐ ๐ฆ ๐, which will prove useful in the coming sections.
General two-body long-range interactions
As mentioned previously, exact MPO representations of Hamiltonians with general long-range interaction coefficients ห๐ป๐ ๐๐ = ร
๐๐ถห๐ + ร
๐ < ๐๐๐ ๐๐ดห๐๐ตห๐ require a bond dimension which is proportional to๐ฟ [45]. However, if๐๐ ๐ is a smoothly decaying function of the distance between two sites, ๐๐ ๐ = ๐(๐ โ ๐), then highly accurate approximate MPO representations of ห๐ป๐ ๐๐ can often be found which have finite, constant bond dimensions. The traditional technique is to fit ๐(๐ โ๐) by a sum of exponentials [107, 108],
๐(๐โ๐) =
๐พ
โ๏ธ
๐=1
๐๐๐โ๐๐(๐โ๐). (4.9) This yields an MPO representation of ห๐ป๐ ๐๐ with bond dimension๐พ +2, where the MPO matrices take the form,
ห
๐๐พโ๐๐ฅ ๐[1] =
๐ถห ๐1๐ดห ๐2๐ดห ยท ยท ยท ๐๐พ๐ดห ๐ผห
, ห
๐๐พโ๐๐ฅ ๐[๐ฟ] =
๐ผห ๐โ๐1๐ตห ๐โ๐2๐ตห ยท ยท ยท ๐โ๐๐พ๐ตห ๐ถห ๐
,
ห
๐๐พโ๐๐ฅ ๐[๐] =
ยฉ
ยญ
ยญ
ยญ
ยญ
ยญ
ยญ
ยญ
ยญ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0 ยท ยท ยท ห0 ห0
๐โ๐1๐ตห ๐โ๐1๐ผห ห0 ยท ยท ยท ห0 ห0 ๐โ๐2๐ตห ห0 ๐โ๐2๐ผห ยท ยท ยท ห0 ห0
.. .
.. .
.. .
..
. ห0 ห0 ๐โ๐๐พ๐ตห ห0 ห0 ยท ยท ยท ๐โ๐๐พ๐ผห ห0
ห
๐ถ ๐1๐ดห ๐2๐ดห ยท ยท ยท ๐๐พ๐ดห ๐ผห ยช
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฌ
. (4.10)
The accuracy of the representation{๐ห๐พโ๐๐ฅ ๐}is determined by the quality of the fit in Eq. (4.9).
Although this is often a reasonably accurate approach, several more sophisticated techniques have been developed in recent years which are based on the singular value decomposition (SVD) of blocks of๐๐ ๐ [94, 97]. These methods also work most effectively when ๐๐ ๐ is a smooth function of the distance, but they are able to fit more general functions ๐ that may be challenging to represent directly with exponentials like those in Eq. (4.9) [94]. They also can be a bit more efficient, producing a higher accuracy representation of ห๐ป๐ ๐๐ with a smaller bond dimension than Eq. (4.10) [97].
In this work, we utilize the technique described in Ref. [97]. The basic idea is that
(a)
(b)
Figure 4.2: A set of tensor network diagrams that represent the elements of the operator-valued MPO matrix ห๐๐ ๐๐[๐](a), along with the additional elements needed for ห๐๐ ๐๐โ๐ ๐ฆ ๐[๐] (b). Here we assume ห๐๐ ๐๐[๐] is a (2+๐๐) ร (2+๐๐) matrix and
ห
๐๐ ๐๐โ๐ ๐ฆ ๐[๐] is a(2+2๐๐) ร (2+2๐๐) matrix. We use the symbol โ1โ to denote the first value of a given index,๐ to denote the final value of a given index,๐to denote the set of values ranging from 2 to๐๐+1,๐to denote the set of values ranging from ๐๐+2 to 2๐๐+1,๐โฒto denote the set of values ranging from 2 to๐๐+1, and๐โฒto denote the set of values ranging from ๐๐ +2 to 2๐๐ +1. This index labelling corresponds directly to the expressions in Equations (4.11) and (4.12).
the MPO matrices ห๐๐ ๐๐for the general Hamiltonian ห๐ป๐ ๐๐ can be written as,
ห
๐๐ ๐๐[๐] =ยฉ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0
(๐ฃ๐)๐๐ตห (๐๐)๐ ๐โฒ๐ผห ห0 ๐ถห (๐ค๐)๐โฒ๐ดห ๐ผห
ยช
ยฎ
ยฎ
ยฌ
, (4.11)
where ยฎ๐ฃ๐ is a column vector of coefficients that has length๐๐ and is indexed by ๐, ๐๐is an๐๐ร๐๐matrix of coefficients indexed by๐and๐โฒ, and ๐คยฎ๐ is a row vector of coefficients that has length๐๐and is indexed by๐โฒ, yielding a(2+๐๐) ร (2+๐๐)MPO matrix. We write the indexed elements ofยฎ๐ฃ๐,๐คยฎ๐, and๐๐in Eq. (4.11) to remind the reader of the shape of these quantities. For clarity, tensor network diagrams for this matrix are given in Fig. 4.2(a). If the coefficients contained in ห๐๐ ๐๐[๐]can be, to a good approximation, related to the coefficients contained in ห๐๐ ๐๐[๐+1] by a linear transformation, then the MPO matrices for each site can be successively generated
by finding the correct linear transformation on the coefficients contained in the MPO matrix on the previous site. These linear transformations can be found by taking SVDs of certain blocks of the upper triangle of๐๐ ๐. It is observed in [97] that if๐๐ ๐ is a smooth function of the distance |๐ โ๐|, the transformations are often compact (i.e. their dimensions do not scale with ๐ฟ) and highly accurate because sub-blocks of the upper triangle of๐๐ ๐ are low-rank. These ideas are developed in full detail in the supplementary information of Ref. [97] 1.
The form of this MPO matrix can be viewed as a direct generalization of ห๐๐ข๐๐ ๐ ๐๐ ๐. The โcoefficientsโ in adjacent ห๐๐ข๐๐ ๐ ๐๐ ๐ matrices can be related to each other via the simplest possible linear transformation (๐1ร1 = 1, ๐ฃยฎ = ๐คยฎ = 1) because all the interactions are of identical strength and thus all sub-blocks of๐๐ ๐ are rank 1.
However, when the interaction coefficients vary with distance and the sub-blocks of the upper triangle of๐๐ ๐ are rank-๐, the single ห๐ผ in the center of ห๐๐ข๐๐ ๐ ๐๐ ๐ gets generalized to an ๐ ร ๐ block ๐๐ร๐๐ผหin ห๐๐ ๐๐. By extension, ยฎ๐ฃ and ๐คยฎ undergo the same generalization. The MPOs ห๐๐๐ฅ ๐ and ห๐๐พโ๐๐ฅ ๐ are special, simple cases of this generalization.
As a final note, if the interactions in the general Hamiltonian ห๐ป๐ ๐๐become symmetric so that ห๐ป =ร
๐๐ถห๐ +ร
๐โ ๐๐๐ ๐๐ดห๐๐ตห๐ =ร
๐๐ถห๐+ร
๐ < ๐๐๐ ๐๐ดห๐๐ตห๐ +ร
๐ <๐๐๐ ๐๐ตห๐๐ดห๐, then the general MPO matrices become,
ห
๐๐ ๐๐โ๐ ๐ฆ ๐[๐] =
ยฉ
ยญ
ยญ
ยญ
ยญ
ยญ
ยซ ห
๐ผ ห0 ห0 ห0
(๐ฃ๐)๐๐ตห (๐๐)๐ ๐โฒ๐ผห ห0 ห0 (๐ฃโฒ
๐)๐๐ดห ห0 (๐โฒ
๐)๐ ๐โฒ๐ผห ห0 ๐ถห (๐ค๐)๐โฒ๐ดห (๐คโฒ
๐)๐โฒ๐ตห ๐ผห ยช
ยฎ
ยฎ
ยฎ
ยฎ
ยฎ
ยฌ
. (4.12)
Tensor network diagrams representing this matrix are given in Fig. 4.2. Here we have introduced the additional indices ๐ and ๐โฒ to index the new vectors ๐ฃยฎโฒ๐, ๐คยฎโฒ๐ and the new matrix๐โฒ
๐, as described in Fig. 4.2. If we have the additional property that interaction coefficients themselves are symmetric, ๐๐ ๐ = ๐๐ ๐, then the above expression can be simplified according to๐ฃยฎโฒ๐=๐ฃยฎ๐,๐คยฎโฒ๐ =๐คยฎ๐, ๐โฒ
๐ = ๐๐.