LEARNING POWER SYSTEM PARAMETERS FROM LINEAR MEASUREMENTS
6.2 Model and Definitions
show two applications of Theorem 6.4.1 for the uniform sampling of trees and the ErdΕs-RΓ©nyi(π, π)model in Corollary 6.4.1 and 6.4.2, respectively.
3. (Heuristic) Algorithm: Motivated by the three-stage recovery scheme, a heuris- tic algorithm with polynomial (inπ) running-time is reported in Section 6.5, together with simulation results for power system test cases validating its performance in Section 6.6.
Some comments about the above results are as follows:
Outline of This Chapter
The remaining content is organized as follows. In Section 6.2, we specify our models.
In Section 6.3, we present the converse result as fundamental limits for recovery. The achievability is provided in 6.3. We present our main result as the worst-case sample complexity for Gaussian IID measurements in Section 6.4. A heuristic algorithm together with simulation results are reported in Sections 6.5 and 6.6.
Graphical Model
Denote by V = {1, . . . , π} a set of π nodes and consider an undirected graph πΊ = (V,E)(with no self-loops) whose edge setE β V Γ V contains the desired topology information. The degree of each node π is denoted byππ. The connectivity between the nodes is unknown and our goal is to determine it by learning the associatedgraph matrixusing linear measurements.
Definition 6.2.1(Graph matrix). Provided with an underlying graphπΊ =(V,E), a symmetricmatrixY(πΊ) βSπΓπis called agraph matrixif the following conditions hold:
ππ, π(πΊ) =



ο£²



ο£³
β 0 ifπβ π and(π, π) β E 0 ifπβ π and(π, π) βE arbitrary otherwise
.
Remark 9. Our theorems can be generalized to recover a broader class of symmetric matrices, as long as the matrix to be recovered satisfies (1) KnowingY(πΊ) βFπΓπ gives the full knowledge of the topology ofπΊ; (2) The number of non-zero entries in a column ofY(πΊ)has the same order as the degree of the corresponding node, i.e., |supp(ππ) | =π(ππ). for all π β V. To have a clear presentation, we consider specifically the case|supp(ππ) | =ππ.
In this work, we employ a probabilistic model and assume that the graphπΊ is chosen randomly from acandidacy setC(π)(withπnodes), according to some distribution Gπ. Both the candidacy setC(π) and distributionGπare not known to the estimator.
For simplicity, we often omit the subscripts ofC(π)andGπ.
Example7. We exemplify some possible choices of the candidacy set and distribution:
(a) (Mesh Network) When πΊ represents a transmission (mesh) power network and no prior information is available, the corresponding candidacy setG(π) consisting of all graphs withπnodes andπΊ is selected uniformly at random fromG(π). Moreover, |G(π) |=2(π2) in this case.
(b) (Radial Network) WhenπΊrepresents a distribution (radial) power network and no other prior information is available, then the corresponding candidacy set T(π) is a set containing all spanning trees of the complete graph withπbuses (nodes) andπΊ is selected uniformly at random fromT(π); the cardinality is
|T(π) | =ππβ2by Cayleyβs formula.
(c) (Radial Network with Prior Information) When πΊ = (V,E) represents a distribution (radial) power network, and we further know that some of the buses cannot be connected (which may be inferred from locational/geographical information), then the corresponding candidacy setTπ»(π)is a set of spanning trees of a sub-graphπ» = (V,Eπ») withπbuses. An edgeπ βEπ» if and only if we knowπβE. The size ofTπ»(π)is given by Kirchhoffβs matrix tree theorem (c.f. [192]).
(d) (ErdΕs-RΓ©nyi (π, π) model) In a more general setting, πΊ can be a random graph chosen from an ensemble of graphs according to a certain distribution.
When a graphπΊis sampled according to the ErdΕs-RΓ©nyi (π, π)model, each edge ofπΊ is connected IID with probabilityπ. We denote the corresponding graph distribution for this case byGER(π, π).
The next section is devoted to describing available measurements.
Linear System of Measurements
Suppose the measurements are sampled discretely and indexed by the elements of the set{1, . . . , π}. As a general framework, the measurements are collected in two matricesAandBand defined as follows.
Definition 6.2.2(Generator and measurement matrices). Letπbe an integer with 1β€ π β€ π. Thegenerator matrixBis anπΓπrandommatrix and themeasurement matrixAis anπΓπmatrix with entries selected fromFthat satisfy the linear system (6.1):
A=BY(πΊ) +Z
whereY(πΊ) βSπΓπis a graph matrix to be recovered, with an underlying graphπΊ andZβFπΓπdenotes the randomadditive noise. We call the recoverynoiselessif Z=0. Our goal is to resolve the matrixY(πΊ)based on given matricesAandB.
In the remaining contexts, we sometime simplify the matrixY(πΊ) asYif there is no confusion.
Applications to Electrical Grids
Various applications fall into the framework in (6.1). Here we present two examples of the graph identification problem in power systems. The measurements are modeled as time series data obtained via nodal sensors at each node, e.g., PMUs, smart switches, or smart meters.
Example1: Nodal Current and Voltage Measurements
We assume data is obtained from a short time interval over which the unknown parameters in the network aretime-invariant. YβCπΓπdenotes thenodal admittance matrixof the network and is defined
ππ, π :=


ο£²

ο£³
βπ¦π, π ifπ β π π¦π+Γ
πβ ππ¦π, π ifπ = π
(6.5) where π¦π, π βCis the admittance of line(π, π) β E and π¦π is the self-admittance of busπ. Note that if two buses are not connected thenππ, π =0.
The corresponding generator and measurement matrices are formed by simultaneously measuring both current (or equivalently, power injection) and voltage at each node and at each time step. For eachπ‘ =1, . . . , π, the nodal current injection is collected in an π-dimensional random vectorπΌπ‘ =(πΌπ‘ ,1, . . . , πΌπ‘ ,π). Concatenating theπΌπ‘ into a matrix we getI:= [πΌ1, πΌ2, . . . , πΌπ]β€ βCπΓπ. The generator matrixV:=[π1, π2, . . . , ππ]β€ β CπΓπis constructed analogously. Each pair of measurement vectors(πΌπ‘, ππ‘)fromI andVmust satisfy Kirchhoffβs and Ohmβs laws,
πΌπ‘ =Yππ‘, π‘ =1, . . . , π . (6.6) In matrix notation, (6.6) is equivalent toI=VY, which is a noiseless version of the linear system defined in (6.1).
Compared with only obtaining one of the current, power injection or voltage measurements (for example, as in [147, 178, 179]), collecting simultaneous current- voltage pairs doubles the amount of data to be acquired and stored. There are benefits however. First, exploiting the physical law relating voltage and current not only enables us to identify the topology of a power network but also recover the parameters of the admittance matrix. Furthermore, dual-type measurements significantly reduce the sample complexity for learning the graph, compared with the results for single-type measurements.
Example2: Nodal Power Injection and Phase Angles
Similar to the previous example, at each timeπ‘ =1, . . . , π, denote byππ‘ , π andππ‘ , π the active nodal power injection and the phase of voltage at node π, respectively. The matricesP βRπΓπandπππ βRπΓπare constructed in a similar way by concatenating the vectorsππ‘ = (ππ‘ ,1, . . . , ππ‘ ,π)andππ‘ = (ππ‘ ,1, . . . , ππ‘ ,π). The matrix representation
of the DC power flow model can be expressed as a linear system P = πππCSCβ€, which belongs to the general class represented in (6.1). Here, the diagonal matrix S β R|E |Γ|E | is the susceptence matrix whose π-th diagonal entry represents the susceptence on theπ-th edge inEandCβ {β1,0,1}πΓ|E |is the node-to-link incidence matrix of the graph. The vertex-edge incidence matrix3Cβ {β1,0,1}πΓ|E |is defined as
πΆπ ,π :=



ο£²



ο£³
1, if bus π is the source ofπ
β1, if bus π is the target of π 0, otherwise
.
Note thatCSCβ€specifies both the network topology and the susceptences of power lines.
Probability of Error as the Recovery Metric
We define the error criteria considered in this chapter. We refer to finding the edge set E ofπΊvia matricesAandBas thetopology identification problemand recovering the graph matrixYvia matricesAandBas theparameter reconstruction problem.
Definition 6.2.3. Let π be a function or algorithm that returns an estimated graph matrixX = π(A,B) given inputsA andB. The probability of error for topology identificationπT is defined to be the probability that the estimated edge set is not equal to the correct edge set:
πT :=P βπβ π
sign(ππ, π) β sign ππ, π(πΊ) (6.7) where the probability is taken over the randomness inπΊ ,BandZ. Theprobability of error for parameter reconstructionπP(π) is defined to be the probability that the Frobenius norm of the difference between the estimate Xand the original graph matrixY(πΊ) is larger thanπ > 0:
πP(π) := sup
YβY(πΊ)
P(||XβY(πΊ) ||F > π) (6.8) where || Β· ||F denotes the Frobenius norm, π > 0 andY(πΊ) is the set of all graph matrices π(πΊ) that satisfy Definition 6.2.1 for the underlying graph πΊ, and the probability is taken over the randomness inπΊ,B and Z. Note that for noiseless parameter reconstruction, i.e.,Z = 0, we always consider exact recovery and set π=0 and abbreviate the probability of error asπP.
3Although the underlying network is a directed graph, when considering the fundamental limit for topology identification, we still refer to the recovery of an undirected graphπΊ.