• Tidak ada hasil yang ditemukan

Model and Definitions

Dalam dokumen Learning-Augmented Control and Decision-Making (Halaman 186-191)

LEARNING POWER SYSTEM PARAMETERS FROM LINEAR MEASUREMENTS

6.2 Model and Definitions

show two applications of Theorem 6.4.1 for the uniform sampling of trees and the ErdΕ‘s-RΓ©nyi(𝑛, 𝑝)model in Corollary 6.4.1 and 6.4.2, respectively.

3. (Heuristic) Algorithm: Motivated by the three-stage recovery scheme, a heuris- tic algorithm with polynomial (in𝑛) running-time is reported in Section 6.5, together with simulation results for power system test cases validating its performance in Section 6.6.

Some comments about the above results are as follows:

Outline of This Chapter

The remaining content is organized as follows. In Section 6.2, we specify our models.

In Section 6.3, we present the converse result as fundamental limits for recovery. The achievability is provided in 6.3. We present our main result as the worst-case sample complexity for Gaussian IID measurements in Section 6.4. A heuristic algorithm together with simulation results are reported in Sections 6.5 and 6.6.

Graphical Model

Denote by V = {1, . . . , 𝑛} a set of 𝑛 nodes and consider an undirected graph 𝐺 = (V,E)(with no self-loops) whose edge setE βŠ† V Γ— V contains the desired topology information. The degree of each node 𝑗 is denoted by𝑑𝑗. The connectivity between the nodes is unknown and our goal is to determine it by learning the associatedgraph matrixusing linear measurements.

Definition 6.2.1(Graph matrix). Provided with an underlying graph𝐺 =(V,E), a symmetricmatrixY(𝐺) ∈S𝑛×𝑛is called agraph matrixif the following conditions hold:

π‘Œπ‘–, 𝑗(𝐺) =







ο£²







ο£³

β‰ 0 if𝑖≠ 𝑗 and(𝑖, 𝑗) ∈ E 0 if𝑖≠ 𝑗 and(𝑖, 𝑗) βˆ‰E arbitrary otherwise

.

Remark 9. Our theorems can be generalized to recover a broader class of symmetric matrices, as long as the matrix to be recovered satisfies (1) KnowingY(𝐺) ∈F𝑛×𝑛 gives the full knowledge of the topology of𝐺; (2) The number of non-zero entries in a column ofY(𝐺)has the same order as the degree of the corresponding node, i.e., |supp(π‘Œπ‘—) | =𝑂(𝑑𝑗). for all 𝑗 ∈ V. To have a clear presentation, we consider specifically the case|supp(π‘Œπ‘—) | =𝑑𝑗.

In this work, we employ a probabilistic model and assume that the graph𝐺 is chosen randomly from acandidacy setC(𝑛)(with𝑛nodes), according to some distribution G𝑛. Both the candidacy setC(𝑛) and distributionG𝑛are not known to the estimator.

For simplicity, we often omit the subscripts ofC(𝑛)andG𝑛.

Example7. We exemplify some possible choices of the candidacy set and distribution:

(a) (Mesh Network) When 𝐺 represents a transmission (mesh) power network and no prior information is available, the corresponding candidacy setG(𝑛) consisting of all graphs with𝑛nodes and𝐺 is selected uniformly at random fromG(𝑛). Moreover, |G(𝑛) |=2(𝑛2) in this case.

(b) (Radial Network) When𝐺represents a distribution (radial) power network and no other prior information is available, then the corresponding candidacy set T(𝑛) is a set containing all spanning trees of the complete graph with𝑛buses (nodes) and𝐺 is selected uniformly at random fromT(𝑛); the cardinality is

|T(𝑛) | =π‘›π‘›βˆ’2by Cayley’s formula.

(c) (Radial Network with Prior Information) When 𝐺 = (V,E) represents a distribution (radial) power network, and we further know that some of the buses cannot be connected (which may be inferred from locational/geographical information), then the corresponding candidacy setT𝐻(𝑛)is a set of spanning trees of a sub-graph𝐻 = (V,E𝐻) with𝑛buses. An edge𝑒 βˆ‰E𝐻 if and only if we knowπ‘’βˆ‰E. The size ofT𝐻(𝑛)is given by Kirchhoff’s matrix tree theorem (c.f. [192]).

(d) (ErdΕ‘s-RΓ©nyi (𝑛, 𝑝) model) In a more general setting, 𝐺 can be a random graph chosen from an ensemble of graphs according to a certain distribution.

When a graph𝐺is sampled according to the ErdΕ‘s-RΓ©nyi (𝑛, 𝑝)model, each edge of𝐺 is connected IID with probability𝑝. We denote the corresponding graph distribution for this case byGER(𝑛, 𝑝).

The next section is devoted to describing available measurements.

Linear System of Measurements

Suppose the measurements are sampled discretely and indexed by the elements of the set{1, . . . , π‘š}. As a general framework, the measurements are collected in two matricesAandBand defined as follows.

Definition 6.2.2(Generator and measurement matrices). Letπ‘šbe an integer with 1≀ π‘š ≀ 𝑛. Thegenerator matrixBis anπ‘šΓ—π‘›randommatrix and themeasurement matrixAis anπ‘šΓ—π‘›matrix with entries selected fromFthat satisfy the linear system (6.1):

A=BY(𝐺) +Z

whereY(𝐺) ∈S𝑛×𝑛is a graph matrix to be recovered, with an underlying graph𝐺 andZ∈Fπ‘šΓ—π‘›denotes the randomadditive noise. We call the recoverynoiselessif Z=0. Our goal is to resolve the matrixY(𝐺)based on given matricesAandB.

In the remaining contexts, we sometime simplify the matrixY(𝐺) asYif there is no confusion.

Applications to Electrical Grids

Various applications fall into the framework in (6.1). Here we present two examples of the graph identification problem in power systems. The measurements are modeled as time series data obtained via nodal sensors at each node, e.g., PMUs, smart switches, or smart meters.

Example1: Nodal Current and Voltage Measurements

We assume data is obtained from a short time interval over which the unknown parameters in the network aretime-invariant. Y∈C𝑛×𝑛denotes thenodal admittance matrixof the network and is defined

π‘Œπ‘–, 𝑗 :=





ο£²



ο£³

βˆ’π‘¦π‘–, 𝑗 if𝑖 β‰  𝑗 𝑦𝑖+Í

π‘˜β‰ π‘–π‘¦π‘–, π‘˜ if𝑖 = 𝑗

(6.5) where 𝑦𝑖, 𝑗 ∈Cis the admittance of line(𝑖, 𝑗) ∈ E and 𝑦𝑖 is the self-admittance of bus𝑖. Note that if two buses are not connected thenπ‘Œπ‘–, 𝑗 =0.

The corresponding generator and measurement matrices are formed by simultaneously measuring both current (or equivalently, power injection) and voltage at each node and at each time step. For each𝑑 =1, . . . , π‘š, the nodal current injection is collected in an 𝑛-dimensional random vector𝐼𝑑 =(𝐼𝑑 ,1, . . . , 𝐼𝑑 ,𝑛). Concatenating the𝐼𝑑 into a matrix we getI:= [𝐼1, 𝐼2, . . . , πΌπ‘š]⊀ ∈Cπ‘šΓ—π‘›. The generator matrixV:=[𝑉1, 𝑉2, . . . , π‘‰π‘š]⊀ ∈ Cπ‘šΓ—π‘›is constructed analogously. Each pair of measurement vectors(𝐼𝑑, 𝑉𝑑)fromI andVmust satisfy Kirchhoff’s and Ohm’s laws,

𝐼𝑑 =Y𝑉𝑑, 𝑑 =1, . . . , π‘š . (6.6) In matrix notation, (6.6) is equivalent toI=VY, which is a noiseless version of the linear system defined in (6.1).

Compared with only obtaining one of the current, power injection or voltage measurements (for example, as in [147, 178, 179]), collecting simultaneous current- voltage pairs doubles the amount of data to be acquired and stored. There are benefits however. First, exploiting the physical law relating voltage and current not only enables us to identify the topology of a power network but also recover the parameters of the admittance matrix. Furthermore, dual-type measurements significantly reduce the sample complexity for learning the graph, compared with the results for single-type measurements.

Example2: Nodal Power Injection and Phase Angles

Similar to the previous example, at each time𝑑 =1, . . . , π‘š, denote by𝑃𝑑 , 𝑗 andπœƒπ‘‘ , 𝑗 the active nodal power injection and the phase of voltage at node 𝑗, respectively. The matricesP ∈Rπ‘šΓ—π‘›andπœƒπœƒπœƒ ∈Rπ‘šΓ—π‘›are constructed in a similar way by concatenating the vectors𝑃𝑑 = (𝑃𝑑 ,1, . . . , 𝑃𝑑 ,𝑛)andπœƒπ‘‘ = (πœƒπ‘‘ ,1, . . . , πœƒπ‘‘ ,𝑛). The matrix representation

of the DC power flow model can be expressed as a linear system P = πœƒπœƒπœƒCSC⊀, which belongs to the general class represented in (6.1). Here, the diagonal matrix S ∈ R|E |Γ—|E | is the susceptence matrix whose 𝑒-th diagonal entry represents the susceptence on the𝑒-th edge inEandC∈ {βˆ’1,0,1}𝑛×|E |is the node-to-link incidence matrix of the graph. The vertex-edge incidence matrix3C∈ {βˆ’1,0,1}𝑛×|E |is defined as

𝐢𝑗 ,𝑒 :=







ο£²







ο£³

1, if bus 𝑗 is the source of𝑒

βˆ’1, if bus 𝑗 is the target of 𝑒 0, otherwise

.

Note thatCSC⊀specifies both the network topology and the susceptences of power lines.

Probability of Error as the Recovery Metric

We define the error criteria considered in this chapter. We refer to finding the edge set E of𝐺via matricesAandBas thetopology identification problemand recovering the graph matrixYvia matricesAandBas theparameter reconstruction problem.

Definition 6.2.3. Let 𝑓 be a function or algorithm that returns an estimated graph matrixX = 𝑓(A,B) given inputsA andB. The probability of error for topology identificationπœ€T is defined to be the probability that the estimated edge set is not equal to the correct edge set:

πœ€T :=P βˆƒπ‘–β‰  𝑗

sign(𝑋𝑖, 𝑗) β‰ sign π‘Œπ‘–, 𝑗(𝐺) (6.7) where the probability is taken over the randomness in𝐺 ,BandZ. Theprobability of error for parameter reconstructionπœ€P(πœ‚) is defined to be the probability that the Frobenius norm of the difference between the estimate Xand the original graph matrixY(𝐺) is larger thanπœ‚ > 0:

πœ€P(πœ‚) := sup

Y∈Y(𝐺)

P(||Xβˆ’Y(𝐺) ||F > πœ‚) (6.8) where || Β· ||F denotes the Frobenius norm, πœ‚ > 0 andY(𝐺) is the set of all graph matrices π‘Œ(𝐺) that satisfy Definition 6.2.1 for the underlying graph 𝐺, and the probability is taken over the randomness in𝐺,B and Z. Note that for noiseless parameter reconstruction, i.e.,Z = 0, we always consider exact recovery and set πœ‚=0 and abbreviate the probability of error asπœ€P.

3Although the underlying network is a directed graph, when considering the fundamental limit for topology identification, we still refer to the recovery of an undirected graph𝐺.

Dalam dokumen Learning-Augmented Control and Decision-Making (Halaman 186-191)