Chapter 1 Introduction
7.2 Problem Formulation
7.2.1 System Model
We first describe the assumptions used in the formulation and define various components used in the formulation. Table 7.1 summarises the notation used in this chapter.
Assumptions: The data centers are over-provisioned with sufficient compute capacity across various locations to accommodate single data center failure. Every data center is aware of the volume of workload being served across all the other data centers.
We now define the input parameters used in the formulation.
Demand: We define λufs as the number of requests mapped from client region u to data center s. Here f = 0 represents no data center failed and f ∈ 1,2..|S|
represents the failure of the fth data center.
Number of active servers: Let the processing rate of the server be µ requests per second. We assume that among ms servers at a data center, mfs are activated after fth data center has failed. A data center is modeled as an M/M/n queue.
The number of servers required to satisfy workload l within the maximum tolerable
7.2 Problem Formulation
Input Symbol Description
Data center
S Set of data center locations s Index of data center
Ms Total number of servers available at data centers
mfs Number of active servers at data centerswhen data centerfhas failed p Percentage of total servers failed at any data center
Client
U Set of client regions u Index of client region
Lu Total number of requests generated from user locationu Pidle Power consumed by server when idle
Ppeak Power consumed by server when running at its peak Server µ Service rate of server (in number of requests per second)
γsf Average utilization of active servers at data centerswhen data centerfhas failed L0 Total workload served by failed data center before failure
L Total workload required to be distributed
Lg Workload required to be served by green energy sources Lr Workload which can be served by any energy source
λufs The number of requests from client regionuserved by data centerswhen data center fhas failed
Demand and
λf Vector formed usingλufs , which denotes the load balancing strategy
workload distri- bution
Φufsi The number of requests from client regionubeing served at data centersbeing served by consuming energy of typeiwhen data centerfhas failed
Φfsi The cumulative workload at data centersby consuming energy of typeiwhen data centerfhas failed
Φfs Vector of Φfsifor data centerswhen data centerfhas failed.
Φf Vector of Φfs, which signifies the total workload distribution among different data centers and energy sources
i Index of energy type
θsi Electricity price per kWh at data centersfor energy of typei θ Vector of different electricity prices across all data centers
Psif Total power of typeiconsumed at data centerswhen data centerfhas failed Energy Θfs The total cost of energy consumed at data centerswhen data centerfhas failed
ρ The minimum green energy usage bound Es PUE at data centers
Psimax Maximum available power of typeiat data centers
Delay
Wsu Propagation delay between data centersand client regionu T Maximum tolerable queuing delay
Table 7.1: Summary of notation used in the chapter
queuing delay T is given by the equation 1
mfsµ−l =T ∀s, f (7.1)
l =f(mfs) = mfsµ− 1
T ∀s, f (7.2)
In Eq. (7.2), the function f(mfs) gives the workload that can be served using mfs servers while meeting the queuing delay constraint. Similarly, we define mfs by
mfs = 1 µ(1
T +l) ∀s, f (7.3)
Server utilization: We define γsf to be the average utilization of active servers at the sth data center after fth data center failed, given by
γsf = P
uλufs µmfs
(7.4) Power consumption model: Let Pidle be the average power drawn in idle condition and Ppeak be the power consumed when server is running at peak utilization. We express the power consumed at a data center s, given workload l, after fth data center failed, as [16]:
Psf = mfs(Pidle + (Es−1)Ppeak) + mfs(Ppeak − Pidle)γsf (7.5)
=c1mfs +c2l (7.6)
where c1 = (Pidle + (Es−1)Ppeak), c2 = (Ppeak µ−Pidle), andEs is the PUE of a data center at s. Substituting mfs and γsf in Eq. (7.6) from Eq. (7.3) and Eq. (7.4) gives
Psf = c1 µ(1
T +l) +c2l
= (c1
µ +c2)l+ c1
T µ (7.7)
Surplus workload: LetL0 be the workload served by a failed data centerf, before the failure. After the failure, letmff be the number of servers available. The surplus
7.2 Problem Formulation
workload violating the queuing delay constraint at failed data center can be obtained as
L=L0−f(mff) ∀f (7.8)
Workload partitioning: We partition the workload into two components: (i) green workload, Lg which is served by servers powered with green energy and (ii) remaining workload, Lr served by other servers. Due to workload conservation rule, the following equation has to be satisfied.
Lg+Lr =L (7.9)
Let ρ denote the minimum fraction of workload to be served by servers powered with green energy. The green energy usage constraint is represented by
Lg ≥ρL (7.10)
In our model, we consider 4 types of energy sources namely, solar, wind, PPA and brown (indexed in the same order). Φufsi denotes the workload from client region u at data center s being served by consuming energy of type i, after fth data center had failed.
The following equation makes sure that all the client demand is served.
4
X
i=1
Φufsi =λufs ∀s, u (7.11) Φfsi denotes the cumulative workload at data center s being served by consuming energy of typei(i.e., Φfsi =P
uΦufsi). Therefore, in order to satisfy the green energy usage we have
X
s 3
X
i=1
Φfsi ≥Lg ∀s (7.12)
Energy cost: Let θsi be the cost of energy of type i consumed at a data center s and θ be the vector of different energy prices across all data centers. The cost of
energy consumed at data center s after fth data center has failed can be expressed as
Θfs =X
i
θsiPsif ∀s, f (7.13) where Psif is the energy of type i is consumed at data center s when data center f has failed.