Multiprocessor Scheduling - A Brief Survey

to satisfy ERfairness, each task T_i should complete at least (^Pⁿ_i=1(e_i/π_i)×t) part of it’s total execution requirement e_i, at any time instant t, subsequent to the start of the task at time S_i. Following the above criterion, ERfair is able to deliver optimal/full resource utilization; hence, any task set is guaranteed to be schedulable if Equation 2.1 is satisfied.

i=1

(e_i/π_i)≤p (2.1)

However both the above schemes attempt to maintain proportional fairness at each time slot and incur unrestricted preemption/migration overheads due to this. Recently, Levin et al. [26] proposed a semi-partitioned approximate proportional fair optimal scheduler called DP-Fair with much lower and bounded context switching overheads. DP-Fair is an optimal two level hierarchical homogeneous multi-resource scheduler. At the outer level, time is partitioned into slices, demarcated by the periods/deadlines of all jobs in the system. At the inner level, within a giventime slice of lengtht time slots (say), each task Ti is allocated a workload equal to its proportional fair share, shri = (ei/πi)∗t and assigned to one or more resources (processor/bus resource) for scheduling. DP-Fair is an optimal algorithm which ensures full resource utilization. Given a set of n tasks to be scheduled on p homogeneous resources, DP-Fair guarantees a feasible schedule if Equation 2.1 is satisfied. Using a combination of techniques including DP-Wrap, mirroring and fairness-oblivious intra-slice execution, DP-Fair is able to ensure atmost n−1 context switches andp−1 migrations per time slice [26]. These facets make DP-Fair a lucrative resource allocation technique which can efficiently combine the twin benefits of high resource utilization and low scheduling related overheads. However, the above schemes can not handle the combined scheduling of tasks and messages as required for Real-Time Cyber-Physical Systems (RT-CPSs) running on distributed systems.

The problem of scheduling PTGs on multiprocessor systems has also received the at- tention of researchers over many decades. Various optimal solution approaches, such as linear programming, Best-first search, and other exhaustive enumeration techniques including model-based formal synthesis mechanisms, have been proposed [27,28]. Prasanna et al. [29] devised a control-theoretic optimal scheduling mechanism for task graphs exe-

cuting on a homogeneous multiprocessor system. Later, they have extended their scheme to include communication overheads between task nodes [30]. Sarad et al. [9] developed a MILP based PTG scheduling strategy for platforms consisting of a set of fully connected homogeneous processing nodes. Kanemitsu et al. [32] presented an ILP based optimal task scheduling scheme for fully-connected heterogeneous distributed systems. Hsiu et al. [33] developed optimal and approximation algorithms for the scheduling of PTGs on heterogeneous distributed platforms with shared buses. However, these schemes do not consider the combined real-time scheduling of tasks and messages as necessary in fully connected or shared-bus based network system.

Although optimal scheduling solutions may potentially deliver significantly better performance, it may be noted that computation of such optimal solutions may often become prohibitively expensive for large problem sizes. Therefore, research in this do- main has also focused towards the design of low-overhead heuristics that provide quick and satisfactory schedules. Heuristic scheduling of PTGs on multiprocessor platforms have often been dealt with list scheduling based techniques. These scheduling strategies typically maintain an ordered priority list of all tasks in the PTG [1, 6, 11–13] and involves two phases, (i) task prioritization: for selecting the highest-priority ready task and (ii) processor selection: for selecting a suitable processor that minimizes execution time. Some examples of this class of techniques include the MCP [34], HLF [35], CPOP [6], HEFT [6], PEFT [1] and HSV [11] algorithms. They attempt to construct a static-schedule for the given PTG to minimize the overall schedule length (makespan) while satisfying resource and precedence constraints. However, none of them are applicable on real-time PTGs, and assume the underlying communication platform to be fully-connected.

Designing energy-efficient schedulers for deadline-constrained PTGs executing on distributed heterogeneous platforms is a computationally hard problem. This is because each task in the PTG consumes distinct amounts of execution time and energy on differ- ent processors [8, 47]. In [48], the authors have presented an algorithm calledDownward Energy Consumption Minimization (DECM) which utilizes the slack available between

the completion instant of the sink node and the deadline of a PTG, in order to minimize processor operating frequencies. Further, they extended DECM to a new strategy namedDownward and Upward Energy Consumption Minimization(DUECM). DUECM improves on DECM by utilizing the slack between adjacent task nodes assigned onto the same processor, while still meeting deadline of the PTG. Huang et al. investi- gated scheduling approaches on processors which are enabled/not-enabled with DVFS, for PTGs with hard real-time constraints running on fully-connected heterogeneous systems [49]. Jian et al. proposed a two-stage algorithm for the energy-aware scheduling of multiple workflows on DVFS enabled heterogeneous computing systems [8]. In the first stage, all tasks in the workflows are scheduled using HEFT by setting all processors at their highest operating voltage. In the next phase, processor operating voltages have been tuned for each task, to save energy. However, this scheme cannot be employed for scheduling workflows with real-time requirements. In [50], the authors have explored the scheduling of multiple deadline-constrained workflows in a cyber-physical cloud environ- ment with the objective of minimizing energy dissipation of the workflows. However, this work [50] is applicable to only aperiodic/sporadic workloads as only one instance of each workflow has been considered in the schedule. It may be noted that none of the above discussed schemes can be employed to minimize the energy dissipation of multiple periodic real-time PTGs executing on a shared bus based heterogeneous distributed platform. We have added Table 2.8 to summarize important related works.

2.5.1 An Overview of HEFT & PEFT

Important concepts related to the heuristic PTG scheduling strategies discussed in Chap- ters 5, 6 and 7 of this thesis are founded on the principles of two classicallist based PTG scheduling algorithms, HEFT and PEFT. We now provide overviews of these two algorithms before concluding this chapter.

Heterogeneous Earliest Finish Time (HEFT) [6] is a list-based heuristic scheduling algorithm for executing PTGs on a fully connected distributed heterogeneous multiprocessor platform, with the objective of minimizing the overallmakespan. It has two major phases, (1) Task Prioritization Phase: In this phase, priorities of tasks are set with the

Existing

work Solution

type Task type Computation resource

type

Data communication

architecture

QoSAdaptive EnergyAware Real-time [19,21,26] Optimal Independent

task set Homogeneous N/A No No Yes

[29] Optimal PTG Homogeneous Data

transmission

overhead ignored No No No [9] Optimal PTG Homogeneous Fully connected,

Routing unaware No No No [32] Optimal PTG Heterogeneous Fully connected,

Routing unaware No No No [33] Optimal,

Heuristic PTG Heterogeneous Shared bus based,

Routing aware No No Yes [1, 6, 11] Heuristic PTG Heterogeneous Fully connected,

Routing unaware No No No [48] Heuristic PTG Heterogeneous Fully connected,

Routing unaware No Yes Yes [8] Heuristic Multiple

PTGs Heterogeneous Fully connected,

Routing unaware No Yes No [50] Heuristic Multiple

PTGs Heterogeneous Fully connected,

Routing unaware No Yes Yes Table 2.8: Summary of related works

upward rank value, rank_u, which is based on mean computation and communication costs. The task list is generated by sorting the tasks in decreasing order of rank_u. The highest priority task is then selected to schedule on the processor. (2) Processor Selec- tion Phase: In this phase, the highest priority task is selected and assigned to the “best”

processor which minimizes the task’s finish time.

Predict Earliest Finish Time (PEFT) [1] is currently one of the most popular state-

of-the-art list-based heuristic algorithms for scheduling PTGs on a fully connected distributed heterogeneous multiprocessor platform. The algorithm is critically pivoted on a function called OCT() which is used to construct a matrix called Optimistic Cost Table (OCT), containing values corresponding to each task-processor pair. This OCT matrix has two important functions: (1) Determination of a rank value for each task based on which a sorted task list is generated during task prioritization phase. This list governs the order in which the tasks are considered for processor assignment, and (2) Determination of the most suitable processor for a task in terms of minimizing the overall makespan of the schedule.

Dalam dokumen Task and Message Co-scheduling Strategies in Real-time Cyber-Physical Systems (Halaman 80-85)