• Tidak ada hasil yang ditemukan

CSA401 - Parallel Processing - EPrints USM

N/A
N/A
Protected

Academic year: 2024

Membagikan "CSA401 - Parallel Processing - EPrints USM"

Copied!
4
0
0

Teks penuh

(1)

...1/- UNIVERSITI SAINS MALAYSIA

Supplementary Examination Academic Session 1996/97

May 1997

CSA401 - Parallel Processing Duration : [3 hours]

INSTRUCTION TO CANDIDATE:

• Please ensure that this examination paper contains FIVE questions in THREE printed pages before you start the examination.

• Answer ALL questions. If you choose to answer the questions in English, at least one question must be answered in Bahasia Malaysia.

ENGLISH VERSION OF THE QUESTION PAPER

(2)

...2/- [CSA401]

- 2 -

1. (a) Describe Flynn's classification of computer systems. Discuss as to how various types of parallel computer systems fit into the classification.

(8 marks)

(b) State and explain Amdahl's law.

(4 marks)

(c) In the light of Amdhal's law. explain why it is important for multiprocessor systems to have specially designed high-bandwidth disk I/O systems.

(4 marks)

(d) A parallel program is to be executed on a MIMD computer with 100 processors. However, 3% of all instructions during program execution must be carried out sequentially, but the rest can be executed in all processors in parallel. What is the speedup of this program on this computer?

(4 marks)

2. (a) What are the important parameters that characterise the multicomputer topology?

How do these parameters influence the cost and the performance of the multicomputer?

(4 marks)

(b) Discuss the hypercube interconnection topology.

(6 marks)

(c) Just as a 2-D mesh can be enhanced to a Torus topology, 3-D mesh can also be enhanced. What is the connectivity and diameter of such an enhanced 3-D Mesh with n processors? Explain your answer.

(6 marks)

(d) Show that mesh topology can be embedded in hypercube topology.

(4 marks)

3. (a) Explain, with the help of sketches, data driven and demand driven parallel computation model.

(4 marks)

(b) State the rigorous and simplified forms of data dependency rule for anti dependency. Explain with an example as to how the simplified form will affect the parallelisation.

(6 marks)

(3)

...3/- [CSA401]

- 3 -

(c) Carry out the following tasks for the program segment below:

(i) Determine all of the data dependencies with directions.

(ii) Determine all of the dependencies that must be synchronised.

A := B+C FOR I = 2 TO N

D(I) = A * E(I) S = E(I) * 10 T = T + S NEXT I

A = D(N) - 5

(6 marks)

(d) Explain, with examples, the use of 'FORK' and 'JOIN' in parallel programming.

(4 marks)

4. (a) What is a semaphore? What are the operations that can be performed on them?

What are the actions taken by the system when those operations are executed on the semaphores?

(8 marks)

(b) Multiple processes wish to access a shared memory area, some just reading and others writing. While a number of processes can read from a shared memory block simultaneously, only one process at a time may be allowed write access.

During the write-access no other process can be allowed to read from the memory area. Present a solution which allows maximum possible parallelism.

Take care to avoid 'starvation' of the write process.

(8 marks)

(c) Explain, with an example, the use of Monitor construct in process communcation.

(4 marks)

5. (a) Write a complete OCCAM program to sum all the elements of a matrix. Assume that any number of transputers are available.

(6 marks)

(b) Explain with examples the constructs available in CSP language.

(4 marks) (c) Write an MIMD procedure to find the root of a nonlinear equation. Explain the

algorithm used.

(4)

...4/- (10 marks) - oooOooo -

Referensi

Dokumen terkait

The Program Memory and Data Memory have separate buses so that concurrent access can occur and is detailed in this section.. The EEPROM data memory block is detailed in Section

In order to achieve high warp efficiency as well as coalesced access on both reading from and writing to global memory, data is moved in slices that can be logically mapped to a 3D block

s/lm 10/lm iii Give the direct mapping algorithm of cache memory managemenl 10/100 iv Assume the volume of a cache memory is 1O24K words and direct maDDrns rs used.. Two main

lrylxl e Write a complete C program which will do the tbllowing rasks: The program will create a link list structure which store the details of patients.. The program will prompt the

in this part, the previous definition of leadership can be applied to examine fihe role played by Haley political leaders in the four important stages of the political development of

[Anda telah memutuskan untuk mengekspresi satu protein yang tidak diketahui dengan menggunakan teknologi protein rekombinan.] [a] List the organisms that can be used as hosts to

Explain elements in a typical block diagram of a feedback system as shown in Figure 1 Terangkan elemen di dalam gambarajah blok sistem maklumbalas seperti yang ditunjukkan dalam Rajah

Coagulation process involves with the construction of one mechanical rapid mixing tank with G equals 750s-1 and a retention time of 1½ minutes.. In the flocculation process, two tanks