• Tidak ada hasil yang ditemukan

Lec 1 Introduction Data Structures Algorithm

N/A
N/A
Protected

Academic year: 2018

Membagikan "Lec 1 Introduction Data Structures Algorithm"

Copied!
42
0
0

Teks penuh

(1)
(2)

Data Structures & Algorithm

Algorithm: Outline, the essence of a

computational procedure, step-by-step

instructions

Program: an implementation of an algorithm in

some programming language

Data structure: Organization of data needed to

(3)

History

Name: Persian mathematician Mohammed

al-Khowarizmi, in Latin became Algorismus

First algorithm: Euclidean Algorithm, greatest

common divisor, 400-300 B.C.

19th century – Charles Babbage, Ada Lovelace.

20th century – Alan Turing, Alonzo Church, John

(4)

Algorithmic problem

Infinite number of input instances satisfying the specification. For

example:

A sorted, non-decreasing sequence of natural numbers. The sequence

is of non-zero, finite length:

1, 20, 908, 909, 100000, 1000000000.3.

Specification

of input

?

Specification

of output as

a function of

(5)

Algorithmic Solution

• Algorithm describes actions on the input instance

• Infinitely many correct algorithms for the same algorithmic

problem

Input instance,

adhering to

the

specification

Algorithm

Output

related to

the input as

(6)

What is a Good Algorithm?

Efciency:

Running time

Space used

Efciency as a function of input size:

Number of data elements (numbers, points)

(7)

Measuring the Running Time

How should we measure the running

time of an

algorithm

?

Approach 1: Experimental Study

Write a program that implements the

algorithm

Run the program with data sets of varying

size and composition.

Use a method like System.currentTimeMillis()

to get an accurate measure of the actual running time.

t (ms)

(8)

Limitation Experimental Studies

Experimental studies have several limitations:

It is necessary to

implement

and test

the algorithm in order to determine its

running time.

Experiments can be done only on a

limited set of

inputs

, and may not be

indicative of the running time on

other inputs not included in the

experiment.

In order to compare two algorithms,

(9)

Beyond Experimental Studies

We will now develop a general methodology for

analyzing the running time of algorithms. In contrast to the "experimental approach", this methodology:

Uses a

high-level description

of the

algorithm instead of testing one of its

implementations.

Takes into account

all possible inputs.

Allows one to evaluate the efciency

of any algorithm in a way that is

independent from the hardware and

(10)

Pseudo-code

• Pseudo-code is a description of an algorithm that is more

structured than usual prose but less formal than a programming language.

Example: finding the maximum element of an array.

Algorithm arrayMax(A, n):

Input: An array A storing n integers. Output: The maximum element in A.

currentMax  A[0] for i 1 to n -1 do

if currentMax < A[i] then currentMax  A[i] return currentMax

Pseudo-code is our preferred notation for describing

algorithms.

(11)

What is Pseudo-code?

• A mixture of natural language and high-level programming

concepts that describes the main ideas behind a generic implementation of a data structure or algorithm.

Expressions:

use standard mathematical symbols to describe numeric and

boolean expressions

use for assignment (“=” in Java)

use = for the equality relationship (“==” in Java)

Method Declarations:

(12)

What is Pseudo-code?

• Programming Constructs:

• decision structures:if ... then ... [else ] • while-loops: while ... Do

repeat-loops: repeat ... until … • for-loop: for ... Do

array indexing: A[i]

Functions/Method

calls: object method(args)

(13)

Analysis of Algorithms

Primitive Operations: Low-level computations

independent from the programming language can be identified in pseudocode.

• Examples:

• calling a function/method and returning from a

function/method

• arithmetic operations (e.g. addition) • comparing two numbers

• indexing into an array, etc.

• By inspecting the pseudo-code, we can count the

(14)

Sort

Example: Sorting

INPUT

sequence of numbers

a

1

, a

2

, a

3

,….,a

n

b

1

,b

2

,b

3

,….,b

n

OUTPUT

a permutation of the sequence of numbers

2 5 4 10 7 2 4 5 7 10

Correctness

For any given input the algorithm halts with the output:

• b1 < b2 < b3 < …. < bn • b1, b2, b3, …., bn is a

permutation of a1, a2, a3,….,an

Correctness

For any given input the algorithm halts with the output:

• b1 < b2 < b3 < …. < bn how (partially) sorted

they are

algorithm

Running time

Depends on

number of elements (n) how (partially) sorted

they are

(15)

Insertion Sort

Start “empty handed” Insert a card in the right

position of the already sorted

hand

Continue until all cards are inserted/sorted

Strategy

Start “empty handed” Insert a card in the right

position of the already sorted

hand

Continue until all cards are

inserted/sorted

for j=2 to length(A) do key=A[j]

“insert A[j] into the sorted sequence A[1..j-1]”

i=j-1 sorted sequence A[1..j-1]”

i=j-1

while i>0 and A[i]>key do A[i+1]=A[i]

(16)

Analysis of Insertion Sort

for j=2 to length(A)

do key=A[j]

“insert A[j] into the sorted sequence A[1..j-1]”

i=j-1

function of the input size

(17)

Best/Worst/Average Case

Best case: elements already sorted ® tj=1,

running time = f(n), i.e., linear time.

Worst case: elements are sorted in inverse

order

® tj=j, running time = f(n2), i.e., quadratic time

Average case: tj=j/2, running time = f(n2), i.e.,

quadratic time

Total Time= n(c1+c2+c3+c7)+

(c4+c5+c6)-(c2+c3+c5+c6+c7)

2

n j jt

(18)

Best/Worst/Average Case (2)

• For a specific size of input n, investigate running times for

different input instances:

(19)

Best/Worst/Average Case (3)

For inputs of all sizes:

1n 2n 3n 4n 5n 6n

R

un

ni

ng

ti

m

e

best-case

(20)

Best/Worst/Average Case (4)

Worst case is usually used:

• It is an upper-bound and in certain application

domains (e.g., air trafc control, surgery) knowing the worst-case time complexity is of crucial importance

For some algorithms worst case occurs fairly

often

The average case is often as bad as the

worst case

• Finding the average case can be very

(21)

Asymptotic Notation

Goal: to simplify analysis by getting rid of unneeded

information (like “rounding” 1,000,001≈1,000,000)

We want to say in a formal way 3n2 ≈ n2

The “Big-Oh” Notation:

given functions f(n) and g(n), we say that f(n) is O(g(n))

(22)

Asymptotic Notation

• The “big-Oh” O-Notation

asymptotic upper bound

(23)

Example

g

(

n

)

=

n

c g

(

n

)

=

4

n

f

(

n

)

=

2

n +

6

For functions

f(n)

and

g(n)

(to the right)

there are positive

constants

c

and

n

0

such that:

f(n)≤

c g(n)

for

n

n

0

(24)

Another Example

On the other hand…

n

2

is not O(

n

) because there is

no

c

and

n

0

such that:

n

2

cn

for

n

n

0

(As the graph to the right

illustrates, no matter how large

a

c

is chosen there is an

n

big

enough that

n

2

>

cn

)

(25)

Asymptotic Notation (cont.)

Note

: Even though it is

correct

to say

“7n - 3

is O(n

3

)”, a better statement is “7n - 3 is O(n)”,

that is, one should make the approximation as

tight as possible (for all values of n).

Simple Rule

: Drop lower order terms and

constant factors

(26)

Asymptotic Analysis of The Running

Time

Use the Big-Oh notation to express the number of

primitive operations executed as a function of the

input size.

For example, we say that the

arrayMax

algorithm

runs in

O

(n) time.

Comparing the asymptotic running time

-an algorithm that runs in O(n) time is better than one that runs in

O(n2) time

-similarly, O(log n) is better than O(n)

-hierarchy of functions: log n << n << n2 << n3 << 2n

Caution!

Beware of very large constant factors. An

algorithm running in time 1,000,000 n is still

O

(n) but

might be less efcient on your data set than one

(27)

Example of Asymptotic Analysis

An algorithm for computing prefix averages

Algorithm prefixAverages1(X):

Input: An n-element array X of numbers.

Output: An n -element array A of numbers such that A[i] is the average of elements X[0], ... , X[i].

Let A be an array of n numbers. for i 0 to n - 1 do

a  0

for j  0 to i do a  a + X[j] A[i]  a/(i+ 1)

return array A

1 step

i iterations

with

i=0,1,2...n-1

(28)

Another Example

A better algorithm for computing prefix

averages:

Algorithm prefixAverages2(X):

Input: An n-element array X of numbers.

Output: An n -element array A of numbers such that A[i] is the average of elements X[0], ... , X[i].

Let A be an array of n numbers. s 0

for i  0 to n do s  s + X[i] A[i]  s/(i+ 1) return array A

(29)

Asymptotic Notation

(terminology)

Special classes of algorithms:

logarithmic: O(log n)

linear: O(n)

quadratic: O(n2)

polynomial: O(nk), k ≥ 1

exponential: O(an), n > 1

“Relatives” of the Big-Oh

(30)
(31)
(32)

• The “big-Omega” W-Notation • asymptotic lower bound

f(n) = W(g(n)) if there exists

constants c and n0, s.t. c g(n) £ f(n) for n ³ n0

• Used to describe best-case running

times or lower bounds of algorithmic problems

• E.g., lower-bound of searching in

an unsorted array is W(n).

(33)

• The “big-Theta” Q-Notation • asymptoticly tight bound

f(n) = Q(g(n)) if there exists

Asymptotic Notation (4)

(34)

Asymptotic Notation (5)

Two more asymptotic notations

"Little-Oh" notation f(n)=o(g(n))

non-tight analogue of Big-Oh

• For every c, there should exist n0 , s.t. f(n) £ c g(n)

for n ³ n0

Used for comparisons of running times. If f(n)=o(g(n)), it is said that g(n) dominates f(n).

"Little-omega" notation f(n)=w(g(n))

(35)

Asymptotic Notation (6)

Analogy with real numbers

f(n) = O(g(n)) @ f £ gf(n) = W(g(n)) @ f ³ g

f(n) = Q(g(n)) @ f = gf(n) = o(g(n)) @ f < gf(n) = w(g(n)) @ f > g

Abuse of notation: f(n) = O(g(n)) actually means

(36)

Comparison of Running Times

Running

Time in

m

s

Maximum problem size (n)

1 second 1 minute 1 hour

400

n

2500

150000

9000000

20

n

log

n

4096

166666

7826087

2

n

2

707

5477

42426

n

4

31

88

244

(37)

Math You Need to Review

Logarithms and Exponents

properties of logarithms:

logb(xy) = logbx + logby logb (x/y) = logbx - logby logbxa = alog

bx

Logba = logxa/logxb

properties of exponentials:

a(b+c) = abac

abc = (ab)c

ab/ac = a(b-c)

b = a log ab

(38)

A Quick Math Review

Geometric progression

• given an integer n0 and a real number 0< a ¹ 1

geometric progressions exhibit exponential growth

• Arithmetic progression

(39)

Summations

The running time of insertion sort is determined

by a nested loop

Nested loops correspond to summations

(40)
(41)

Proof by Induction (2)

(42)

Referensi

Dokumen terkait

Berdasarkan penjelasan diatas, bahwa tugas akhir dengan judul “ Game Adventure Treasure Of Mouse ” penulis dapat menarik beberapa kesimpulan sebagai berikut :..

[r]

Kemampuan tehknik yang saya dapatkan selama bekerja dapat membantu saya didalam menyelesaikan

digunakan sebagai gold standard adalah pemeriksaan pH monitoring, namun pemeriksaan ini masih jauh dari kriteria ideal oleh karena sensitivitasnya dilaporkan hanya 50% sampai

pembelajaran gerak dasar atletik lompat tinggi melalui model inkuiri dan media.. rintangan pada kelas V SDN Buahdua I Kecamatan Buahdua

Berdasarkan hal tersebut di atas maka dengan pemanfaatan Sistem Informasi Geografis (SIG) diketahui tingkat potensi kawasan perumahan dan permukiman terdapat lahan yang berpotensi

Lampiran 1 Data Populasi dan Sampel Penelitian Lampiran 2 Perhitungan Exel Rasio Keuangan Lampiran 3 Peringkat Sukuk. Lampiran 4 Perhitungan SPSS Ordinal

Menurut para misionaris, usaha misi kristenisasi di Loloda ketika itu, nampaknya banyak mengalami kendala, menurut mereka tantangan dan hambatan itu terutama berasal