ARTIFICIAL NEURAL NETWORK (JARINGAN SYARAF TIRUAN) Teny Handhayani

(1)

Teny Handhayani

ARTIFICIAL NEURAL NETWORK

(JARINGAN SYARAF TIRUAN)

(2)

LATAR BELAKANG

(3)

DARI NEURON BIOLOGI KE NEURON BUATAN

(4)

(5)

ARTIFICIAL NEURAL NETWORK

• A network of artificial neurons



Characteristics



Nonlinear I/O mapping



Adaptivity



Generalization ability



Fault-tolerance (graceful

degradation)



Biological analogy

(6)

TIPE-TIPE ARTIFICIAL NEURAL NETWORKS

• Single Layer Perceptron

• Multilayer Perceptrons (MLPs)

• Radial-Basis Function Networks (RBFs)

• Hopfield Network

• Boltzmann Machine

• Self-Organization Map (SOM)

(7)

ARSITEKTUR JARINGAN

(8)

FITUR DARI ARTIFICIAL NEURAL NETWORKS

• Records (examples) need to be represented as a (possibly large) set of tuples of

<attribute, value>

• Nilai output direpresentasikan sebagai nilai diskrit, real, atau vektor

• Memiliki toleransi terhadap noise data input

• Time factor

• Membutuhkan waktu yang lama untuk pelatihan

• Sekali melalui pelatihan, ANN mampu memrpoduksi output dengan cepat

(9)

CONTOH APLIKASI

• NETtalk [Sejnowski]

• Inputs: English text

• Output: Spoken phonemes

• Phoneme recognition [Waibel]

• Inputs: wave form features

• Outputs: b, c, d,…

• Robot control [Pomerleau]

• Inputs: perceived features

(10)

APLIKASI:

AUTONOMOUS LAND VEHICLE (ALV)

• NN learns to steer an autonomous vehicle.

• 960 input units, 4 hidden units, 30 output units

• Driving at speeds up to 70 miles per hour

Weight values

for one of the

hidden units

Image of a

forward

-mounted

camera

ALVINN System

(11)

APLIKASI:

ERROR CORRECTION BY A HOPFIELD NETWORK

original

target data

corrupted

input data

Corrected

data after

10 iterations

Corrected

data after

20 iterations

Fully

corrected

data after

35 iterations

(12)

PERCEPTRON

AND

(13)

ARSITEKTUR DARI SEBUAH PERSEPTRON

• Input: a vector of real values

• Output: 1 or -1 (binary)

• Activation function: threshold function

(14)

HYPOTHESIS SPACE OF PERCEPTRONS

• Free parameters: weights (and thresholds)

• Learning: choosing values for the weights

• Hypotheses space of perceptron learning

• n: dimension of the input vector

• Linear function

}

|

{





( 

1 )



w

n

H



n n

x

w

x

w

f

(x

)



₀



₁ ₁







(15)

PERCEPTRON AND DECISION HYPERPLANE

• Perceptron merepresentasikan sebuah ‘hyperplane’ pada

n-dimensional space dari instance (misalnya titik)

• Output perceptron 1 untuk instance yang terletak pada satu

sisi hyperplane dan output -1 untuk instance yang terletak di

sisi lainnya

• Equation for the decision hyperplane: wx = 0.

• Data positif dan negatif tidak dapat dipisahkan dengan

sembarang hyperplane

• Sebuah perceptron tidak dapat melatih permasalahan pada

data linearly nonseparable

(16)

LINEARLY SEPARABLE

V.S.

LINEARLY

NONSEPARABLE

(a) Decision surface for a linearly separable set of examples (correctly

classified by a straight line)

(17)

REPRESENTATIONAL POWER OF PERCEPTRONS

• Perceptron tunggal dapat digunakan untuk merepresentasikan banyak fungsi

boolean

• AND function: w

₀

= -0.8, w

₁

= w

₂

= 0.5

• OR function: w

₀

= -0.3, w

₁

= w

₂

= 0.5

• Perceptron dapat merepresentasikan semua fungsi boolean primitif : AND, OR,

NAND dan NOR

• Catatan: Beberapa fungsi tidak dapat direpresentasikan dengan perceptron

tunggal contohnya XOR

• Setiap fungsi boolean dapat direpresentasikan dengan beberapa jaringan

perceptron hanya dengan dua level kedalaman

(18)

PERCEPTRON TRAINING RULE

• Note: output value o is +1 or -1

(not a real)

• Perceptron rule: a learning rule for a threshold unit.

• Conditions for convergence

• Training examples are linearly separable.

(19)

CONTOH

• Jika diketahui xi = 0.8,  = 0.1, t = 1 dan o = -1

maka

wi =  (t - o) xi

= 0.1 * ( 1 – (-1)) * 0.8

= 0.1 * 2 * 0.8 = 0.16

(20)

LEAST MEAN SQUARE (LMS) ERROR

• Note: output value o is a real value

(not binary)

• Delta rule: learning rule for an unthresholded perceptron (i.e. linear unit).

• Delta rule is a gradient-descent rule.

(21)

(22)

DELTA RULE FOR ERROR MINIMIZATION

i i i i i

w

E

w













,













D d id d d i

t

o

x

w



(

)

(23)

GRADIENT DESCENT ALGORITHM FOR

PERCEPTRON LEARNING

(24)

PROPERTIES OF GRADIENT DESCENT

• Because the error surface contains only a single global

minimum, the gradient descent algorithm will converge to a

weight vector with minimum error, regardless of whether the

training examples are linearly separable.

• Condition: a sufficiently small learning rate

• If the learning rate is too large, the gradient descent search

may overstep the minimum in the error surface.

(25)

CONDITIONS FOR GRADIENT DESCENT

• Gradient descent adalah strategi umum yang penting untuk

mencari hipotesis space yang besar atau infinite

• Kondisi untuk gradient descent search

• Hypothesis space terdiri atas parameter hipotesis yang

kontinue, contohnya bobot pada unit linear

(26)

DIFFICULTIES WITH GRADIENT DESCENT

• Konvergensi ke local minimum bersifat lambat

• Jika terdapat banyak local minima pada error surfae, maka

tidak ada jaminan bahwa prosedur akan menemukan global

minimum

(27)

PERCEPTRON RULE

V.S.

DELTA RULE

• Perceptron rule

• Thresholded output

• Konvergen setelah melalui sejumlah iterasi terbatas terhadap hipotesis yang dapat mengklasifikasi data latih dengan benar,data latih linear separable.

• Hanya dapat digunakan pada data linearly separable

• Delta rule

• Unthresholded output

• .Konvergen hanya secara asymtitic menuju error minimum, mungkin

membutuhkan waktu yang tak terbatas, tetapi tidak terpaku pada data linearly separable

(28)

(29)

MULTILAYER NETWORK AND

ITS DECISION BOUNDARIES



Wilayah keputusan dari jaringan multilayer feedforward



Jaringan dilatih untuk mengenali 1 dari 10 suara vokal yang terjadi pada konteks

“h_d”



Input jaringan terdiri atas dua parameter F1 dan F2, diperoleh dari analisis

spektral suara.

(30)

DIFFERENTIABLE THRESHOLD UNIT

(31)

BACKPROPAGATION (BP)

ALGORITHM

• BP mempelajari bobot untuk jaringan multilayer, diberikan sebuah jaringan

dengan sekumpulan unit tetap dan interkoneksi

• BP memperkerjakan gradien descent untuk mencoba meminimalkan error

kuadrat antara jaringan output dan nilai target

• Dua tahapan pembelajaran:

• Forwad stage: menghitung output yang diberikan oleh pola x.

(32)

FUNGSI ERROR UNTUK BP

 







D

d

k

outputs

kd

o

t

w

E

(

)

2

1 )

(



• E defined as a sum of the squared errors over all the output units k for all the training

examples d.

• Error surface can have multiple local minima

• Guarantee toward some local minimum

(33)

(34)

(35)

Program 2

(36)

(37)

Program 3

(38)

TUGAS

• Ketikkan kode program berikut kemudian cari keluaran program, misalnya:

1. Data input

2. Neural Network Training

3. Feed-forward Neural Network

4. Output

(39)

Tugas 1

(40)

%PROBLEM DESCRIPTION: Perceptron network with 2-inputs and 2-outputs is trained

%to classify input vectors into 4 categories

(41)

Tugas 2

(42)

(43)

Tugas 3 :

(44)

(45)

(46)

Tugas 4

(47)

(48)

(49)