• Tidak ada hasil yang ditemukan

Anggraini D. 2013. Perbandingan algoritme C4.5 dan CART pada data tidak seimbang untuk kasus prediksi risiko kredit debitur kartu kredit [skripsi]. Bogor (ID): Institut Pertanian Bogor.

Chawla VN. 2003. K-nearest neighbour and imbalance data sets: investigating the effect of sampling method, probabilistic estimate, and decision tree structure.Di dalam: Workshop on Learning from Imbalanced Datasets [Internet]; 2003 Agu 21; Washington DC, Amerika Serikat. Washington DC (US). [diunduh 2013 Mar 27]. Tersedia pada: www.site.uottawa.ca/ ~nat/Workshop2003/chawla.pdf

Chipman H, George EI, McCulloch RE. 1998. Bayesian CART model search. Journal of the American Statistical Association. 93 (443): 935-948.

Gorunescu F. 2011. Data Mining Concepts, Models dan Tehniques. Intelligent Systems Reference Library. Berlin Heidelberg (DE): Springer-Verlag.

Goujon G, Chaoqun, Jianhong W. 2007. Data Clustering: Theory, Algorithms dan Applications. Virginia (US): ASA.

Han J, Kamber M. 2006. Data Mining: Concepts and Techniques, 2nd ed. San Fransisco (US): Morgan Kaufmann.

Larose DT. 2005. Discovering Knowledge in Data : An Introduction to Data Mining. New Jersey (US) : John Wiley.

Laurikkala. 2001. Improving Identification of Difficult Small Classes by Balancing Class Distribution. Tampere (FI): University of Tampere.

Margono. 2004. Metodologi Penelitian Pendidikan. Jakarta (ID): Rineka Cipta. Tan PN, Steinbach M, Kumar V.2006.Introduction to Data Mining. Boston (US):

Pearson Education.

Ulya F. 2013. Klasifikasi debitur kartu kredit menggunakan algoritme k-nearest neighbor untuk kasus imbalanced data [skripsi]. Bogor (ID): Institut Pertanian Bogor.

Wijayanti R. 2013. Klasifikasi nasabah kartu kredit menggunakan algoritme fuzzy k-nearest neighbor pada data tidak seimbang [skripsi]. Bogor (ID): Institut Pertanian Bogor.

Wu X, Kumar V. 2009. The Top Ten Algorithms in Data Mining. New York (US): CRC Press.

LAMPIRAN

Lampiran 1 Atribut-atribut data kreditur bank baik buruk Atribut Keterangan

Pendidikan 1 = SMP/SMA 2 = Akademi 3 = S1/S2 Jenis Kelamin 1 = Pria

2 = Wanita Status Pernikahan 1 = Lajang 2 = Menikah 3 = Bercerai Tipe Perusahaan 1 = Kontraktor

2 = Conversion 3 = Industri Berat 4 = Pertambangan 5 = Jasa

6 = Transportasi Status Pekerjaan 1 = Permanen

2 = Kontrak Pekerjaan 1 = Conversion 2 = PNS 3 = Professional 4 = Wiraswasta 5 = Perusahaan Swasta Masa Kerja Dalam bulan

Lama Tinggal Dalam bulan

Status Pemilikan Rumah 0 = Bukan Milik Sendiri 1 = Milik Sendiri

Banyaknya Tanggungan

Pendapatan Rupiah Banyaknya Kartu Kredit Lain

Persentase Utang Kartu Kredit

Umur Dalam tahun

Kelas 1 = Debitur bad 2 = Debitur good

Lampiran 2 Hasil akurasi (%) data prediksi gaji Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 77.0 76.4 79.8 80.2 80.7

Data asli iterasi 2 78.3 77.1 81.0 80.6 81.1

Data asli iterasi 3 77.9 77.3 80.6 80.1 81.4

Data asli iterasi 4 78.0 77.1 80.8 80.2 81.5

Data asli iterasi 5 79.3 77.3 80.6 80.6 81.9

Data asli iterasi 6 78.4 77.5 80.7 80.3 80.9

Data asli iterasi 7 78.7 77.8 81.3 81.1 82.5

Data asli iterasi 8 78.8 78.0 81.1 81.1 82.7

Data asli iterasi 9 78.7 77.8 81.4 81.4 82.5

Data asli iterasi 10 77.6 76.8 80.4 80.0 81.0

Oversampling duplikasi iterasi 1 90.8 87.8 85.8 84.3 83.4 Oversampling duplikasi iterasi 2 91.2 87.9 86.0 84.3 83.5 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

90.7 87.7 85.8 84.2 83.2 Oversampling duplikasi iterasi 4 90.6 87.8 85.7 84.1 83.1 Oversampling duplikasi iterasi 5 90.9 87.4 85.1 83.7 83.0 Oversampling duplikasi iterasi 6 91.0 88.0 85.9 84.4 83.4 Oversampling duplikasi iterasi 7 90.8 87.8 85.6 83.9 83.2 Oversampling duplikasi iterasi 8 90.9 87.9 85.7 84.3 83.5 Oversampling duplikasi iterasi 9 91.0 87.7 85.9 84.5 83.7 Oversampling duplikasi iterasi 10 90.5 87.5 85.1 83.6 83.0

Rata-rata 90.84 87.75 85.66 84.13 83.3

Oversampling acak iterasi 1 89.8 86.5 83.2 82.5 82.0 Oversampling acak iterasi 2 88.4 85.5 83.0 82.4 81.6 Oversampling acak iterasi 3 88.8 86.0 83.1 82.5 81.7 Oversampling acak iterasi 4 89.1 85.6 82.9 82.1 81.6 Oversampling acak iterasi 5 88.7 85.6 82.7 82.4 81.4 Oversampling acak iterasi 6 89.0 86.1 83.5 82.5 81.7 Oversampling acak iterasi 7 89.0 85.9 83.7 82.9 81.9 Oversampling acak iterasi 8 88.6 85.6 82.7 82.3 81.4 Oversampling acak iterasi 9 88.6 85.6 82.7 82.3 81.4 Oversampling acak iterasi 10 89.8 86.5 83.2 82.5 82.0

Rata-rata 88.98 85.89 83.07 82.44 81.67

Teknik sampling - Algoritma KNN 5-fold

Data asli iterasi 1 78.3 77.1 81.0 80.6 81.2

Data asli iterasi 2 77.7 76.6 80.8 80.4 81.4

Data asli iterasi 3 78.8 77.2 80.9 80.5 81.5

Data asli iterasi 4 78.3 77.7 80.5 80.8 82.2

Data asli iterasi 5 77.8 77.3 80.8 80.8 82.0

Oversampling duplikasi iterasi 1 90.8 87.4 85.3 83.8 83.3 Oversampling duplikasi iterasi 2 90.4 87.3 85.3 83.6 83.2 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

90.7 87.2 84.9 83.5 83.1 Oversampling duplikasi iterasi 4 90.5 87.3 85.1 83.7 83.3 Oversampling duplikasi iterasi 5 90.3 86.9 84.9 83.5 83.2

Rata-rata 90.54 87.22 85.10 83.62 83.22

Oversampling acak iterasi 1 88.8 85.6 82.8 82.1 81.6 Oversampling acak iterasi 2 87.9 84.9 82.5 81.7 81.1 Oversampling acak iterasi 3 88.3 84.9 82.1 81.7 81.0 Oversampling acak iterasi 4 88.1 85.3 83.2 82.3 81.7 Oversampling acak iterasi 5 88.0 85.0 82.5 82.4 81.6

Lampiran 3 Hasil precision (%) data prediksi gaji Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 52.0 50.7 58.2 57.9 60.2

Data asli iterasi 2 54.8 52.0 61.1 58.8 61.5

Data asli iterasi 3 54.0 52.3 60.3 57.8 62.2

Data asli iterasi 4 53.9 52.0 60.6 58.0 62.1

Data asli iterasi 5 56.3 52.1 59.5 58.3 62.2

Data asli iterasi 6 54.8 52.5 60.2 58.1 60.7

Data asli iterasi 7 55.4 53.0 61.3 59.6 64.8

Data asli iterasi 8 55.5 53.3 60.7 59.4 64.4

Data asli iterasi 9 55.4 52.9 61.5 60.1 64.6

Data asli iterasi 10 53.2 51.5 59.3 57.3 60.8

Oversampling duplikasi iterasi 1 86.4 82.5 80.1 78.5 77.0 Oversampling duplikasi iterasi 2 86.9 82.7 80.3 78.5 77.8 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

86.1 82.3 80.0 78.2 77.4 Oversampling duplikasi iterasi 4 86.1 82.5 80.0 78.2 77.4 Oversampling duplikasi iterasi 5 86.4 82.0 79.2 77.7 77.1 Oversampling duplikasi iterasi 6 86.6 82.8 80.2 78.6 77.8 Oversampling duplikasi iterasi 7 86.2 82.5 79.8 78.0 77.3 Oversampling duplikasi iterasi 8 86.5 82.6 80.0 78.4 77.7 Oversampling duplikasi iterasi 9 86.6 82.4 80.3 78.6 78.0 Oversampling duplikasi iterasi 10 85.9 82.0 79.2 77.6 77.1

Rata-rata 86.37 82.43 79.91 78.23 77.46

Oversampling acak iterasi 1 83.8 79.5 77.6 76.3 77.1 Oversampling acak iterasi 2 84.5 79.9 77.5 76.1 76.4 Oversampling acak iterasi 3 83.4 79.3 77.2 76.1 76.3 Oversampling acak iterasi 4 83.4 79.5 77.4 76.2 76.4 Oversampling acak iterasi 5 83.8 78.9 76.7 75.4 75.9 Oversampling acak iterasi 6 84.0 79.6 77.2 76.3 76.5 Oversampling acak iterasi 7 83.7 79.5 77.7 76.2 76.0 Oversampling acak iterasi 8 83.9 79.4 77.8 76.4 76.5 Oversampling acak iterasi 9 83.2 78.9 76.7 75.6 75.5 Oversampling acak iterasi 10 83.2 78.9 76.7 75.6 75.5

Rata-rata 83.69 79.34 77.25 76.02 76.21

Algoritma KNN 5-fold

Teknik sampling K=1 K=2 K=3 K=4 K=5

Data asli iterasi 1 54.7 51.8 60.9 58.5 61.5

Data asli iterasi 2 53.5 51.1 60.4 58.2 61.9

Data asli iterasi 3 55.4 52.0 60.5 58.4 61.8

Data asli iterasi 4 54.7 52.8 59.7 59.0 63.8

Data asli iterasi 5 53.5 52.1 60.1 59.0 63.2

Oversampling duplikasi iterasi 1 86.3 82.0 79.5 77.8 77.5 Oversampling duplikasi iterasi 2 85.6 81.7 79.4 77.6 77.3 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

86.1 81.7 79.0 77.5 77.3 Oversampling duplikasi iterasi 4 85.8 81.7 79.2 77.7 77.4 Oversampling duplikasi iterasi 5 85.5 81.3 79.0 77.5 77.3

Rata-rata 85.86 81.68 79.22 77.62 77.36

Oversampling acak iterasi 1 83.9 79.1 77.2 75.9 76.4 Oversampling acak iterasi 2 83.0 78.6 77.0 75.6 76.1 Oversampling acak iterasi 3 83.5 78.5 76.5 75.5 76.1 Oversampling acak iterasi 4 83.1 78.8 77.4 75.9 76.3 Oversampling acak iterasi 5 82.9 78.5 76.8 75.9 76.2

Lampiran 4 Hasil recall (%) data prediksi gaji Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 57.3 69.0 57.4 65.3 58.8

Data asli iterasi 2 55.2 66.2 58.0 64.5 57.3

Data asli iterasi 3 56.4 67.0 56.8 63.8 58.2

Data asli iterasi 4 59.2 67.7 58.5 64.3 59.2

Data asli iterasi 5 62.9 71.7 61.0 68.4 63.4

Data asli iterasi 6 59.2 69.5 58.5 64.3 58.3

Data asli iterasi 7 59.2 70.0 60.5 66.7 59.7

Data asli iterasi 8 60.8 70.9 61.5 67.2 62.8

Data asli iterasi 9 59.6 69.9 60.5 67.1 60.3

Data asli iterasi 10 59.2 69.2 59.1 66.2 59.9

Oversampling duplikasi iterasi 1 99.1 99.1 99.1 99.1 98.6 Oversampling duplikasi iterasi 2 99.2 99.2 99.2 99.1 98.7 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

99.5 99.5 99.5 99.5 98.9 Oversampling duplikasi iterasi 4 99.2 99.2 99.2 99.2 98.7 Oversampling duplikasi iterasi 5 99.3 99.3 99.3 99.3 99.0 Oversampling duplikasi iterasi 6 99.2 99.2 99.2 99.1 98.5 Oversampling duplikasi iterasi 7 99.3 99.3 99.3 99.3 98.9 Oversampling duplikasi iterasi 8 99.3 99.3 99.3 99.3 98.8 Oversampling duplikasi iterasi 9 99.2 99.2 99.2 99.1 98.7 Oversampling duplikasi iterasi 10 99.4 99.4 99.4 99.3 98.9

Rata-rata 99.27 99.27 99.27 99.23 98.77

Oversampling acak iterasi 1 96.4 96.8 94.5 95.2 93.0 Oversampling acak iterasi 2 97.3 97.6 93.5 94.7 92.7 Oversampling acak iterasi 3 95.9 96.1 93.5 94.5 91.6 Oversampling acak iterasi 4 96.8 97.0 93.6 94.5 91.7 Oversampling acak iterasi 5 96.8 97.2 94.5 95.3 92.5 Oversampling acak iterasi 6 95.7 95.8 93.0 94.0 90.7 Oversampling acak iterasi 7 96.9 97.2 94.0 94.7 92.6 Oversampling acak iterasi 8 96.4 96.9 94.3 95.1 92.2 Oversampling acak iterasi 9 96.8 97.1 94.1 95.4 92.8 Oversampling acak iterasi 10 96.8 97.1 94.1 95.4 92.8

Rata-rata 96.58 96.88 93.91 94.88 92.26

Algoritma KNN 5-fold

Teknik sampling K=1 K=2 K=3 K=4 K=5

Data asli iterasi 1 57.0 68.4 58.6 66.6 58.7

Data asli iterasi 2 57.8 67.4 58.5 65.4 58.9

Data asli iterasi 3 55.4 52.0 60.5 66.2 61.0

Data asli iterasi 4 59.1 70.2 58.9 65.8 60.7

Data asli iterasi 5 59.8 71.1 60.3 67.1 60.5

Oversampling duplikasi iterasi 1 99.3 99.3 99.3 99.2 98.8 Oversampling duplikasi iterasi 2 99.5 99.5 99.5 99.4 99.0 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

99.4 99.4 99.4 99.4 98.9 Oversampling duplikasi iterasi 4 99.5 99.5 99.5 99.4 99.1 Oversampling duplikasi iterasi 5 99.4 99.4 99.4 99.4 99.1

Rata-rata 99.42 99.42 99.42 99.36 98.98

Oversampling acak iterasi 1 96.0 96.7 93.0 94.0 91.7 Oversampling acak iterasi 2 95.4 95.9 92.5 93.6 90.7 Oversampling acak iterasi 3 95.5 96.1 92.7 93.8 90.4 Oversampling acak iterasi 4 95.8 96.6 93.9 94.7 91.9 Oversampling acak iterasi 5 96.0 96.5 93.3 94.8 91.9

Lampiran 5 Hasil f-measure (%) data prediksi gaji Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 54.5 58.4 57.8 61.4 59.5

Data asli iterasi 2 55.0 58.2 59.5 61.5 59.3

Data asli iterasi 3 55.1 58.7 58.5 60.6 60.1

Data asli iterasi 4 56.4 58.8 59.5 61.0 60.6

Data asli iterasi 5 59.4 60.4 60.2 62.9 62.8

Data asli iterasi 6 56.9 59.8 59.4 61.1 59.5

Data asli iterasi 7 57.2 60.3 60.9 62.9 62.2

Data asli iterasi 8 58.1 60.9 61.1 63.1 63.6

Data asli iterasi 9 57.4 60.3 61.0 63.4 62.4

Data asli iterasi 10 56.1 59.0 59.2 61.5 60.3

Rata-rata 56.61 59.48 59.71 61.94 61.03

Oversampling duplikasi iterasi 1 92.3 90.1 88.6 87.6 86.9 Oversampling duplikasi iterasi 2 92.6 90.2 88.8 87.6 87.0 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

92.3 90.1 88.7 87.6 86.8 Oversampling duplikasi iterasi 4 92.2 90.1 88.6 87.5 86.7 Oversampling duplikasi iterasi 5 92.4 89.8 88.2 87.2 86.7 Oversampling duplikasi iterasi 6 92.5 90.2 88.7 87.7 86.9 Oversampling duplikasi iterasi 7 92.3 90.1 88.5 87.3 86.8 Oversampling duplikasi iterasi 8 92.4 90.2 88.6 87.6 87.0 Oversampling duplikasi iterasi 9 92.5 90.0 88.7 87.7 87.1 Oversampling duplikasi iterasi 10 92.2 89.9 88.2 87.1 86.7

Rata-rata 92.37 90.07 88.56 87.49 86.86

Oversampling acak iterasi 1 89.7 87.3 85.2 84.7 84.3 Oversampling acak iterasi 2 90.5 87.9 84.8 84.4 83.8 Oversampling acak iterasi 3 89.2 86.9 84.6 84.3 83.3 Oversampling acak iterasi 4 89.6 87.4 84.7 84.4 83.4 Oversampling acak iterasi 5 89.8 87.1 84.7 84.2 83.4 Oversampling acak iterasi 6 89.5 86.9 84.3 84.2 83.0 Oversampling acak iterasi 7 89.8 87.5 85.1 84.4 83.5 Oversampling acak iterasi 8 89.7 87.3 85.3 84.7 83.6 Oversampling acak iterasi 9 89.5 87.1 84.5 84.4 83.3 Oversampling acak iterasi 10 89.5 87.1 84.5 84.4 83.3

Rata-rata 89.68 87.25 84.77 84.41 83.49

Algoritma KNN 5-fold

Teknik sampling K=1 K=2 K=3 K=4 K=5

Data asli iterasi 1 55.8 59.0 59.7 62.3 60.1

Data asli iterasi 2 55.5 58.1 59.4 61.6 60.3

Data asli iterasi 3 58.0 59.9 59.9 62.0 61.4

Data asli iterasi 4 56.8 60.2 59.3 62.2 62.2

Data asli iterasi 5 56.5 60.2 60.2 62.8 61.8

Oversampling duplikasi iterasi 1 92.3 89.8 88.3 87.2 86.9 Oversampling duplikasi iterasi 2 92.0 89.7 88.3 87.2 86.8 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

92.3 89.7 88.0 87.1 86.8 Oversampling duplikasi iterasi 4 92.1 89.8 88.2 87.2 86.9 Oversampling duplikasi iterasi 5 92.0 89.5 88.0 87.1 86.9

Rata-rata 92.14 89.70 88.16 87.16 86.86

Oversampling acak iterasi 1 89.5 87.0 84.4 84.0 83.3 Oversampling acak iterasi 2 88.8 86.4 84.1 83.7 82.7 Oversampling acak iterasi 3 89.1 86.4 83.8 83.6 82.6 Oversampling acak iterasi 4 89.0 86.8 84.8 84.3 83.4 Oversampling acak iterasi 5 88.9 86.6 84.2 84.3 83.3

Lampiran 6 Hasil akurasi (%) data kreditur bank baik / buruk

Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 51.8 51.8 66.8 66.8 77.1

Data asli iterasi 2 36.1 32.7 59.5 56.4 70.9

Data asli iterasi 3 12.9 7.2 35.1 28.1 51.0

Data asli iterasi 4 50.0 50.0 66.8 66.8 78.1

Data asli iterasi 5 76.0 76.0 81.7 81.7 83.5

Data asli iterasi 6 82.0 82.0 83.8 83.8 83.8

Data asli iterasi 7 82.7 82.7 83.5 83.5 83.8

Data asli iterasi 8 80.7 80.7 83.5 83.5 83.8

Data asli iterasi 9 65.2 65.2 76.8 76.8 81.7

Data asli iterasi 10 70.0 50.5 24.3 16.9 30.8

Oversampling duplikasi iterasi 1 82.4 72.1 64.4 61.6 59.1 Oversampling duplikasi iterasi 2 73.8 66.6 61.5 59.2 57.7 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

61.0 57.1 55.9 55.5 55.0 Oversampling duplikasi iterasi 4 81.4 71.8 64.9 60.8 58.9 Oversampling duplikasi iterasi 5 95.6 88.7 81.7 75.9 71.4 Oversampling duplikasi iterasi 6 99.0 97.3 95.9 94.1 92.6 Oversampling duplikasi iterasi 7 99.4 98.2 95.5 93.9 92.6 Oversampling duplikasi iterasi 8 98.3 96.3 94.6 92.5 90.5 Oversampling duplikasi iterasi 9 89.8 84.0 79.3 74.6 72.4 Oversampling duplikasi iterasi 10 57.6 56.7 56.2 55.3 55.2

Rata-rata 83.83 78.88 74.99 72.34 70.54

Oversampling acak iterasi 1 80.3 69.5 60.5 59.5 56.6 Oversampling acak iterasi 2 71.9 64.0 58.3 57.2 56.8 Oversampling acak iterasi 3 58.0 53.5 51.7 51.9 51.7 Oversampling acak iterasi 4 79.5 69.7 63.2 60.2 58.9 Oversampling acak iterasi 5 94.9 88.9 81.7 78.8 73.7 Oversampling acak iterasi 6 98.8 96.9 95.2 94.0 92.2 Oversampling acak iterasi 7 100.0 98.5 95.5 95.2 92.5 Oversampling acak iterasi 8 96.9 95.2 92.9 91.9 87.7 Oversampling acak iterasi 9 88.3 82.3 77.5 75.1 73.9 Oversampling acak iterasi 10 53.4 52.6 52.1 51.4 51.4

Rata-rata 82.2 77.11 72.86 71.52 69.54

Algoritma KNN 5-fold

Data asli iterasi 1 18.4 16.6 33.9 30.2 48.1

Data asli iterasi 2 25.3 19.0 38.7 31.6 48.5

Data asli iterasi 3 77.9 77.9 82.6 82.6 83.5

Data asli iterasi 4 79.4 79.4 83.0 83.0 83.7

Data asli iterasi 5 33.7 27.3 40.7 36.3 45.2

Oversampling duplikasi iterasi 1 64.1 58.3 55.9 55.4 54.9 Oversampling duplikasi iterasi 2 67.9 62.9 59.1 57.1 56.5 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

96.1 90.4 85.2 81.8 78.7 Oversampling duplikasi iterasi 4 97.6 94.9 90.5 87.4 84.5 Oversampling duplikasi iterasi 5 72.4 68.9 66.5 64.1 62.4

Rata-rata 79.6

2

75.08 71.44 69.16 67.4

Oversampling acak iterasi 1 60.1 54.1 51.0 50.9 48.8 Oversampling acak iterasi 2 65.1 59.9 56.1 54.8 53.8 Oversampling acak iterasi 3 95.5 90.1 85.3 83.6 79.2 Oversampling acak iterasi 4 100.0 98.8 97.2 96.7 94.5 Oversampling acak iterasi 5 69.4 66.0 62.8 60.9 59.5

Lampiran 7 Hasil precision (%) data kreditur bank baik / buruk

Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 0 0 0 0 0

Data asli iterasi 2 0 0 0 0 0

Data asli iterasi 3 0 0 0 0 0

Data asli iterasi 4 0 0 0 0 0

Data asli iterasi 5 0 0 0 0 0

Data asli iterasi 6 0 0 0 0 0

Data asli iterasi 7 0 0 0 0 0

Data asli iterasi 8 0 0 0 0 0

Data asli iterasi 9 0 0 0 0 0

Data asli iterasi 10 0 0 0 0 0

Oversampling duplikasi iterasi 1 75.4 65.9 60.3 58.4 55.9 Oversampling duplikasi iterasi 2 67.3 61.8 58.3 57.0 56.0 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

58.1 55.7 55.1 54.8 54.5 Oversampling duplikasi iterasi 4 74.4 65.7 60.6 57.9 56.8 Oversampling duplikasi iterasi 5 92.5 82.6 74.7 69.1 65.4 Oversampling duplikasi iterasi 6 98.2 95.3 92.9 90.1 88.0 Oversampling duplikasi iterasi 7 99.0 96.7 92.3 89.9 88.0 Oversampling duplikasi iterasi 8 96.9 93.6 90.9 87.8 85.0 Oversampling duplikasi iterasi 9 84.1 77.1 72.3 68.0 66.1 Oversampling duplikasi iterasi 10 55.8 55.4 55.0 54.6 54.5

Rata-rata 80.17 74.98 71.24 68.76 67.02

Oversampling acak iterasi 1 72.3 62.3 56.2 55.4 53.9 Oversampling acak iterasi 2 64.1 58.2 54.6 54.0 53.8 Oversampling acak iterasi 3 54.3 51.8 50.9 51.0 50.9 Oversampling acak iterasi 4 71.2 62.3 57.7 55.7 55.0 Oversampling acak iterasi 5 91.2 81.9 73.8 70.4 67.3 Oversampling acak iterasi 6 97.9 94.5 92.2 90.2 89.6 Oversampling acak iterasi 7 100.0 97.0 97.1 95.4 96.6 Oversampling acak iterasi 8 96.4 92.5 90.0 88.0 86.1 Oversampling acak iterasi 9 81.7 74.2 69.7 67.0 66.4 Oversampling acak iterasi 10 51.8 51.3 51.1 50.7 50.7

Rata-rata 78.09 72.6 69.33 67.78 67.03

Algoritma KNN 5-fold

Data asli iterasi 1 0 0 0 0 0

Data asli iterasi 2 0 0 0 0 0

Data asli iterasi 3 0 0 0 0 0

Data asli iterasi 4 0 0 0 0 0

Data asli iterasi 5 0 0 0 0 0

Oversampling duplikasi iterasi 1 60.0 56.4 55.1 54.7 54.5 Oversampling duplikasi iterasi 2 62.7 59.2 56.9 55.7 55.4 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

93.3 84.9 78.5 74.7 71.7 Oversampling duplikasi iterasi 4 95.7 91.4 85.1 81.1 77.7 Oversampling duplikasi iterasi 5 66.1 63.4 61.7 60.0 58.9

Rata-rata 75.56 71.06 67.46 65.24 63.64

Oversampling acak iterasi 1 55.8 52.2 50.5 50.4 49.3 Oversampling acak iterasi 2 59.0 55.5 53.3 52.5 52.1 Oversampling acak iterasi 3 92.4 83.7 78.6 76.1 73.4 Oversampling acak iterasi 4 100.0 97.7 97.2 95.8 95.6 Oversampling acak iterasi 5 62.3 59.6 57.6 56.3 55.6

Lampiran 8 Hasil recall (%) data kreditur bank baik / buruk Algoritma KNN 10-fold

Teknik sampling k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 0 0 0 0 0

Data asli iterasi 2 0 0 0 0 0

Data asli iterasi 3 0 0 0 0 0

Data asli iterasi 4 0 0 0 0 0

Data asli iterasi 5 0 0 0 0 0

Data asli iterasi 6 0 0 0 0 0

Data asli iterasi 7 0 0 0 0 0

Data asli iterasi 8 0 0 0 0 0

Data asli iterasi 9 0 0 0 0 0

Data asli iterasi 10 0 0 0 0 0

Oversampling duplikasi iterasi 1 100 100 100 100 100 Oversampling duplikasi iterasi 2 100 100 100 100 100 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

100 100 100 100 100 Oversampling duplikasi iterasi 4 100 100 100 100 100 Oversampling duplikasi iterasi 5 100 100 100 100 100 Oversampling duplikasi iterasi 6 100 100 100 100 100 Oversampling duplikasi iterasi 7 100 100 100 100 100 Oversampling duplikasi iterasi 8 100 100 100 100 100 Oversampling duplikasi iterasi 9 100 100 100 100 100 Oversampling duplikasi iterasi 10 100 100 100 100 100

Rata-rata 100 100 100 100 100

Oversampling acak iterasi 1 98.2 99.1 94.5 97.2 91.4 Oversampling acak iterasi 2 99.1 99.7 97.8 98.5 96.0 Oversampling acak iterasi 3 100.0 100.0 97.5 98.8 96.0 Oversampling acak iterasi 4 99.1 99.7 99.4 99.7 97.5 Oversampling acak iterasi 5 99.4 100.0 98.2 99.4 92.3 Oversampling acak iterasi 6 99.7 99.7 98.8 98.8 95.4 Oversampling acak iterasi 7 100.0 100.0 93.8 95.1 88.0 Oversampling acak iterasi 8 97.5 98.5 96.6 96.9 89.8 Oversampling acak iterasi 9 98.8 99.1 97.5 98.8 96.6 Oversampling acak iterasi 10 98.5 98.5 98.2 98.5 96.1

Rata-rata 99.03 99.43 97.23 98.1

7

93.9 1

Algoritma KNN 5-fold

Data asli iterasi 1 0 0 0 0 0

Data asli iterasi 2 0 0 0 0 0

Data asli iterasi 3 0 0 0 0 0

Data asli iterasi 4 0 0 0 0 0

Data asli iterasi 5 0 0 0 0 0

Oversampling duplikasi iterasi 1 100 100 100 100 100 Oversampling duplikasi iterasi 2 100 100 100 100 100 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

100 100 100 100 100 Oversampling duplikasi iterasi 4 100 100 100 100 100 Oversampling duplikasi iterasi 5 100 100 100 100 100

Rata-rata 100 100 100 100 100

Oversampling acak iterasi 1 97.5 98.2 95.2 96.6 90.9 Oversampling acak iterasi 2 99.5 99.8 98.0 99.1 94.0 Oversampling acak iterasi 3 99.2 99.5 97.1 98.0 91.6 Oversampling acak iterasi 4 100.0 100.0 97.1 97.7 93.4 Oversampling acak iterasi 5 98.5 99.2 96.9 98.0 94.7

Rata-rata 98.94 99.34 96.86 97.88 92.9

Lampiran 9 Hasil f-measure (%) data kreditur bank baik / buruk

Teknik sampling KNN10-fold k=1 k=2 k=3 k=4 k=5

Data asli iterasi 1 0 0 0 0 0

Data asli iterasi 2 0 0 0 0 0

Data asli iterasi 3 0 0 0 0 0

Data asli iterasi 4 0 0 0 0 0

Data asli iterasi 5 0 0 0 0 0

Data asli iterasi 6 0 0 0 0 0

Data asli iterasi 7 0 0 0 0 0

Data asli iterasi 8 0 0 0 0 0

Data asli iterasi 9 0 0 0 0 0

Data asli iterasi 10 0 0 0 0 0

Oversampling duplikasi iterasi 1 86.0 79.5 75.2 73.8 72.5 Oversampling duplikasi iterasi 2 80.5 76.4 73.7 72.6 71.8 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

93.5 71.5 71.0 70.8 70.6 Oversampling duplikasi iterasi 4 85.3 79.3 75.4 73.3 72.4 Oversampling duplikasi iterasi 5 96.1 90.5 85.5 81.8 79.0 Oversampling duplikasi iterasi 6 99.1 97.6 96.3 94.8 93.6 Oversampling duplikasi iterasi 7 99.5 98.3 96.0 94.7 93.6 Oversampling duplikasi iterasi 8 98.4 96.7 95.2 93.5 91.9 Oversampling duplikasi iterasi 9 91.4 87.1 83.9 81.0 79.6 Oversampling duplikasi iterasi 10 71.7 71.3 71.0 70.6 70.6

Rata-rata 90.15 84.82 82.32 80.69 79.56

Oversampling acak iterasi 1 83.3 76.5 70.5 70.6 67.8 Oversampling acak iterasi 2 77.9 73.5 70.1 69.7 69.0 Oversampling acak iterasi 3 70.4 68.3 66.9 67.2 66.5 Oversampling acak iterasi 4 82.9 76.7 73.0 71.4 70.4 Oversampling acak iterasi 5 100.0 88.3 80.6 77.7 76.0 Oversampling acak iterasi 6 95.1 90.0 84.3 82.4 77.8 Oversampling acak iterasi 7 98.8 97.0 95.4 94.3 92.4 Oversampling acak iterasi 8 99.4 98.0 94.7 94.8 92.2 Oversampling acak iterasi 9 96.9 95.4 93.2 92.2 88.0 Oversampling acak iterasi 10 89.4 84.8 81.3 79.9 78.7

Rata-rata 89.41 84.85 81 80.02 77.88

Algoritme KNN 5-fold

Data asli iterasi 1 0 0 0 0 0

Data asli iterasi 2 0 0 0 0 0

Data asli iterasi 3 0 0 0 0 0

Data asli iterasi 4 0 0 0 0 0

Data asli iterasi 5 0 0 0 0 0

Oversampling duplikasi iterasi 1 75.0 72.2 71.0 70.7 70.5 Oversampling duplikasi iterasi 2 77.1 74.4 72.5 71.6 71.3 Oversampling duplikasi iterasi 3

duplikasi iterasi 1

96.5 91.8 88.0 85.5 83.5 Oversampling duplikasi iterasi 4 97.8 95.5 91.9 89.6 87.4 Oversampling duplikasi iterasi 5 79.6 77.6 76.3 75.0 74.1

Rata-rata 85.2 82.3 79.94 78.48 77.36

Oversampling acak iterasi 1 70.9 68.2 66.0 66.3 64.0 Oversampling acak iterasi 2 74.1 69.1 68.7 67.0 95.7 Oversampling acak iterasi 3 71.4 90.9 86.9 85.7 81.5 Oversampling acak iterasi 4 100.0 98.9 97.2 96.7 94.5 Oversampling acak iterasi 5 76.3 74.5 72.2 71.5 70.0

Dokumen terkait