• Tidak ada hasil yang ditemukan

Detection of phishing websites using machine learning approaches

N/A
N/A
Protected

Academic year: 2024

Membagikan "Detection of phishing websites using machine learning approaches"

Copied!
1
0
0

Teks penuh

(1)

Detection of phishing websites using machine learning approaches ABSTRACT

As the world responded to the Coronavirus Disease 2019 (COVID-19) pandemic in 2020, digital operations became more important, and people started to depend on new initiatives such as the cloud and mobile infrastructure. Consequently, the number of cyberattacks such as phishing has increased. Phishing websites can be detected using machine learning by classifying the websites into legitimate or illegitimate websites. The purpose of the study is to conduct a mini-review of the existing techniques and implement experiments to detect whether a website is malicious or not. The dataset consists of 11,055 observations and 32 variables. Three supervised learning models are implemented in this study: Decision Tree, K- Nearest Neighbour (KNN), and Random Forest. The three algorithms are chosen because it provides a better understanding and more suitable for the dataset. Based on the experiments undertaken, the result shows Decision Tree has an accuracy of 91.16% which is the lowest compared to the other models, 97.6% for the KNN model which is the highest among all the models and 94.44% accuracy for the Random Forest model. Through comparisons between the three models, KNN was the prime candidate for the best model considering that it has the highest accuracy. However, Random Forest is deemed more suitable for the dataset even though the accuracy is lesser because of the lowest false-negative value than the other models. The experiments can be further investigated with different datasets and models for comparative analysis.

Referensi

Dokumen terkait

in the fourth experiment by adding the number of trees used, namely one hundred, then each tree has ten branches, adding learning rate with a value of 0.4 which is used as a

Among the classification techniques J48, Random Forest, Random Tree, Decision Table, MLP, Naïve Bayes, and Bayes Network, the Random Forest classifier has achieved the highest accuracy

Detection rate covers 49% of analyzed data, Accuracy covers 28% of analyzed data Detection is given more importance than analysis during performance evaluation Table 5:

The objectives of this research were to 1 develop icing detection algorithms for a portion of East Asia based on machine learning approaches with COMS MI and Himawari-8 AHI products; 2

2.The accuracy results of KNN 6 Label using TF-IDF In Figure 2, it can be observed that the highest accuracy is obtained from the data splitting of 80:20, which is 58%.. Previous

4.2 Experimental Results and Analysis We have used supervised algorithms like Decision tree, Naive Bayes, Random Forest, Support Vector Machine SVM, K-Nearest Neighbors KNN and

Rank 2 is the KNN algorithm with an accuracy rate of 55.56%, Precision 66.67% and Recall 66.67% and the last is the Decision Tree algorithm with an accuracy rate of 46%, Precison

The results of performance calculations for the K-Nearest Neighbor, Decision Tree, Random Forest, and Naïve Bayes algorithms using data validation, namely 10 k-fold and performance