• Tidak ada hasil yang ditemukan

Implementation of Naive Bayes as a Quality of Service Determination on Traffic Network Communication Protocol.

N/A
N/A
Protected

Academic year: 2017

Membagikan "Implementation of Naive Bayes as a Quality of Service Determination on Traffic Network Communication Protocol."

Copied!
9
0
0

Teks penuh

(1)
(2)

Executive Editor

Prof. Manju Sharma,

India

Editorial Board Members

:

Associate Editorial Board members

SUKUMAR SENTHILKUMAR Universiti Sains Malaysia,School of Mathematical

Sciences,Malaysia.

Dr. Prasanta K Sinha

Durgapur Institute of Advanced Technology & Management, Durgapur

Dr. Bensafi Abd-El-Hamid

Abou Bekr Belkaid University of Tlemcen,Algeria

Dr. A.V.Senthil Kumar

Director, MCA depart, Hindusthan College of Arts and Science, Tamilnadu, India

Dr. Prasanta K Sinha

Deputy Director, Durgapur Institute of Advanced Technology & Management,West Bengal

DR. SURESH PRASAD SINGH H.O.D.(Chemical Engineering), B. I. T. Sindri,

Dhanbad

Hari Mohan Pandey

Middle East College of Information Technology, Under Coventry University, U.K.

PATRICK TIONG LIQ YEE Universiti Teknologi Malaysia, Malaysia,

Dr V S GIRIDHAR AKULA

Professor and Principal, Avanthi's Scientifi Technological and Research Academy (JNTU), Hyderabad

Dr. Santosh K. Pandey Department of Information Technology Board of Studies , The Institute of Chartered

Accountants of India

Reviewer

Members

:

Dr. Ahmed Nabih Zaki Rashed, Menoufia University, Egypt

Prof. Priyavrat Thareja, HOD, PEC Univ. of Technology, Chandigarh ,

India Dr. Axnirleta , HOD, Electrical and Computer engineering,

New zealand

Dr. Nouby Mahdy Ghazaly, Phd, Egyptian

Prof. Shailesh Shriniwas Angalekar, P.hd, Shivaji University, Kolhapur

Er. Shobhit Jaiswal, Researcher, Associated Electronics Research Foundation,

India

Prof. Manoj Gupta, Central University of Rajasthan, Ajmer

Dr. Rahul V. Ralegaonkar

Associate Professor in Civil Engineering, VNIT, Nagpur

Dr. (Mrs.) R. Uma Rani Asso.Prof., Department of Computer Science, Sri Sarada College For Women,

Tamil Nadu

Jasvinder Singh Sadana

Ph.D research scholar at University School of Information and Communication Technology, GGSIPU, New Delhi Dr. SAURABH DUTTA

Professor and Head of MCA, Dr. B. C. Roy Engineering College,

West Bengal,

Dr.(Prof) Alexane Axil, Brazil

Yudhishthir Raut, NRI-IIST, Bhopal

Bindeshwar Singh,

Kamla Nehru Institute of Technology, Sultanpur

ANUJ KUMAR GUPTA, Associate Prof., RIMT, Mandi Gobindgarh

Dr.R.Seyezhai,

Associate Professor, SSN College of Engineering, Tamilnadu

LAXMI CHAND, DIT, Delhi, India

Dr.Hadi Arabshahi, Iran A.K.M Nazmus Sakib, Bangladesh

Dr.Eugen Axinte, Romanian Dr.Rao, P.hd, USA

Dr. Yaduvir Singh, India Dr. Aknuas William, Australia

(3)

International Journal of Engineering Research and Applications (IJERA)

ISSN: 2248-9622 www.ijera.com

www.ijera.com www.ijera.com www.ijera.com

Journal Information

International Journal of Engineering Research and Applications (IJERA) is an open access online peer

reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.

International Journal of Engineering Research and Applications (IJERA) aims to cover the latest

outstanding developments in the field of all Engineering Technologies & science.

International Journal of Engineering Research and Applications (IJERA) is a team of researchers not

publication services or private publications running the journals for monetary benefits, we are association of scientists and academia who focus only on supporting authors who want to publish their work. The articles published in our journal can be accessed online, all the articles will be archived for real time access.

Our journal system primarily aims to bring out the research talent and the works done by sciaentists, academia, engineers, practitioners, scholars, post graduate students of engineering and science. This journal aims to cover the scientific research in a broader sense and not publishing a niche area of research facilitating researchers from various verticals to publish their papers. It is also aimed to provide a platform for the researchers to publish in a shorter of time, enabling them to continue further All articles published are freely available to scientific researchers in the Government agencies,educators and the general public. We are taking serious efforts to promote our journal across the globe in various ways, we are sure that our journal will act as a scientific platform for all researchers to publish their works online.

IJERA Impact Factor 1.69 (By Google Scholar Citation Report)

IJERA Impact Factor 1.324 (By African Quality Centre for Journals)

IJERA is ISO 3297:2007 certified Journal.

IJERA is index in world top indexing database like Nasa Ads, ArXiv.org, ANED (American National Engineering Database), NCBI (US National Library).

(4)

TABLE OF CONTENTS

Volume 3 Number 6 (November-December 2013)

Discussion on Different Algorithm for Text Classification and Feature 1271-1275

Selection

Nilophar Mulani, Yoginath Kalshetty

Study of load balancing protocols in MANET 1276-1280

Prof. S.A.Jain, Ms. Kunjal Patel

High-Speed Motorcycle Dynamics: Quick Turns While Going Straight and 1281-1285

Around Curves

Ronald L. Huston, Larry S. Schartman, John Connelly

Implementation of Naive Bayes as a Quality of Service Determination on 1286-1290

Traffic Network Communication Protocol

Sudarma, M., Pramana, D.H

16-Bit RISC Processor Design for Convolution Applications 1291-1294

Ramana K, Mrs. G. Sree Lakshmi and Dr. M. Ch. P. Jagdissh

Fuzzy Reliability Analysis for the Effect of TRH Based On Gamma Distribution 1295-1298

A. Venkatesh, S. Elango

A Novel Secured RGB LSB Steganography with Enhanced Stego-Image Quality 1299-1303

Koyi Lakshmi Prasad, Dr. T.Ch.Malleswara Rao
(5)

Sudarma, M et al Int. Journal of Engineering Research and Applications

www.ijera.com

ISSN : 2248-9622, Vol. 3, Issue 6, Nov-Dec 2013, pp.1286-1290

www.ijera.com 1286 |P a g e

Implementation of Naive Bayes as a Quality of Service

Determination on Traffic Network Communication Protocol

Sudarma, M.*, Pramana, D.H **

* Computer System and Informatics, Department of Electrical Engineering Faculty of Engineering, Udayana University, Bali - Indonesia

** Department of Information System, School of Information Technology and Computer (STMIK STIKOM BALI),

ABSTRACT

The utilization of communication model based on computer network technology is a technology that has been widely used. Communication model on computer network, using communication protocol which fits in with standard communication type that widely known as network protocol. The number of ports which identify network protocol according to IANA is numbering 65,536 ports. The utilization of network protocol in communication of computer network, is sometimes demanding communication priority which often known as QoS (Quality of Service). Referred communication priority is based on the number of communication model utilization which using network protocol as in HTTP protocol.

The analysis of computer network traffic is one of the methods to find out the utilization of computer network communication protocol, so that can be a basis of priority setting (QoS). The utilization of Naïve Bayes is used to classify communication protocol in computer network. The utilization of wireshark application is used as network capture tool and the result achieved is classifying network protocol used as a setting reference of QoS.

Keywords

-

Network protocol, QoS, Network capture

I.

INTRODUCTION

The right classification for internet traffic is very important to be done especially in the case of designing network architecture design, network management and network security[1]. The classification being conducted is based on the number of communication activity types. Communication activity in computer network is regulated in communication process using network protocol. The development of network protocol which identified into port number is developed based on the utilization of still developing communication model and standardized internationally. The amount of port numbers set in communication process is 65,536[2].

The number of network protocol utilization in a communication sometimes demanding the use of communication priority such as throughput quality, delay time, reliability and communication security[3]. The utilization of service priority is often called in the term of QoS. The basis of providing QoS priority is by analyzing of a Network Traffic. Network Traffic or Internet Traffic is data communication traffic in a network marked with one set of statistical flow by the implementation of a structured pattern[4]. The referred structure pattern is information from the header of communication information data. Wireshark application is a reliable application in case of network traffic capture[5]. The outcome of network traffic capture consists of network traffic record from communication transaction running in computer network.

The utilization of Naïve Bayes method in the research conducted is used as classification method for network traffic. The outcome of classification process will be used as a reference in determining to provide QoS for Network protocol which often used in network communication..

II.

THEORETICAL

BASE

2.1. Naïve Bayes Classification

Bayes method is a statistical approach to perform induction inference in issue of classification. This method is using conditional probability as its basis. Naïve Bayes Classification Method is based on Bayes theorem, with an assumption that the effect of attribute value in certain class is not dependent on the value of other attributes. This assumption is often

called as “independent feature model”. The probability

for conditional model classifier is as follows:

The conditional above is a dependent class variable C with a small amount of outcome or class, depends on several variables of F1 until Fn features. So that the writing of Bayes theory is:

or

Naïve conditional independent assumption has played a role. Considering that each Fi feature is

(6)

Sudarma, M et al Int. Journal of Engineering Research and Applications

www.ijera.com

ISSN : 2248-9622, Vol. 3, Issue 6, Nov-Dec 2013, pp.1286-1290

www.ijera.com 1287 |P a g e conditionally independent to the other feature Fj for j

≠ i. It means that:

for i ≠ j, so that joint model can be stated as:

It means that under independent assumption above, conditional distribution of variable class C can be

stated like this:

where Z (evidence) is a scale factor depended only on F1,…Fn, namely a constant if the value of feature

variable is known.

The model of this form far more easily to be managed, since they are divided into class prior p(C) and independent probability distribution p(Fi฀C). If there is k classes and if model for each p(Fi฀C = c) can be stated in parameter form, then appropriate Bayes naïve model has the parameter of (k – 1) + n r k. In practice, often k = 2 (binary classification) and r = 1 (Bernoulli variable as a feature) in general, so that the number of Naïve Bayes parameter model is 2n + 1, where n is the number of binary features used for classification and prediction.

2.2. Quality of Service (QoS)

QoS in Computer Network is referred to providing quality priority for communication process occurred in computer network. The providing of quality is based on priority setting. QoS utilization is often implemented on communication protocol in computer network which often called as Network protocol. In general the final purpose of QoS is to provide a better and planned network service with dedicated bandwidth jitter and latency which under control and increasing characteristic loss[6].

2.3. Network Traffic

Measurement and analysis of network traffic is important to do in order to get the knowledge regarding network traffic characteristic[7]. Normally a network traffic data has information such as:

a. IP Address: IP Address is often known as computer address. This computer address serves as a computer identity in a network communication. This address is divided into two parts which is as source and destination identity. IP Source Address is the source address which is identical with the sender when data communication process occurred. Whereas IP Destination Address is the destination address that identical with the data receiver in data communication process.

b. Protocol: Protocol is the rule applied in data communication process which the processing is identified based on the type of service. Each protocol running will be named according to the

process performed in communication process of computer network. Examples of communication protocols including tcp, udp, http, ftp, icmp protocol, etc.

c. Length: Length is the size of data which running in computer network. The common size used in network traffic is in the form of byte.

Network traffic itself can be displayed in the form of raw data (data in the form of traffic record as in a result) or in the finished form (as in the form of graph).

2.4. Wireshark

Wireshark is one of many Network Analyzer tools that widely used by Network Administrators to analyze their network performances. Wireshark is widely preferable due to its interface which using Graphical User Interface (GUI) or graphic display. Wireshark is used for network troubleshooting, software analysis and communication protocol development, and in education. Wireshark is widely used by network administrators to analyze their network performance. Wireshark is able to capture data/information which passing over a network that being observed in the form of network traffic. The benefits of using Wireshark application are as follows: a. Capturing packet of information or data sent and

received in computer network.

b. Knowing the activities that occurring in computer network.

c. Knowing and analyzing our computer network performance such as access speed/data sharing and network connection to internet.

d. Observing the security of our computer network. Some information that can be captured by wireshark tool as network traffic information among others are time elapse (the time recorded in certain period), source address (of data sender, constitutes of IP address or mac address), destination address (of data sender, constitutes of IP address or mac address), protocol (service running in computer network), length (data size that being sent), and info (additional information of each service running in computer network).

III.

METHODOLOGY

3.1. Network Capturing

(7)

Sudarma, M et al Int. Journal of Engineering Research and Applications

www.ijera.com

ISSN : 2248-9622, Vol. 3, Issue 6, Nov-Dec 2013, pp.1286-1290

www.ijera.com 1288 |P a g e

Figure 1. The capturing of network traffic.

3.2. Data Filtering

Data Filtering is performed by selecting the data which will be used as a classification counting process. Record format from the result of network capture constitutes raw data which is processed and filtered using Microsoft SQL Server 2008. Counting is conducted to calculate the same traffic records. In data filter, field being counted later using Naïve Bayes method only employs Protocol, Length and total counting fields. From the information of the three fields above, it can be used as a reference in QoS setting by analyzing the number of counting.

3.3. Naive Bayes Classification

Network traffic data which is being mapped in Naïve Bayes method as a classification class is as follows:

a. Protocol types in class label including the types of ARP, DHCP, DHCPv6, DNS, HTTP, ICMP, ICMPv6, IGMP, MDNS, MNDP, NBNS, NTP, SSDP, SSHv2, TCP.

b. Length Ranges are 0 – 32, 33 – 64, 65 – 128, 129 – 256, 257 – 512, 513 – 1024, 1025 – 2048. c. Counting Ranges are 0 – 500, 501 – 1000, 1001 –

1500, 1501 – 2000, 2001 – 2500, 2501 – 3000, x > 3000.

d. QoS priorities as the setting of network traffic result including low, intermediate, and high priority.

The number of classifications which representing classification based on priority are at 105 classes. The following is the examples of label classes based on priority.

Table 1. The sample of label classes.

From the classification classes above we perform Naïve Bayes counting on sample data of the Network Traffic records.

Table 2. Sample data of network traffic.

Sample data calculation to P is as follows:

 P (Low) : 33/105 = 0.3142857.

 P (Intermediate) : 60/105 = 0.57142857.

 P (High) : 12/105 = 0.11428571. P calculation to Protocol Names:

 P value (HTTP ! Low) : 2/33 = 0.06060606.

 P value (HTTP ! Intermediate) : 2/60 = 0.03333333.

 P value (HTTP ! High) : 3/12 = 0.25. P calculation to the Length:

 P value (1025 – 2048 ! Low) : 4/33 = 0.12121212.

 P value (1025 – 2048 ! Intermediate) : 6/60 = 0.1.

(8)

Sudarma, M et al Int. Journal of Engineering Research and Applications

www.ijera.com

ISSN : 2248-9622, Vol. 3, Issue 6, Nov-Dec 2013, pp.1286-1290

www.ijera.com 1289 |P a g e P calculation to the Counting:

 P value (x > 3000 ! Low) : 4/33 = 0.12121212.

 P value (x > 3000 ! Intermediate) : 6/60 = 0.12121212.

 P value (x > 3000 ! High) : 5/12 = 0.41666667.

The calculation of Low Posterior: 0.3142857 x 0.06060606 x 0.12121212 x 0.12121212 = 0.000279854804193053

The calculation of Intermediate Posterior: 0.57142857 x 0.03333333 x 0.1 x 0.12121212 =

0.000230880204906205

The calculation of High Posterior: 0.11428571 x 0.25 x 0.41666667 x 0.41666667 =

0.00496031735367063

Based on the calculation of Posterior value of each priority class can be seen that High Posterior has the highest value, so the calculation outcome with Naïve Bayes method results in High priority classification for Network Traffic Data in table 2.

IV.

RESULTS

AND

DISCUSION

Calculation trial in the research conducted is to perform calculation for 91 network traffic records which already having filtration in the stage of Data filtering where in raw data the number of traffic records which have not been filtered (to eliminate record duplication) reaching millions of records. In the table above, is mapped into priority graphics as follows:

Figure 2. The number of protocols based on priority.

Figure 3. The number of length ranges based on priority.

Figure 4. The number of counting range based on priority.

In Figure 2, it can be seen that the result of network traffic classification from protocol side toward the priority explaining that HTTP protocol has an intermediate up to high priority in QoS setting. Figure 3 shows that Length which can be given intermediate priority is for communication which having data size from 1025 – 2048 bytes and 513 – 1024. Meanwhile for high QoS priority is for data transmitting communication in the size of 1025 – 2048 bytes. Figure 3 shows that QoS priority can be given for the same communication process which having many appearances of more than 3000 records. So that in network management implementation, QoS policy can be done based on priority toward protocol, length size and counting (the number of equal activity record data).

V.

CONCLUSION

A conclusion Based on the test result so it can be concluded that the utilization of Naïve Bayes method is able to show the classification result based on the using of Protocol, the size of data communication which being transmitted in the form of byte length, and also based on counting (records which often appeared as activities). The classification yielded is a reference for the policy of QoS utilization by a network administrator.

The expansion of the research conducted in this study, can be maximized by the use of Scheduling Process. So that the result of the classification obtained is based on the requirement of QoS priority based on the time pattern of communication activity.

REFERENCES

[1] Yuhai Liu, Zhiqiang Li, Shanqing Guo, Taiming Feng: Efficient, Accurate Internet Traffic Classification using Discretization in Naive Bayes. ICNSC 2008: 1589-1592 [2] IANA Port Numbers, http://www.iana.org/

assignments/port/numbers. Diakses tanggal 24 September 2013

[3] Stephen S. Yau, Yin Yin: QoS-Based Service Ranking and Selection for Service-Based Systems. IEEE SCC 2011: 56-63

[4] Jun Zhang, Chao Chen, Yang Xiang, Wanlei Zhou, Yong Xiang: Internet Traffic Classification by Aggregating Correlated Naive Bayes Predictions. 5-15

Medium Medium High

Medium Medium High

(9)

Sudarma, M et al Int. Journal of Engineering Research and Applications

www.ijera.com

ISSN : 2248-9622, Vol. 3, Issue 6, Nov-Dec 2013, pp.1286-1290

www.ijera.com 1290 |P a g e [5] SecTools.Org: Top 125 Network Security

Tools, http://sectools.org. Diakses tanggal 24 September 2013

[6] Fatoni. (2012). Analisis Kualitas Layanan Jaringan Intranet (Studi Kasus Universitas Bina Darma). Diperoleh (tanggal akses 25-9-2013)

darihttp://blog.binadarma.ac.id/fatoni/wp-content/uploads /2011/04/Jurnal-QoS.pdf [7] Jiqing Liu, Jinhua Huang.2010.Broadband

Network.

BIBLIOGRAPHY

OF

AUTHORS

Dr. Ir. Made Sudarma, M.A.Sc.

Computer System and Informatics, Department of Electrical Engineering, Udayana University

Dandy Pramana Hostiadi, S.T.

Gambar

Table 1. The sample of label classes.
Figure 4. The number of counting range based on

Referensi

Dokumen terkait

CONCLUCION Based on the results of the classification by the Naive Bayes algorithm to carry out a sentiment analysis for halal tourism on Lombok island which is divided into 2

Conclusion Based on the results of research that has been done, the conclusions obtained from this study are Classification of data on degrees of hearing loss for mild, moderate,

Naïve Bayes Comparison Classifier Precision Recall F1-Score Gaussian Naïve Bayes 50.36% 80.22% 61.84% Multinomial Naïve Bayes 96.62% 76.67% 85.50% Based on the analysis in

Based on the table above, predicting student graduation on time using the Naïve Bayes method has a higher accuracy value compared to the accuracy value using the C4.5 algorithm method

Based on the above problems, predicting student graduation by utilizing data mining algorithm naive Bayes classification method using rapid miner tools where the purpose is to be able

Naïve Bayes Algorithm Accuracy Label Accuracy Sentiment Positive and Negative 73,08% Service Quality Dimensions 44,79% The experiment's results also showed that the model

Based on the test results that have been carried out, in classifying helpdesk tickets, by comparing the use of the Naïve Bayes algorithm with the Support Vector Machine, the results

This system is also equipped with sentiment analysis process using Naïve Bayes which is optimized by the PSO method, based on the results of testing the sentiment analysis method of the