• Tidak ada hasil yang ditemukan

View of A REVIEW ON CLASSICAL AND MODERN METHODS OF SENTIMENT ANALYSIS

N/A
N/A
Protected

Academic year: 2023

Membagikan "View of A REVIEW ON CLASSICAL AND MODERN METHODS OF SENTIMENT ANALYSIS"

Copied!
4
0
0

Teks penuh

(1)

ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING Peer Reviewed and Refereed Journal (International Journal) ISSN-2456-1037

Vol.04,Special Issue 05, (ICIR-2019) September 2019, Available Online: www.ajeee.co.in/index.php/AJEEE

1

A REVIEW ON CLASSICAL AND MODERN METHODS OF SENTIMENT ANALYSIS Mohammad Danish

Research Scholar (Ph.D), College of Engineering Department of Computer Science & Engineering Dr. A.P.J Abdul Kalam University, Indore (M.P), India-452014

Abstract - In today's world of social media where people prefer to point their views on various social platform, sentiment analysis is the best technique to analyze people's opinion and choices for modern recommendation. Most of the people are proactively involved in posting comments and views regarding different issues. These issues may be related to some product, politics or environment. Sentiment analysis is an easy way to predict the possible outcomes of such issues. Sentiment analysis was earlier notified as opinion mining where it was limited to certain extent, however in modern computing it has broad dimensions and major applications to various domains. Sentiment analysis was earlier computed using some classical methods which were quite effective in certain fields with limited accuracy. Now a days there has been an enhancement introduced in these methods which involves the majority of machine learning algorithms responsible for improving accuracy than classical methods. In this paper we will discuss some traditional methods along with some modern techniques to explore the hidden potential of sentiment analysis applicability to computing and recommendation systems. It produces the polarity based outcomes which are quite easy to predict and to compute. Sentiment analysis works around three essential parameters of polarity, subject and opinion holder.

Keywords: Sentiment analysis, Machine learning, Recommendation, Computing.

1 INTRODUCTIO

Sentiment analysis is the study of analyzing the people's attitude and emotions for a particular entity. These entity may represent some individual, any event or some real time topic . Majority of the social media platforms are flooded by such different topics of people's interest.

People raise their voices and express their

views either in terms of reviews or comments [1,2,3]. The major part of the sentiment analysis is to extract the people's opinion in text and classify their polarity as labels accordingly. Following figure (Fig-1) depicts the classification process of sentiment analysis in general way.

Classification in sentiment analysis is done at three levels: document level, sentence level and aspect level. The main aim of document level is to classify the opinions extracted from the documents as

a positive or negative. In this methodology the entire document is considered as a basic unit of information and the output generated is also considered as a single entity. On the other hand at sentence

(2)

ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING Peer Reviewed and Refereed Journal (International Journal) ISSN-2456-1037

Vol.04,Special Issue 05, (ICIR-2019) September 2019, Available Online: www.ajeee.co.in/index.php/AJEEE

2 level the major objective is to parse the sentiments at sentence level or it can also be defined as sentence analysis in each sentence. Prior classification it is important to identify whether sentence is subjective or objective. Wilson et al. [2]

finds out that sentiments related expressions are not subjective in nature.

Aspect level classification deals with sentiments analysis with respect to aspects of entities. Sometimes each sentence generate more than two aspect and outcome of polarity could be of dual nature. For example, "The student is not good at English, but he will have bright future". All of these methods are engaged according to use of organisation and the extent at which the sentiments analysis is to be discovered [4,5].

2 CLASSICAL METHODS A. Bag of Words

One of the most promising methods of NLP in sentiments analysis involves Bag- of-Words. Generally a bag of words consists of all unique words in a document and their frequencies of occurrence. In the language of mathematics you can consider it as the set of words which do not have duplicates value. Usually we avoid the sequences of words and focuses on the frequency of repetition of these words [7]. Such representation need a training set to identify the polarity of each sentence in a review of document and it is called a Corpus.

Once the corpus is ready then it becomes quite easy to identify the polarity of target review and its polarity. On the behalf of compared value we can easily define the predicted outcomes. These outcomes may be differ than actual status, so for every outcome we measure the accuracy factor for every output set corresponding to every input set [8,9].

B. Stemming

Stemming process involved the cutting off the end and beginning of the word considering the list of common prefixes and suffixes that may be found in an inflected word. The stemming approach works for the maximum events along with some limitation. Under mentioned are few examples of stemming [13].

C. Lemmatization

Lemmatization process involves the morphological analysis of words. To proceed such activity we need detailed dictionaries so the necessary algorithm need to create a link to form back to its lemma. Following is an example of lemmatization [15].

Above table easily depicts the word form and morphological information derived from it. Finally we can easily detect the lemma as final output.

3 MODERN METHODS A. Machine learning approach

Modern methodologies involved machine learning where the predicted output value is better than classical based methods. In this approach we usually use the supervised learning techniques are which are quite effective due to availability of highly trained data set. Another approach is probabilistic classifier which consists of mixture models. some of important classifiers are as follows [17,18]:

3.1 Naive Bayes Classifier

The Naive bayes classifier is a simple technique which is commonly used in initial phases. It usually computes the posterior probability of class .

3.2 Bayesian Network

A Bayesian network model is a acyclic graph whose nodes are variables and edges are presented as conditional dependencies. During the text classification computational complexities in Bayesian network become quite expensive [19].

(3)

ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING Peer Reviewed and Refereed Journal (International Journal) ISSN-2456-1037

Vol.04,Special Issue 05, (ICIR-2019) September 2019, Available Online: www.ajeee.co.in/index.php/AJEEE

3 3.3 Support Vector Machine Classifier (SVM)

SVM are best suited for text based classification due to the sparse nature of text where few are irrelevant in nature and are closely correlated to each other.

SVM are used in text classification and for categorizing the reviews based on their quality[15,18].

3.4 Neural Network

Neural networks uses the neuron structure for counting word frequencies in the document. Linear function of a neural network is Pi=A . Xi. [22] One of the advantage of using neural network is that it minimizes the error a lot in comparison to other techniques.

3.5 Decision Tree Classifier

Decision tree classifier provides a hierarchical decomposition of the training

data space

in which a condition on the attribute value is used to divide the data. The division of the data space is done recursively until the leaf nodes contain certain minimum numbers of records which are used for the purpose of classification [21].

3.6 Rule Based Classifiers

In case of rule based classifiers the data space is provided with a pre defined set of rules. The left hand side represents a condition on the feature set expressed in disjunctive normal form while the right hand side is the class label. Lot of disperse methods are available to generate the rules. The major and two common criteria are support and confidence. The support is the absolute number of instances in the training data set which are relevant to the rule. The Confidence refers to the conditional probability that the right hand side of the rule is satisfied if the left-hand side is satisfied.

3.7 Reinforcement and Unsupervised Learning

Using a pre defined labeled training data set is easy to use to accurately classify the unknown data. However sometime we encounter certain data sets where no training data set is available to us. Such kind of learning is known as un- supervised learning. In this approach un- classified documents are arranged in

clusters where certain polarity is analyzed[24].

B. LSTM Approach

Long Short-Term Memory Recurrent Neural Network model is very popular to solve natural language based problems. It reads the sentence from the first word to the last one. And it tries to figure out the sentiment after each step. For example, for the sentence “The food sucks, the wine was worse.”. It will read “The”, then

“food”, then “sucks”, “the” and “wine”. It will keep in mind both a vector that represents what came before (memory) and a partial output. For instance, it will already think that the sentence is negative halfway through. Then it will continue to update as it processes more data [23].

3.8 LSTM Network

This RNN structure looks very accurate for sentiment analysis tasks. It performs well for speech recognition and for translation. However, it slows down the evaluation process considerably and doesn’t improve accuracy that much in our application so should be implemented with care.

4 CONCLUSION

This paper has presented few basic classical and modern methods of approaches used in sentiments analysis and in Natural language processing. Lot more methods are yet to explore especially in field of statistics.

REFERENCES

1. Tsytsarau Mikalai, Palpanas Themis.

Survey on mining subjective data on the web. Data Mining knowledge discovery 2012;24:478–514.

2. Wilson T, Wiebe J, Hoffman P. Recognizing

contextual polarity

in phrase-level sentiment analysis. In:

Proceedings of HLT/EMNLP; 2005.

3. Liu B. Sentiment analysis and opinion

mining. Synth Lecture

Human Lang Technol 2012.

4. Yu Liang-Chih, Wu Jheng-Long, Chang Pei- Chann, Chu Hsuan-Shou. Using a contextual entropy model to expand emotion words and their intensity for the

(4)

ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING Peer Reviewed and Refereed Journal (International Journal) ISSN-2456-1037

Vol.04,Special Issue 05, (ICIR-2019) September 2019, Available Online: www.ajeee.co.in/index.php/AJEEE

4

sentiment classification of stock market news. Knowl-Based Syst 2013;41:89–97.

5. Michael Hagenau, Michael Liebmann, Dirk Neumann. Automated news reading: stock price prediction based on financial news using context-capturing features. Decis Supp Syst; 2013.

6. Tao Xu, Peng Qinke, Cheng Yinzhao.

Identifying the semantic orientation of terms using S-HAL for sentiment analysis.

Knowledge Based System 2012;35:279–89 7. Maks Isa, Vossen Piek. A lexicon model for

deep sentiment analysis and opinion mining applications. Decis Support System 2012;53:680–8.

8. Pang B, Lee L. Opinion mining and sentiment analysis. 2008.

9. Cambria E, Schuller B, Xia Y, Havasi C.

New avenues in opinion mining and sentiment analysis. IEEE Intell System 2013;28:15–21.

10. Feldman R. Techniques and applications for sentiment analysis. ACM,2013;56:82–9.

11. Montoyo Andre´ s, Martı´nez-Barco Patricio,

Balahur Alexandra.

Subjectivity and sentiment analysis: an overview of the current state of the area and envisaged developments. Decision Support System 2012; 53: 675–9.

12. Qiu Guang, He Xiaofei, Zhang Feng, Shi

Yuan, Bu Jiajun, Chen

Chun. DASA: dissatisfaction-oriented advertising based on sentiment analysis.

Expert Syst Appl 2010;37:6182–91.

13. Lu Cheng-Yu, Lin Shian-Hua, Liu Jen- Chang, Cruz-Lara Samuel, Hong Jen-Shin.

Automatic event-level textual emotion sensing using mutual action histogram between entities. Expert System, 37:1643–

53.

14. Neviarouskaya Alena, Prendinger Helmut, Ishizuka Mitsuru Recognition of Affect, Judgment, and Appreciation in Text. In:

Proceedings of the 23rd international conference on computational linguistics (Coling 2010), Beijing; 2010. p. 806–14.

15. Bai X. Predicting consumer sentiments from online text. Decision Support Syst 2011;50:732–42.

16. Zhao Yan-Yan, Qin Bing, Liu Ting.

Integrating intra- and inter document evidences for improving sentence sentiment classification. Acta Automatica Sinica 2010;36(October’10).

17. Yi Hu, Li Wenjie. Document sentiment classification by exploring description model of topical terms. Computer Speech Lang 2011; 25:386–403.

18. Cao Qing, Duan Wenjing, Gan Qiwei.

Exploring determinants of voting for the ‘‘helpfulness’’ of online user

reviews: a text mining

approach. Decision Support System 2011;50:511–21.

19. He Yulan, Zhou Deyu. Self-training from labeled features for sentiment analysis. Inf Process Manage 2011;47:606–16.

20. Tan Songbo, Wu Qiong. A random walk algorithm for automatic construction of domain-oriented sentiment lexicon. Expert Syst Appl 2011:12094–100.

21. Tan Songbo, Wang Yuefen. Weighted SCL model for adaptation of sentiment classification. Expert Syst Appl 2011; 38:

10524–31.

22. Qiong Wu, Tan Songbo. A two-stage framework for crossdomain sentiment classification. Expert Syst Appl 2011;38:14269–75.

23. Jiao Jian, Zhou Yanquan. Sentiment

Polarity Analysis based

multi-dictionary. In: Presented at the 2011 International Conference on Physics Science and Technology (ICPST’11); 2011.

24. Lambov Dinko, Pais Sebastia˜ o, Dias Ga˜

el. Merged agreement

algorithms for domain independent sentiment analysis. In: Presented at the Pacific Association for, Computational Linguistics.

Referensi

Dokumen terkait

ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING Peer Reviewed and Refereed Journal International Journal ISSN-2456-1037 Vol... ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING

ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING Peer Reviewed and Refereed Journal International Journal ISSN-2456-1037 Vol... ACCENT JOURNAL OF ECONOMICS ECOLOGY & ENGINEERING