Sentiment analysis or opinion mining is the calculation of opinions, sentiments and subjectivity of text. In this report, we look at the various challenges and applications of sentiment analysis. We will discuss in detail various approaches to perform a computational treatment of sentiments and opinions.
Various supervised or data-driven techniques for SA such as Naïve Byes, Maximum Entropy, SVM and Voted Perceptrons will be discussed and their strengths and weaknesses will be touched upon. We will also look at a new dimension of sentiment analysis from Cognitive Psychology mainly through the work of Janyce Wiebe, where we will look at ways to reveal subjectivity, perspective in narrative and the meaning of discourse structure. We will also study some specific topics in Sentiment Analysis and contemporary work in those areas.
Introduction
Domain Dependency
In the first example, the sentiment conveyed is positive while the sentiment conveyed in the second is negative. The third example has a positive feeling in the field of the book, but a negative feeling in the field of the film (where the director is asked to go and read the book).
Thwarted Expectations
Pragmatics
World Knowledge
Subjectivity Detection
Entity Identification
Negation
The AAC mood score is calculated by iteratively changing the adjective's score as each adverb is added to it. To calculate the sentimental value of the AAC, a score is associated with it based on the score of the adjective and adverb. There are certain axiomatic rules that determine the way in which adverbs modify the sense of an adjective.
Thus, the resulting score of the AAC is the score of the adjective appropriately adjusted with the effect of the adverb. They give priority to adjectives over the adverbs and modify the score of the adjective with a weight r. This can be exploited by creating several classes of features based on the SO values of phrases given their position in relation to the topic of the text.
The average ACT value of all adjectives in a text k. The average EVA value of all adjectives that share a sentence with the subject of the text l. The average POT value of all adjectives that share a sentence with the subject of the text m. The average ACT value of all adjectives that share a sentence with the subject of the text.
Discourse-Level Analysis
Subjective sentences portray a character's thoughts - represented thoughts or present a scene that the character perceives - represented perception, private state such as seeing, wanting or feeling sick - that is a perceptual, psychological or experiential state not open to objective observation or verification. For narrative understanding, tracking the POV is absolutely critical, as it distinguishes between characters' beliefs and the facts of the story. For example, the use of a full noun when a pronoun would have sufficed indicates that there has been a change in POV.
A character's private state reports, which state whether the subjective character is sick or angry, can only be reported. Now, if "John was furious" is the subjective statement of a character Mary, it is only her thought or opinion represented about John. Thus, the distinction between private state report and represented thought is essential to the elaboration of the discourse.
This is because since the subjective character is always the subject of a private state report, pronouns can be used to refer to him despite references to another entity of the same number and gender. However, in a represented thought, the referent of a pronoun can be someone in a previously represented thought. Here the last sentence is objective but the preceding sentences are subjective sentences by Dwayne.
He' in the last sentence refers to Dwayne, even though John is the last entity mentioned earlier. This suggests that there has been a change in POV and that a discourse analysis is needed to detect it.
Subjective Contexts
In conversation, they are "generally associated with the speaker's emotional involvement with the subject of his utterance" (Lakoff in third-person narrative, they are associated with the subjective character's emotional involvement with the subject of his thoughts or perceptions. 4.4 After all, Aunt Margaret had her own problems - she was the mother of the terrible dill.
Identifying a Subjective Character
If it is not identifiable, then it is one of the 2 previously mentioned characters- 1. the subjective character of the previous subjective sentence. It either continues the POV of the subjective character or resumes it. actor of an action indicated by a previous objective sentence Examples:. But the last subjective character is Jake, so August's point of view is initiated, not simply resumed or continued.
In order to identify the subjective character, it is necessary to follow the expected subjective signs encountered. However, drastic spatial and temporal discontinuities can block the continuation or restoration of a character's POV. An exception is when the subjective sentence is followed by a private state report without a paragraph break, where the subject is different from the subjective character.
It is a private-state sentence and the subjective character, Call, is the experiencer of the marked private state. The subjective character of the private state report is always the experiencer which can be determined directly from the sentence. If the private state report is shown to be a represented opinion, then the subjective character is the expected subjective character.
This is so even though (8.3) contains the subjective element 'old bag' and even though there is an expected subjective character (the girl) when we meet him. Since the 'old bag' appears in the context of the private-state expression 'believe', it is not taken into account in the identification of the subjective character. On the other hand, the subjective element 'obviously' in (7.1) is not in the scope of 'realisation' (ie not subordinated) and can therefore be used to identify subjective character.
Identifying Perspective in Narrative
The Enemy' is an anaphoric reference occurring in a subjective context (established by (9.5), a psychological report). It reflects Zoe's belief that the man is an enemy spy, although it is not at all clear to the reader that he is. Personal pronouns can also reflect a character's beliefs. someone', 'something', 'someone' refer to certain people, things, etc., without identifying them.
A character's thoughts and perceptions are not directed toward an addressee, and so the first of these uses is the predominant one in subjective contexts. There is no explicit statement in the novel that Zoe does not know who she touched; this must be inferred from the use of "someone". Certain references are used only if the speaker believes that the addressee has sufficient information to interpret them.
Specific indefinite references are used in a subjective context when the referent is unfamiliar with the subjective character. However, the referent must not be unknown to the reader or to the other characters. 11] There they [the king and his men] saw a great ruin right next to them; and suddenly they became aware of two small figures reclining on it, clad in gray, barely visible among the stones.
The reader knows that the king and his men came across two hobbits, Merry and Pippin. The King and his men do not know the hobbits, but other characters who are also present in the scene do know them. When the King and his man are on top of the CP (after 'seen' and continued by 'was aware of'), the hobbits are not referred to but as 'two small figures'.
Thus, new referents are created and in the space of belief of the king and his men, suggestions are built that they are small figures. The new referents can be argued to be co-extensional with the concepts that the reader and other characters believe to be called "Merry" and "Pippin".
Evaluation
SVMs were used for feature vector classification and IWSD was used for automatic WSD. Extensive experiments were done to compare the performance of the 4 feature representations with the lexeme representation. Banerjee & Pedersen (2003) assign connectivity scores based on the overlap between the brightness of the two concepts.
Patwardhan (2003) used a vector representation of gloss based on the context vector of gloss expressions. Luhn (1958) attempts to find the highest ranked meaningful sentences based on the frequency of content words present in them. It provides a detailed overview of the various applications and potential challenges of sentiment analysis that make this task difficult.
It has been seen that adjectives as word features can capture most of the sentiment. Argamon, Using Rating Groups for Sentiment Analysis, In Proceedings of the ACM SIGIR Conference on Information and Knowledge Management (CIKM), pp. Grishman, R., Adaptive information extraction and sublanguage analysis, In Proceedings of the 17th International Joint Conference on Artificial Intelligence , 2001.
Wiebe, Identificeren van subjectieve karakters in verhaal, Proceeding COLING '90, Proceedings of the 13th conference on Computational linguistics - Volume 2, 1990. Rapaport, A computational theory of perspective and reference in narration, Proceedings of the 26th annual meeting on Association for Computational Linguistics , 1988. Becker, Opinion Summarization of Web Comments, Proceedings of the 32nd European Conference on Information Retrieval, ECIR, 2010.
Pang, Bo and Lee, Lillian and Vaithyanathan, Shivakumar, Thumbs up?: sentiment classification using machine learning techniques, In Proceedings of the ACL-02 conference on Empirical Methods in Natural Language, 2002. Pang, Bo and Lee, Lillian, A Sentimental Education : Sentiment analysis using subjectivity summarization based on minimal cuts, Proceedings of the ACL, 2004 85. Church, Proceedings of the Third Workshop on Very Large Corpora, Association for Computational Linguistics, Somerset, New Jersey, pp.
Collier, Sentiment analysis using support vector machines with diverse information sources, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.
Sequence Kernel
Combining Sequential Kernels of Different Order