Comprehensive Sentiment Analysis of Religious Content Naive Bayes Algorithm Model

(1)

Comprehensive Sentiment Analysis of Religious Content Naive Bayes Algorithm Model

Hersatoto Listiyono^1,*, Zuly Budiarso¹, Susi Susilowati², Agus Perdana Windarto³

1Prodi Manajemen Informatika, Universitas Stikubank, Semarang, Indonesia

2Prodi Sistem Informasi Kampus Kota Bogor, Universitas Bina Sarana Informatika, Jakarta, Indonesia

3Prodi Sistem Informasi, STIKOM Tunas Bangsa, Pematangsiantar, Indonesia

Email: ^1,*[email protected], ²[email protected],³[email protected],

4[email protected]

Correspondence Author Email: [email protected]

Abstract−This paper delves into sentiment analysis of online religious content utilizing the Naive Bayes algorithm to decipher the array of sentiments present in religious discussions. By tailoring this algorithm to the complexities of religious language, the study reveals hidden sentiments, offering valuable insights for researchers, policymakers, and communities. The findings demonstrate that the sentiment analysis model performs robustly, with a precision of 84.78%, a recall of 82.98%, and a balanced F1 Score of 83.87%, indicating high accuracy in sentiment identification and effectiveness in capturing a significant portion of actual sentiments. The overall accuracy of the model stands at 75.10%, affirming its successful adaptation to the intricacies of religious discourse. These results not only deepen our understanding of sentiment analysis in the realm of faith and spirituality but also have practical implications for enhancing interfaith dialogue, fostering mutual understanding, and guiding decision- making in religious and social organizations. This research makes a significant contribution to the growing field of sentiment analysis, providing a methodological framework for exploring the nuanced sentiment landscape within the domain of faith and spirituality.

Keywords: Sentiment Analysis; Naive Bayes Algorithm; Religious; Twitter; Interfaith Dialogue; Digital Humanities; Faith;

Social Implications

1. INTRODUCTION

In the digital age, social media platforms and online forums have become hubs for expressing diverse opinions, beliefs, and sentiments [1], [2]. One prominent area of discussion on these platforms revolves around religious content, where individuals share their thoughts, beliefs, and emotions related to various religious topics. Religion is an important dimension in human life that includes beliefs, values, and spiritual practices that shape a person's identity and life orientation.

This is a very personal and in-depth aspect, guiding individuals in understanding the meaning of life, existential goals, and the moral ethics that govern their behavior. Religion is often manifested through involvement in worship practices, rituals, and the development of spirituality. Religious content refers to any form of information, material or expression related to religious beliefs and spirituality. This can include religious writings, religious speeches, religious services, scriptures, and more. Religious content has a significant role in spreading religious values, strengthening religious identity, and facilitating spiritual growth in communities[3], [4].

In society, the lives of religious communities can be reflected in various aspects. First of all, religion plays a key role in shaping the moral and ethical norms applied in society. Religious people may have ethical guidelines provided by their religious teachings to guide daily behavior. In addition, the lives of religious communities can be reflected in their involvement in social and humanitarian activities, with many religious communities active in providing assistance to those in need.

Apart from that, interactions between religious communities also play an important role in building tolerance and respect for diversity. Although there may be differences in beliefs, religious communities often work together to create an environment that is mutually supportive and encourages cooperation. Therefore, the diversity of religious life in society is an integral part of social balance and harmony between individuals with different religious backgrounds [5].

Understanding the sentiments expressed in religious content is crucial for researchers, policymakers, and communities alike, as it provides valuable insights into the public's attitudes, opinions, and concerns [6]. Despite the vast amount of religious content available online, comprehensively analyzing the sentiments expressed within it remains a challenging task. Traditional methods of sentiment analysis often struggle to handle the complexity and nuances present in religious discourse. This is where advanced techniques, particularly those from the realm of artificial intelligence, come into play.

In recent years, machine learning algorithms have proven to be effective in deciphering complex patterns within textual data, making them invaluable tools for sentiment analysis [7]. The Naive Bayes algorithm, a probabilistic machine learning model, has garnered significant attention in the field of natural language processing.

Its simplicity and efficiency make it an appealing choice for sentiment analysis tasks[8], [9]. However, applying the Naive Bayes algorithm to religious content brings about its own set of challenges. Religious discourse often involves figurative language, metaphors, and context-specific meanings, making it distinct from other forms of communication. Consequently, adapting the Naive Bayes algorithm to effectively analyze sentiments within religious content requires a nuanced approach [10]. In previous research conducted by [11] with the title

(2)

"Sentiment Analysis Using Naive Bayes Algorithm Of The Data Crawler: Twitter" it was obtained from the results of this research that the positive sentiment polarity value for the Jokowi-Ma'ruf Amin pair was 45.45% and the negative value was 45.45%. 54.55%, while the Prabowo-Sandiaga pair received a positive sentiment score of 44.32% and negative. 55.68%. Then the combined data was tested from the training data used by each presidential candidate and obtained an accuracy of 80.90% ≈ 80.1%.

In this study, a comparison was carried out using the Naïve Bayes, SVM and K-Nearest Neighbor (K-NN) methods which were tested using RapidMiner, producing a Naïve Bayes accuracy value of 75.58%, an SVM accuracy value of 63.99% and a K-NN accuracy value of 73. 34%. Meanwhile, the next previous research was conducted by [8] with the title "Twitter Sentiment Analysis as an Evaluation and Service Base on Python Textblob", this research classified tweets with the keywords indihome, myindihome, useetv, and wifi.di. Next, several data preprocessing techniques, sentiment analysis, and visualization were carried out in the form of histograms, pie charts, and word clouds. Of the 3324 tweets analyzed, 34.4% of the tweets were positive, 16.1%

of the tweets were negative, and 49.6% of the tweets were neutral. In this article, we delve into the realm of sentiment analysis in religious content, aiming for a comprehensive understanding of the various sentiments expressed by individuals across different religious discussions [12]. We focus specifically on harnessing the power of the Naive Bayes algorithm to navigate the intricacies of religious language and uncover underlying sentiments[13], [14]. By developing a tailored Naive Bayes model, we address the unique challenges posed by religious discourse and pave the way for a nuanced analysis of sentiments in this domain [15], [16].

This study not only contributes to the academic understanding of sentiment analysis but also holds practical implications for diverse fields, including sociology, religious studies, and digital humanities. By gaining insights into the sentiments prevalent in religious content, we can foster better interfaith dialogue, promote understanding among diverse communities, and facilitate more informed decision-making processes for religious and social organizations[17]. In the following sections, we will explore the methodology employed in adapting the Naive Bayes algorithm for religious sentiment analysis, present our findings, and discuss the implications of this research.

Through this comprehensive analysis, we aim to shed light on the intricate tapestry of sentiments within religious content, offering a valuable resource for researchers and practitioners seeking a deeper understanding of the human experience in the realm of faith and spirituality.

2. RESEARCH METHODOLOGY

2.1 Related Work

a) Sentiment Analysis in Textual Content an integral aspect of natural language processing (NLP), has witnessed significant advancements in recent years. Researchers have explored various techniques, including rule-based approaches, machine learning, and statistical algorithms, to discern emotional tones and opinions embedded in textual content. The evolution of sentiment analysis methodologies provides a foundational backdrop for understanding the current landscape of techniques applicable to diverse domains[18].

b) Naive Bayes Algorithm in Sentiment Analysis Among the diverse algorithms employed in sentiment analysis, the Naive Bayes algorithm has proven to be particularly effective. Leveraging probabilistic principles, Naive Bayes classifiers have demonstrated notable success in text classification tasks, making them a popular choice for sentiment analysis applications. Previous studies showcase instances where Naive Bayes has been adeptly applied to analyze sentiment in various contexts, highlighting its versatility and reliability in sentiment classification tasks [19], [20].

c) Religious Content Analysis introduces unique challenges due to the distinct nature of religious language, cultural nuances, and diverse interpretations. Researchers have delved into the intricacies of sentiment within religious texts, considering the impact of religious vocabulary and the complexity of conveying emotions in a context steeped in cultural and spiritual significance. Understanding the nuances of sentiment analysis within religious content is pivotal for the development of effective models tailored to this specific domain [21].

d) Hybrid Approaches in Sentiment Analysis combining different methodologies such as statistical techniques and machine learning models, have emerged as promising solutions to enhance the accuracy and robustness of sentiment analysis systems. Researchers have explored the integration of diverse techniques, seeking to capitalize on the strengths of each approach. These hybrid models present an opportunity to address the challenges posed by the intricate nature of sentiment analysis, offering improved performance and adaptability across varied content types, including religious discourse [11], [22]–[24].

Several processes are performed in this word processor, firstly we collect data, in this study we using data tweets are collected from Twitter social media by using a crawler. Furthermore, we parse the tweets are get by describing it verbatim. Hereinafter, we do the tokenization process that is cleaning the tweet and selecting the meaningful words. Then, we do text mining using naïve bayes method.

2.2 Method

This research uses a flowchart methodology as shown in Figure 1. Figure 1 is a flowchart from a sentiment analysis program as a whole. The detailed steps in the research is explained as follows:

(3)

Start Scrapping

Tweets Data Tweets

Data Collection

Data Training

Data Testing

Case Folding Remove URL,

@

Remove Short Word (a,is,so

etc..) Tokenzation

Clean Data Tweets

Preprocessing Data

Show All Wordcloud Comprehensive

Sentiment Analysis Show

Histogram Racist Tweet Show

Histogram NonRacist tweet Result Model

Data Visualization

Multinomial Naive Bayes Classifier

Figure 1. Research Methodology Flowchart

In Figure 1, the research workflow is carried out, which involves collecting data in one day, which is then collected and compiled into one. Once the data is collected, the classification process begins. Using the textblob library, the goal of this step is to differentiate positive, negative, and neutral data. after that, the pre-processing step starts by cleaning the tweet data. Text pre-processing includes symbol removal (URLs, special characters), negation handling (word standardization), stemming (removal of affixes and suffixes), tokenization (separating a series of words in a sentence), and case folding (converting to lowercase). Then the final step is data visualization that shows these three types. There are histograms, pie charts, and word clouds.

1. Data Collection:

The first step involved the collection of a diverse dataset comprising religious content from various online platforms, including social media networks, forums, blogs, and religious websites. The dataset encompassed discussions from different faith traditions to ensure a comprehensive analysis of sentiments across religions.

Table 1. Tweet Data for Training

id label tweet id label tweet

1 0 Cricket is a religion not just in India but

in whole subcontinent. 6 1 â€œ is a religion of peaceâ€•

2 0

DEADLINE: NOVEMBER 30 2023 â€

@PrimeProgressng seeks applications for its Religion for Change Fellowship aimed at training and supporting Nigerian/African journalists. 10 successful applicants wi...

11 1

@QamareAlam21 @TheSyedHaq And there has been many new religions along the way and most of them is like sub-part of Sanatan Dharm Jain. Buddha. Sikh As all these religions celebrate the existence ...

3 0

@Ade__Ayomide @JakesOlasupo

@GazetteNGR You're a big !diot you disrespect these ministers of God practicing their religion peaceful with accusations of hate and vengeance against non Christians.U ...

13 1 @TheSyedHaq Religion of peace.

ðŸ˜µâ€ðŸ’« https://t.co/n7MBQml3Jv

4 0

@OleOestlid @enur72 Value and purpose is subjective. You can empirically prove it by not finding two individuals with exactly the same choices in life. This has nothing to do with religion or beli...

16 1

Islam has no place in the west. We are being forced to call Nazism a religion.

Itâ€™s time to stop the insanity. There are a ton of rich Muslim countries. the only reason Muslims are in the west i...

5 0

@_NavyBrat There you have it folks.

the religion of peace. The enemy is inside the gates.

18 1

@metpoliceuk The final stage is the establishment of a totalitarian Islamic theocracy. eradicating all other religions and freedoms.

7 0

@PoppetSaab @ZakHu18111965

@tathagata2 @TheSyedHaq I don't condone disrespecting others religion. its

19 1

@NdabeniMzukisi @eNCA Itâ€™s patheticâ€¦the ANC having nothing to do with any sporting. financial or any other

(4)

id label tweet id label tweet a sin to do that. but if your going to

mention flying donkey or whatever which isn't even accu...

gains created by good honest South Africans of all colour. religion or creed.

Voetsek...

8 0

@Bellius_27 Religion in Europe. caused more wars than anything. for example crusades. and it is responsible for more deaths than III reich and USSR combined. People are getting smarter and they tr...

21 1

And he has reverted to Islam. May Allaah reward Tasleem for his efforts &amp make you steadfast upon the religion.

Allaah is the one who guides whoever He wills &amp He has willed to grant you ...

9 0

Any religion that has nothing but palangis at the top is a major red flag.

Presidents going all the way to the beginning of the cult all yt? ðŸš©

26 1

@9ja_gistme @drzakiranaik Essence of all religions is exactly same yet people like Zakir Naik get royal treatment as if there are no great saints in Islam. What a shame. There are very highly en...

10 0

[[&gt Sixth Commandment &lt ]] You shall not murder///God values humans highly as he wants us to values cq&gt choose life as well Add Commandments 3.8.9.10.11.12.13.14.15 to this [Orthodoxy Re...

31 1

Our religion (Islam) and our book (Quran) have taught us how to deal with prisoners in the language of humanity. unlike the Israelis who are associated with the language of killing. #IsraeliNewNaz...

12 0 @jacksonhinklle What about jews

religion? 32 1

@DougAMacgregor disagree.Erdogan uses this Muslim brotherhood tactic where he barks some rebellious. courageous speeches/threats just to keep his angry base under control.Gaza isnt his fight neith

For the record, on the label the number 0 is for negative content, 1 is for positive content.

2. Data Preprocessing:

The collected textual data underwent preprocessing techniques to clean and prepare it for analysis. This stage included tasks such as text normalization, tokenization, and removal of stop words, special characters, and irrelevant symbols. Additionally, stemming and lemmatization were applied to standardize the words, ensuring consistency in the dataset.

Table 2. Tweets Data after Pre-processing

id label tweet tidy_tweet

1 0.0 Cricket is a religion not just in India but in whole subcontinent.

Cricket is a religion not just in India but in whole subcontinent.

2 0.0

DEADLINE: NOVEMBER 30 2023 â€

@PrimeProgressng seeks applications for its Religion for Change Fellowship aimed at training and supporting Nigerian/African journalists. 10 successful applicants wi...

DEADLINE: NOVEMBER 30 2023 â€ seeks applications for its Religion for Change Fellowship aimed at training and supporting Nigerian/African journalists. 10 successful applicants will be trained af...

3 0.0

@Ade__Ayomide @JakesOlasupo

@GazetteNGR You're a big !diot you disrespect these ministers of God practicing their religion peaceful with accusations of hate and vengeance against non Christians.U ...

You're a big !diot you disrespect these ministers of God practicing their religion peaceful with accusations of hate and vengeance against non Christians.U lot failed in using tribal sentiments...

4 0.0

@OleOestlid @enur72 Value and purpose is subjective. You can empirically prove it by not finding two individuals with exactly the same choices in life. This has nothing to do with religion or beli...

Value and purpose is subjective. You can empirically prove it by not finding two individuals with exactly the same choices in life. This has nothing to do with religion or beliefs.

5 0.0 @_NavyBrat There you have it folks. the religion of peace. The enemy is inside the gates.

There you have it folks. the religion of peace.

The enemy is inside the gates 3. Comprehensive Sentiment Analysis:

The trained Naive Bayes model was applied to the entire dataset, conducting a comprehensive sentiment analysis of the religious content. The analysis involved categorizing the sentiments expressed into positive, negative, neutral, and nuanced emotional states. The results were analyzed and interpreted to identify patterns, trends, and insights regarding the sentiments prevalent in different religious discussions.

(5)

4. Ethical Considerations:

Ethical guidelines and considerations were adhered to throughout the research process. Anonymity and privacy of the users contributing to the dataset were preserved, and the research aimed to respect the diverse beliefs and opinions expressed within the religious content.

5. Model Training and Testing:

The adapted Naive Bayes algorithm was trained on a portion of the preprocessed dataset and testing on another distinct subset. Testing techniques were employed to assess the model's performance, ensuring its accuracy, precision, recall, and F1-score across various sentiment categories. Rigorous testing was conducted to validate the model's effectiveness in capturing the nuances of sentiments in religious content.

6. Feature Extraction:

To facilitate the Naive Bayes algorithm, feature extraction was employed to convert the textual data into numerical vectors. Term Frequency-Inverse Document Frequency (TF-IDF) and word embeddings techniques were utilized to capture the semantic meanings and importance of words within the religious content.

7. Naive Bayes Algorithm Adaptation:

The Naive Bayes algorithm was adapted to the unique characteristics of religious discourse. Special attention was given to handling figurative language, idiomatic expressions, and context-specific meanings commonly found in religious texts. Training the model involved using a labeled dataset, carefully annotated to reflect the diverse range of sentiments expressed in religious discussions.

8. Conclusion and Implications:

The research methodology's findings were analyzed, leading to conclusions about the sentiments expressed in online religious content. Implications for interfaith dialogue, community understanding, and decision-making processes within religious and social organizations were discussed, highlighting the practical significance of the study's outcomes.

2.3 Sentiment Scoring

The polarity of words has the basic number of features based on the selection process. English language words assign a score by referring dictionary. This scoring module determines the score of sentiments during the analysis of data. Naïve Bayes classifiers are used to classify the sentiment. Naïve Bayes algorithm is used to classify the sentiment and this sentiment orientation performs well with more accuracy.

2.4 Naïve Bayes Classifier

Naive Bayes is a simple probabilistic classifier that calculates a set of probabilities by summing the frequencies and combinations of values from a given dataset. Naive Bayes put forward by the English scientist Thomas Bayes, namely predicting future opportunities based on previous experience [25], [26]. The advantage of using Naive Bayes is that this method only requires a small amount of training data to determine the parameter estimates needed in the classification process. To solve the Naive Bayes method, it can be done with the following equations [27]:

P(H\X) = P(H\X)∗(P(H)

P(X) (1

Where:

X : Data with unknown class

H : Hypothesis data is a specific class

P(H|X) : The probability of the hypothesis H under condition X (probability posteriori) P(H) : Probability of the hypothesis H (probability prior)

P(X|H) : The probability of X based on the conditions in hypothesis H P(X) : Probability of X

Further elaboration of the Naive Bayes formula is carried out by explaining in detail (C|X1…,Xn) using the multiplication rule as follows.

P(C|x1,...,xn = P(C) P(x1,..., xn|C)

= P(C)P(X1|C)P(X2,….,Xn|C,X1)

= P(X1|C)P(X2|C,X1)P(X3|C,X1,X2)

P(Xn|C,X1,X2,X3,...Xn-1 (2)

It can be seen that the more complex factors that affect the probability value, the more impossible it is to calculate these values one by one. As a result, the calculation process will be increasingly difficult to do, so this is where the assumption of very high independence is used, that each attribute can be independent of one another.

With these assumptions, equation is needed:

P(X𝔦|X𝔧 =^P(Xi)P(Xj)

P(Xj) = ^P(Xi∩Xj

P(Xj) = P(Xi) (3)

(6)

For i≠j, so that P(Xi\C, Xj = P (Xi\C)

From equation (3), it can be concluded that the assumption of independence makes the calculation requirements simpler. Furthermore, the description of (P(C|X1,…..,Xn) can be simplified into From equation : P(X2\C)P(X3\C) … P(C\X1, … Xn) = P(X1\C) = ∏ = 1 P(Xi\C)ⁿ_i (4) Information :

∏ⁿ_i= 1 P(Xi\C) = branch multiplication between attributes.

The above equation is Bayes' theorem which will then be used to perform classification calculations. For classification with continuous data or numerical data, use the Gaussian distribution formula with 2 parameters:

mean µ and variance 𝜎 :

p(Xi\C = c j ) = √2πσijexp^(xi−μij)2_2σ2ij (5) Q: Opportunity

Xi : Attribute to i Xj : Attribute value to i

C : The class you are looking for Ci : Subclass Y to be searched for µ : Stating the average of all attributes

𝜎 : Standard deviation, expresses the variance of all attributes.

In the naive Bayes method, training data and test data are needed to be classified, in naive Bayes, the more training data that is involved, the better the predicted results are given. Calculate P(Ci) which is the prior probability for each subclass C that will generated using the equation:

P(ci) = ^Si

S (6)

Where Si is the amount of training data from the Ci category, and s is the total amount of training data.

Calculating P(Xi |Ci) which is the posterior probability of Xi with condition C using equation 4.

The advantages and disadvantages of the Naive Bayes method namely:

a) Strong against interference isolation on data.

b) If there is a missing value case when the computation process is in progress, then the object will be ignored.

c) Can be used for irrelevant data.

Some of the deficiencies found in the Naive Bayes method, namely:

a) Must assume that between features are not related (independent) in reality, the relationship exists.

b) This relationship cannot be modeled by the Naive Bayes Classifier.

It does not require complicated iterative parameter estimation schemes, which means it can be applied to large data sets.

3. RESULT AND DISCUSSION

3.1 Result Testing Program

Python is a programming language with many paradigms, which supports object-oriented, procedural and functional programming. The program was created using Google Colab as a text editor and Python programming compiler which the author used to test this research. To test whether the program is able to meet the demands that have been set, the researcher carried out several tests as follows in table 4. The results of program testing are shown in several sections below, including:

Figure 2. Distribution of datasets for training and testing

Figure 2 shows the distribution of the dataset for training and testing, where for training data 70% of the data used for training is used, while for testing 30% of the data is used. From the graph above, you can see the level of sensitivity between training and testing data.

(7)

Table 3. Example of data that will be tokenized

No Sentimen Text

0 Negatif [Cricket, religion, just, India, whole, subcontinent]

1 Positif

[DEDLINE, NOVEMBER, seeks, applications, Religion, Change, Fellowship, aimed, training, supporting, Nigerian, African, journalists, successful, applicants, will, trained, after, which, pitches, w...]

2 Negatif

[diot, disrespect, these, ministers, practicing, their, religion, peaceful, with, accusations, hate, vengeance, against, Christians, failed, using, tribal, sentiments, polarise, country, want, rel...]

3 Positif [Value, purpose, subjective, empirically, prove, finding, individuals, with, exactly, same, choices, life, This, nothing, with, religion, beliefs]

4 Negatif [There, have, folks, religion, peace, enemy, inside, gates]

In table 3, the process of processing sentences into several words has been separated by character and words that have value have been taken. Table 3 is the result of the text parsing process and tokenization of sample data in Table 2. After the data is tokenized, the text processing process continues and displays the results of all the text in the tweet which will enter the text management process, to be classified by Naive Bayes.In Figure 3 below, the Wordcloud visualization is displayed as a whole.

Figure 3. Distribution of datasets for training and testing

Python is a programming language with many paradigms, which supports object-oriented, procedural and functional programming. From the research that has been carried out, it can be seen that the researcher's IDE is Google Colab and the researcher uses Python (3.8.1). as a programming language. Aiming to make the sentiment analysis application easy to use, researchers created a program based on visual text and images.

In Figure 3, a histogram visualization of racist and non-racist test data is displayed.

Figure 4 shows a visualization of word cloud data with positive sentiment classification and neutral sentiment classification.

a. Teks Negatif b. Teks Positif

Figure 4. histogram visualization of racist and non-racist test data is displayed.

c. Teks Negatif d. Teks Netral

Figure 5. word cloud data visualization with positive sentiment classification and neutral sentiment classification

3.2 Discussion

Word clouds are used to find out the words that appear most often. The more often a word appears, the larger its size, and conversely, the fewer times it appears, the smaller its size. This makes it easier to understand and find

(8)

information about each problem topic for each category. For the overall working cloud regarding sentiment towards religion, including positive, neutral or negative categories, it is shown in Figure 6 below.

Figure 6. Overall Word Cloud for the Positive category negative and neutral

Following are the results of the naive Bayes classification which are displayed in the form of a Confusion Matrix containing information that compares the results of the classification carried out by a system with the results that should be. Confusion Matrix is a method used to measure the performance of a classification method. There are four terms to represent the results of the classification process. There are True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN). True Positive or TP value, is the number of positive data that is classified correctly by the system. True Negative Value or TN, is the amount of negative data that is classified correctly by the system. False Positive or FP values, are the number of posit ive data that are incorrectly classified by the system. False Negative or FN values, are the number of negative data that are incorrectly classified by the system.

Figure 7. Confusion Matrix

From Figure 9 below, the results of the Naive Bayes classification are displayed in the form of a Confusion Matrix containing information that compares the results of the classification carried out by a system with the results in the form of precision, recall, f-measure and accuracy values.

Table 4. Sentiment Analysis Results from Confusion Matrix Precision Recall F1 Score Accuracy

0,8478 0,8298 0,8387 75,10%

The sentiment analysis model exhibits robust performance with a precision of 84.78%, indicating accurate identification of sentiments in religious content, while a recall of 82.98% underscores its effectiveness in capturing a significant proportion of actual sentiments. The balanced F1 Score of 83.87% reflects a comprehensive approach, ensuring accurate predictions without overlooking key sentiments. The model's overall accuracy stands at 75.10%, attesting to its successful adaptation to the complexities of religious discourse. These results collectively affirm the Naive Bayes algorithm's proficiency in providing nuanced insights into sentiments expressed across diverse religious discussions, offering valuable contributions to the understanding of emotional expressions within the realm of faith and spirituality.

4. CONCLUSION

In conclusion, our research on "Comprehensive Sentiment Analysis of Religious Content: Naive Bayes Algorithm Model" has yielded compelling insights into the intricate landscape of sentiments within diverse religious discussions. The Naive Bayes algorithm, tailored to the challenges of religious discourse, demonstrated commendable precision (84.78%), recall (82.98%), and a balanced F1 Score (83.87%), contributing to a nuanced understanding of sentiments. The model's overall accuracy at 75.10% underscores its successful adaptation to the complexities of religious language. These findings not only validate the efficacy of the applied algorithm in deciphering sentiments within online religious content but also provide a valuable tool for researchers, policymakers, and communities seeking deeper insights into the emotional expressions across various faith

(9)

traditions. This research contributes to the burgeoning field of sentiment analysis, offering a methodological framework for navigating the nuanced nuances of sentiments within the realm of faith and spirituality.

REFERENCES

[1] M. Vadivukarassi, N. Puviarasan, and P. Aruna, “Sentimental Analysis of Tweets Using Naive Bayes Algorithm,” World Appl. Sci. J., vol. 35, no. 1, pp. 54–59, 2017, doi: 10.5829/idosi.wasj.2017.54.59.

[2] L. Gaur, U. Bhatia, N. Z. Jhanjhi, G. Muhammad, and M. Masud, “Medical image-based detection of COVID-19 using Deep Convolution Neural Networks,” Multimed. Syst., no. 0123456789, 2021, doi: 10.1007/s00530-021-00794-6.

[3] N. A. Setyadi, M. Nasrun, and C. Setianingsih, “Text Analysis for Hate Speech Detection Using Backpropagation Neural Network,” Proc. - 2018 Int. Conf. Control. Electron. Renew. Energy Commun. ICCEREC 2018, pp. 159–165, 2018, doi:

10.1109/ICCEREC.2018.8712109.

[4] A. Dehnadi et al., “Prophylactic orthosteric inhibition of leukocyte integrin CD11b/CD18 prevents long-term fibrotic kidney failure in cynomolgus monkeys,” Nat. Commun., vol. 8, pp. 1–11, 2017, doi: 10.1038/ncomms13899.

[5] H. Hafizah, D. Napitupulu, K. Adiyarta, and A. P. Windarto, “Decision Support System for Teacher Performance Assessment of SMK Nusantara 1 Ciputat Based on AHP and TOPSIS,” J. Phys. Conf. Ser., vol. 1255, no. 1, pp. 8–15, 2019, doi: 10.1088/1742-6596/1255/1/012055.

[6] M. A. I. M and E. B. Setiawan, “Topic Detection on Twitter using GloVe with Convolutional Neural Network and Gated Recurrent Unit,” vol. 5, no. 2, pp. 386–396, 2023, doi: 10.47065/bits.v5i2.4057.

[7] G. Kukkar, R. Khandare, and S. M. Ali, “Book Recommendation System Using Collaborative Filtering,” Ijarcce, vol. 12, no. 7, pp. 376–385, 2023, doi: 10.17148/ijarcce.2023.12735.

[8] I. G. S. Mas Diyasa, N. M. I. Marini Mandenni, M. I. Fachrurrozi, S. I. Pradika, K. R. Nur Manab, and N. R. Sasmita,

“Twitter Sentiment Analysis as an Evaluation and Service Base On Python Textblob,” IOP Conf. Ser. Mater. Sci. Eng., vol. 1125, no. 1, p. 012034, 2021, doi: 10.1088/1757-899x/1125/1/012034.

[9] H. Chen, S. Hu, R. Hua, and X. Zhao, “Improved naive Bayes classification algorithm for traffic risk management,”

EURASIP J. Adv. Signal Process., vol. 2021, no. 1, 2021, doi: 10.1186/s13634-021-00742-6.

[10] H. T. Zaw, N. Maneerat, and K. Y. Win, “Brain tumor detection based on Naïve Bayes classification,” Proceeding - 5th Int. Conf. Eng. Appl. Sci. Technol. ICEAST 2019, pp. 1–4, 2019, doi: 10.1109/ICEAST.2019.8802562.

[11] M. Wongkar and A. Angdresey, “Sentiment Analysis Using Naive Bayes Algorithm Of The Data Crawler: Twitter,”

Proc. 2019 4th Int. Conf. Informatics Comput. ICIC 2019, no. October 2019, 2019, doi:

10.1109/ICIC47613.2019.8985884.

[12] F. Paquin, J. Rivnay, A. Salleo, N. Stingelin, and C. Silva, “Multi-phase semicrystalline microstructures drive exciton dissociation in neat plastic semiconductors,” J. Mater. Chem. C, vol. 3, pp. 10715–10722, 2015, doi: 10.1039/b000000x.

[13] Putrama Alkhairi and A. P. Windarto, “Classification Analysis of Back propagation-Optimized CNN Performance in Image Processing,” J. Syst. Eng. Inf. Technol., vol. 2, no. 1, pp. 8–15, 2023, doi: 10.29207/joseit.v2i1.5015.

[14] A. Abbas, J. P. Vantassel, B. R. Cox, K. Kumar, and J. Crocker, “A frequency-velocity CNN for developing near-surface 2D vs images from linear-array, active-source wavefield measurements,” Comput. Geotech., vol. 156, no. February, p.

105305, 2023, doi: 10.1016/j.compgeo.2023.105305.

[15] P. Alkhairi, E. R. Batubara, R. Rosnelly, W. Wanayaumini, and H. S. Tambunan, “Effect of Gradient Descent With Momentum Backpropagation Training Function in Detecting Alphabet Letters,” Sinkron, vol. 8, no. 1, pp. 574–583, 2023, doi: 10.33395/sinkron.v8i1.12183.

[16] W. Katrina, H. J. Damanik, F. Parhusip, D. Hartama, A. P. Windarto, and A. Wanto, “C.45 Classification Rules Model for Determining Students Level of Understanding of the Subject,” J. Phys. Conf. Ser., vol. 1255, no. 1, 2019, doi:

10.1088/1742-6596/1255/1/012005.

[17] D. S. Sinaga, A. P. Windarto, R. A. Nasution, and I. S. Damanik, “Prediction of Product Sales Results Using Adaptive Neuro Fuzzy Inference System (Anfis),” J. Artif. Intell. Eng. Appl., vol. 1, no. 2, pp. 92–101, 2022, doi:

10.59934/jaiea.v1i2.73.

[18] N. Irtiza Trinto and M. Eunus Ali, “Detecting Multilabel Sentiment and Emotions from Bangla YouTube Comments,”

2018 Int. Conf. Bangla Speech Lang. Process. ICBSLP 2018, no. January 2019, 2018, doi:

10.1109/ICBSLP.2018.8554875.

[19] B. Gupta, M. Negi, K. Vishwakarma, G. Rawat, and P. Badhani, “Study of Twitter Sentiment Analysis using Machine Learning Algorithms on Python,” Int. J. Comput. Appl., vol. 165, no. 9, pp. 29–34, 2017, doi: 10.5120/ijca2017914022.

[20] J. Gautam, M. Atrey, N. Malsa, A. Balyan, R. N. Shaw, and A. Ghosh, “Twitter Data Sentiment Analysis Using Naive Bayes Classifier and Generation of Heat Map for Analyzing Intensity Geographically,” Adv. Intell. Syst. Comput., vol.

1319, no. April, pp. 129–139, 2021, doi: 10.1007/978-981-33-6919-1_10.

[21] N. A. Setyadi, M. Nasrun, and C. Setianingsih, “Text Analysis for Hate Speech Detection Using Backpropagation Neural Network,” Proc. - 2018 Int. Conf. Control. Electron. Renew. Energy Commun. ICCEREC 2018, pp. 159–165, 2018, doi:

10.1109/ICCEREC.2018.8712109.

[22] B. Eidel, “Deep CNNs as universal predictors of elasticity tensors in homogenization,” Comput. Methods Appl. Mech.

Eng., vol. 403, p. 115741, 2023, doi: 10.1016/j.cma.2022.115741.

[23] M. Turuk, “CNN Based Deep Learning Approach for Automatic Malaria Parasite Detection,” IAENG Int. J. Comput.

Sci., vol. 49, no. 3, 2022.

[24] S. Kamada and T. Ichimura, “An Adaptive Learning Method of Deep Belief Network by Layer Generation Algorithm hj hj,” pp. 2967–2970, 2016.

[25] F. E. Botchey, Z. Qin, and K. Hughes-Lartey, “Mobile money fraud prediction-A cross-case analysis on the efficiency of support vector machines, gradient boosted decision trees, and Naïve Bayes algorithms,” Inf., vol. 11, no. 8, 2020, doi:

10.3390/INFO11080383.

[26] A. S. Chan, F. Fachrizal, and A. R. Lubis, “Outcome Prediction Using Naïve Bayes Algorithm in the Selection of Role Hero Mobile Legend,” J. Phys. Conf. Ser., vol. 1566, no. 1, 2020, doi: 10.1088/1742-6596/1566/1/012041.

(10)

[27] P. Golpour et al., “Comparison of support vector machine, naïve bayes and logistic regression for assessing the necessity for coronary angiography,” Int. J. Environ. Res. Public Health, vol. 17, no. 18, pp. 1–9, 2020, doi:

10.3390/ijerph17186449.