Comparative Analysis of ARIMA and LSTM Models for Predicting Physical Fatigue in Bandung Workers

(1)

Comparative Analysis of ARIMA and LSTM Models for Predicting Physical Fatigue in Bandung Workers

Kiki Dwi Prasetyo^*, Rifki Wijaya, Gia Septiana Wulandari School of Informatics, Informatics, Telkom University, Bandung, Indonesia Email: ^1*[email protected], ²[email protected],

3[email protected]

Correspondence Author Email: [email protected]

Abstract−In today's era of rapid economic growth, there is an increasing demand for workers to increase productivity by working longer and harder. However, these demands often lead to irregular and excessive working hours, which can potentially lead to negative consequences, such as physical fatigue-a state in which the body feels tired after physical activity. Factors that influence this fatigue include age, gender, health conditions, workload and work environment. Physical fatigue poses a significant challenge in ensuring workplace safety, especially in the transportation and industrial sectors, as it can reduce overall performance, productivity and quality of work. In addition, physical fatigue also increases the likelihood of decision-making errors and workplace accidents. Predicting physical fatigue is crucial to addressing these challenges. Heart rate serves as a parameter to measure fatigue, given its proven efficacy as a marker to predict physical fatigue, which is derived from the electrocardiogram and regulated by the autonomic nervous system. This research utilizes two machine learning algorithms - ARIMA and LSTM - with heart rate (bpm) and number of steps as variables. Performance evaluation, using Mean Squared Error (MSE) and Root Mean Squared Error (RMSE), showed that the LSTM model outperformed the ARIMA model. The LSTM model showed better performance, with MSE of 0.1108 and RMSE of 0.3329, compared to the ARIMA model with MSE of 0.2397 and RMSE of 0.4895.

Keywords: Autoregressive Integrated Moving Average; Long Short Term Memory; Heart Rate; Time Series; Physical Fatigue

1. INTRODUCTION

The modern era of rapid economic development requires workers to work harder and longer to increase productivity. This demand often results in excessive and irregular working hours. One of the negative impacts that can arise is physical fatigue, which is a condition where the body feels tired after doing physical activity. Fatigue levels can be affected by a range of factors, including age, gender, health conditions, as well as workload and the environment in which one works [1].

In carrying out work, physical fatigue is a serious problem in the field of occupational safety, especially in the transportation and industrial sectors. Physical fatigue can lead to reduced levels of performance, productivity and quality of work [2]. In addition, physical fatigue can also increase the risk of errors in decision-making, and can even lead to accidents in the workplace [3]. Short-term effects of physical fatigue include discomfort, decreased ability and strength, reduced levels of alertness, poor and slowed perception, and feelings of lethargy, drowsiness, and dizziness [1], [2], [4]. This condition of fatigue can also lead to difficulty concentrating, reduced levels of alertness, poor perception, and loss of control over one's strength [1], [2]. Meanwhile, the long-term effects of physical fatigue involve musculoskeletal and cardiovascular disorders, and can even develop into chronic fatigue syndrome [5]. It's important to remember that physical fatigue not only affects work performance, but can also have an impact on overall health.

Knowing when physical fatigue will occur is an important step in preventing its negative effects. The way to know when physical fatigue is likely to occur is through prediction. Physical fatigue prediction is the process of estimating the physical fatigue that will occur at a later time by considering current information and the factors that influence it [6]. One of the parameters that can be used to predict physical fatigue is heart rate [7]. In daily life, the most commonly felt change in physiological parameters is heart rate. Heart rate refers to the frequency of heartbeats in one minute. Heart rate can be used to predict physical fatigue because heart rate, which the autonomous nervous system regulates and which is obtained from the electrocardiogram, can be shown to be a potential indicator for forecasting physical fatigue [7]. Heart rate data is time series data, which means that this information is obtained continuously over a certain period of time. Therefore, physical fatigue analysis can be more effective if it considers the temporal aspect of heart rate data, allowing to identify patterns or trends that may be potential predictors of future physical fatigue.

This research aims to predict physical fatigue based on heart rate using two algorithms, namely Autoregressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM). Both algorithms were chosen because they can learn temporal patterns contained in the data and are suitable for predicting time series data, especially if the prediction time is long [8]. The accuracy of the prediction results from these two algorithms will be compared to evaluate their performance. As a foundation, several previous studies have shown the success of ARIMA and LSTM in predicting time series data, including heart rate data prediction.

As examples, previous research [9] focused on Bitcoin price prediction using ARIMA, GRU, and LSTM models. The results showed that ARIMA gave the best results with MAPE of 2.76% and RMSE of 302.53. Another research [8], which compared several machine learning models such as ARIMA, linear regression, SVR, KNN

(2)

regressor, decision tree regressor, random forest regressor, and LSTM, showed that ARIMA, with walk-forward validation and linear regression, outperformed other models in predicting heart rate values in the next time. The findings of these previous studies consistently confirm the reliability of the ARIMA model in handling time series data prediction problems.

Furthermore, research [10],showed that LSTM was able to accurately predict driver fatigue levels using heart rate and percentage of eyelid closure (PERCLOS) data. The prediction results from the LSTM achieved a true positive rate of 75% and an overall accuracy of 88%, demonstrating the ability of the LSTM model to predict fatigue levels with a high degree of accuracy. Then, in [11], a comparison was made between the Long Short-Term Memory (LSTM) and Autoregressive Integrated Moving Average (ARIMA) models for predicting time series data. This paper uses intraday data from the Tehran Stock Exchange (TSE) for 10 years to forecast the next 2 months. LSTM produces better output compared to the ARIMA model. The results show that, although in long- term prediction, the prediction error rates of both models decrease, LSTM significantly outperforms ARIMA in terms of prediction error rates. These studies demonstrate the ability of the LSTM model to predict fatigue levels and time series data with good accuracy.

In performing the comparison, the performance evaluation metrics used are Mean Squared Error (MSE) and Root Mean Squared Error (RMSE). MSE measures the average of the squares of the difference between the predicted value and the true value, while RMSE gives an indication of the prediction error in the same units as the predicted variable. Using these two metrics, the research aims to identify algorithms that provide heart rate-based physical fatigue predictions with lower error rates, thus providing a better understanding of the performance of ARIMA and LSTM in this research.

2. RESEARCH METHODOLOGY

2.1 Research Stages

In this research, a system for predicting physical fatigue is created using two machine learning techniques, namely Autoregressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM) which will be compared. The flowchart of the system is described in Figure 1 which illustrates the stages to be gone through.

The process starts by involving the input dataset which then goes through a preprocessing stage. The dataset is divided into two parts, where one part is used as training data to train the model, while the other part is used as testing data to evaluate the performance of the model. The final process is characterized by an assessment of the system's performance.

Figure 1. Flowchart System 2.2 Data collection

This research uses a smartwatch to collect heart rate data for workers in Bandung City for 2 days per worker. Heart rate is considered a physiological parameter that is often felt in everyday life. The variability of heart measurements

(3)

is considered an indicator of the level of fatigue. Heart Rate is widely used in identifying fatigue states and measuring performance responses to workload, serving as a direct regulator of cardiovascular function in the autonomic nervous system (ANS). The aim of this research was to analyze heart rate to identify patterns associated with levels of physical fatigue. After data collection for 2 days, interviews were conducted to obtain further information about fatigue time.

2.3 Data preprocessing

Preprocessing refers to a series of steps or processes applied to data before it is utilized for model training or analysis. In this research, preprocessing is carried out with the aim of preparing the data to suit the needs of the model or analysis to be run. In this research, several preprocessing steps were conducted, and they include:

a. Data preparing

The data collected from the smartwatch needs to follow an export step before it can be processed further. The exported data from the smartwatch can be found in the global export data folder, where all exported data will reside. Initially, the data is presented in JSON format, and then needs to be converted to CSV format for further processing. In Table 1, an example of raw data that has been converted to CSV format is shown.

Table 1. Sample Raw Data

dateTime value/bpm value/confidence

09/23/23 00:00:02 99 1

09/23/23 00:00:07 98 1

09/23/23 00:00:12 96 1

09/23/23 00:00:17 95 2

09/23/23 00:00:32 95 2

09/23/23 00:00:37 95 2

b. Data interpolation

The quality of the dataset is still not optimal due to several cases where data was not recorded within the 5- second interval, resulting in empty values. Therefore, it is necessary to interpolate the data to fill in the gaps of heart rate information at unrecorded times. Table 2 shows that the data has not gone through the interpolation process, where the data is still not optimal with some time ranges that do not have heart rate data. Meanwhile, Table 3 shows the data that has gone through the interpolation process, where the dataset becomes more optimal with the empty values that have been filled.

Table 2. Before Interpolation Data dateTime bpm 2023-10-06 00:00:02 57 2023-10-06 00:00:17 56 2023-10-06 00:00:22 57 2023-10-06 00:00:32 58 2023-10-06 00:00:37 59 Table 3. After Interpolation Data

dateTime bpm 2023-10-06 00:00:02 57 2023-10-06 00:00:07 56 2023-10-06 00:00:12 56 2023-10-06 00:00:17 56 2023-10-06 00:00:22 57 c. Data labeling

Labeling is done by providing information about heart rate and step data taken using a smartwatch. The label given is "yes" to indicate that the worker is experiencing fatigue and "no" to indicate that the worker is not experiencing fatigue. This labeling process is adjusted to the results of the interviews that have been conducted.

Table 4 shows the data that has gone through the labeling process where a fatigue label column is added to indicate the state of fatigue or not on each data entry.

Table 4. Data labelling fatigueness with bpm and step as feature.

id dateTime bpm step fatigue ID001 2023-09-24 07:22:17 86 2 yes ID001 2023-09-24 17:17:07 77 0 yes ID002 2023-09-27 06:39:52 83 0 no ID002 2023-09-27 19:50:07 63 0 yes

(4)

id dateTime bpm step fatigue ID003 2023-10-06 23:38:02 64 0 no ID003 2023-10-06 23:38:07 65 0 no d. Data normalization

Data normalization is the process of transforming the values in a dataset into a specific range to ensure that all features are similarly scaled [12]. Normalization is performed using methods such as Min-Max Scaling, which converts each value in the dataset into a value between 0 and 1. The normalization formula can be seen in formula (1).

x_norm = ^{x − min(x)}

max(x)−min(x) (1)

Figure 2. Data Normalization

Figure 2 shows the normalized dataset, where the dateTime, bpm, and step data have changed to a single range of values between 0 and 1.

2.4 Split dataset

The process of splitting the dataset is done by separating the dataset into two subsets, which are the training set and the test set. The training subset is used to train the model, while the test subset is used to test the performance of the trained model on data that has never been seen before. The dataset division ratio applied is 70% for the training set and 30% for the test set.

2.5 Autoregressive Integrated Moving Average

ARIMA, introduced by Box and Jenkins in 1970, is an efficient tool in forecasting future values in time series [13] . This method is used to describe time series based on observed values and is often applied for short-term forecasting with the advantage of outperforming complex structural models [14] . In the general formula of ARIMA (p, d, q), p refers to the Autoregressive (AR) order which indicates the impact of the dependent variable in the previous period, d is the difference order, and q is the moving average (MA) order which indicates that the independent variable is the residual value in the previous period [15]. This model not only focuses on the influence of past values, but also takes into account the difference and moving average effects, providing a comprehensive view of the time series. ARIMA is widely applied and effective in forecasting periodic series with patterns without random white noise, without seasonality, and can model changes in data efficiently [13]. in an ARIMA model, the forthcoming value of a variable is a linear combination of its previous value and the associated error, represented as follows:

Yt = ϕ0 + ϕ1Yt − 1 + ϕ2Yt − 2 + ⋯ + ϕpYt − p + εt − θ1εt − 2 − ⋯ − θqεt − q (2) Yt represents the actual value, Et is the random error at time t, ϕi and θj denote coefficients, and p and q are integers commonly known as autoregressive and moving average parameters, respectively [12].

In this research, the ARIMA method is implemented using the pmdarima library and the auto_arima function. This approach provides higher automatization in determining the best parameters for the ARIMA model.

The ARIMA model, which stands for AutoRegressive Integrated Moving Average, is a statistical model that is generally used to analyze time series. The ARIMA model formula involves parameters p, d, and q, which respectively indicate the autoregressive order, differential level, and moving average order. The auto_arima function takes a time series (train_set_arima['fatigue']) as input and automatically carries out the process of finding the best parameters. Additionally, exogenous variables (train_set_arima[columns_to_normalize]) can be included to train the model using additional information. The results of the modeling process are stored in the model_arima variable which is then called to make predictions regarding the level of physical fatigue.

2.6 Long Short-Term Memory

Long Short-Term Memory or commonly abbreviated as "LSTM" is a variant of the Recurrent Neural Network (RNN) model. The LSTM model is a type of artificial neural system specifically designed to overcome the

(5)

constraints of bursting or disappearing gradients that often occur in long-term dependency learning, especially when there are long enough time intervals [16]. Like any other artificial neural network, LSTM has several hidden layers called memory cells [17], [18]. A memory cell consists of an input gate, forget gate, and output gate [16], [19]. LSTMs are known to be very effective in classification, processing, and forecasting tasks of time series data, especially since there is often an unknown time lag between important events in the time series [20].

Figure 3. LSTM architecture [16]

In this research, the Long Short-Term Memory (LSTM) method is implemented with a configuration using an LSTM layer which has a total of 90 neuron units, followed by a Dropout layer with a dropout rate of 0.2. The Dropout layer aims to reduce the potential for overfitting by randomly ignoring a number of neurons during the training process. Next, a Dense layer is added as an output layer with one neuron and a linear activation function.

Once the model structure is determined, the model is compiled using the Adam optimizer, Mean Squared Error (MSE) loss function, and accuracy measures for model performance. The training process begins by configuring the EarlyStopping callback, which incorporates losses on validation data and stops training if no improvement occurs after a specified number of epochs and recovers the best weights.

The model is then drilled using training data with input X_train which includes datetime data, heart measurements, and steps, while y_train contains fatigue label data. Model training was carried out for 50 epochs, with a batch size of 128. Model evaluation was carried out using data validation. With this approach, LSTM models can recognize and utilize information from long-term time series, producing accurate predictions.

2.7 Performance evaluation

The final step in evaluating the performance of ARIMA and LSTM models involves measuring prediction accuracy using Mean Squared Error (MSE). This research chose MSE as the evaluation metric, which is a metric in statistics to calculate the average squared error among the predicted value and the actual value [21]. The Mean Squared Error (MSE) offers a comprehensive measure of proximity between the predicted and actual values. The smaller the MSE value, the higher the accuracy of the model [21]. the assessment of prediction accuracy involves the use of the Root Mean Squared Error (RMSE), derived by taking the square root of the MSE value. A smaller RMSE value indicates a more accurate model [10]. The prediction error rate is measured as reflected in equation (3) and equation (4).

MSE = ¹

n∑ⁿ_i=1(y_i− ŷ_i)² (3)

RMSE = √^∑^Nⁿ⁼¹ ^(xⁿ^−x̂ⁿ⁾²

N (4)

3. RESULT AND DISCUSSION

. 3.1 Result Datasets

The data used in this research includes dateTime, heart rate, and step count information collected through the use of a smartwatch. The source of data came from six workers in the city of Bandung, who provided data for two days for each individual. After the data collection period, interviews were conducted to obtain additional information regarding the time to fatigue. The data obtained from the smartwatches and the interview information were combined in the dataset labeling process. In this process, a "yes" label is used to indicate a fatigued state, while a "no" label indicates a normal or non-fatigued state. Table 5 below shows the results of the preprocessing stage of the dataset:

Table 5. Sample datasets

id dateTime bpm step fatigue ID001 2023-09-23 01:06:07 97 1 no

(6)

id dateTime bpm step fatigue ID001 2023-09-24 17:21:37 77 0 yes ID002 2023-09-28 15:05:12 95 2 yes ID002 2023-09-27 19:19:42 68 0 yes ID003 2023-10-07 03:06:52 58 0 no ID003 2023-10-07 17:59:42 101 2 yes ID004 2023-10-12 17:57:37 71 0 yes ID004 2023-10-12 16:42:27 72 0 yes ID005 2023-10-28 12:23:47 100 4 yes ID005 2023-10-27 21:00:37 61 0 no ID006 2023-11-01 18:02:57 91 0 yes ID006 2023-10-31 09:19:12 94 0 no

The data description in Figure 4 reflects a dataset with 149,796 entries and five main columns. First, the 'id' column uses an object data type (string) to provide the numbering of each distinct subject in the dataset. Then, the 'dateTime' column provides timestamp information for each heart rate and step measurement, represented in the int64 (integer) data type. Next, the 'bpm' and 'step' columns store the heart rate (beats per minute) and step count values at each measurement time, both using the int64 (integer) data type. Finally, the 'fatigue' column uses the object data type (string) to present data in the form of text that provides information about fatigue.

Figure 4. Data description 3.2 Result of Autoregressive Integrated Moving Average

In this research, the search for the optimal Autoregressive Integrated Moving Average (ARIMA) model was conducted using pmdarima library. This library facilitates the process of automatically searching for ARIMA models without the need to manually determine ARIMA parameters such as p, d, and q. The response variables used involved a dataset that included dateTime, heart rate, and number of steps. The response variables used involved datasets that included dateTime, heart rate, and number of steps. Physical fatigue data was used as the response variable, while dateTime, heart rate, and number of steps were considered as predictors or external variables that could potentially affect fatigue levels.

The search process is performed through the use of the auto_arima function. The trace parameter setting was set to True to allow observation of additional information during the search, such as the AIC (Akaike Information Criterion) value for each model tested. This step helps in the selection of the best model based on such evaluation criteria. The result of the search process, the best ARIMA model, can be used to perform physical fatigue level prediction based on the heart rate dataset.

Figure 5. Auto ARIMA

After conducting the search process, it was found that the best ARIMA model to predict the level of physical fatigue is ARIMA (1, 0, 0). This model was then implemented to make predictions based on the heart rate dataset in the test data. The total time taken to fit this model was 112,100 seconds. Next, the best model was tested on the

(7)

test data, and the performance evaluation results showed a Mean Squared Error (MSE) of 0.23965755663597266, and a Root Mean Squared Error (RMSE) of 0.48954831900025214.

These evaluation results give an idea of how well the model can generalize the prediction to new data, measuring how close the prediction is to the true value on the test data set. With relatively low MSE and RMSE values, it can be concluded that the ARIMA (1, 0, 0) model is able to provide a fairly accurate prediction of the level of physical fatigue based on the heart rate dataset in the test data.

3.3 Result of Long Short-Term Memory

The results of training and testing for the Long Short-Term Memory (LSTM) model will be illustrated using graphs that can be seen below, there are 2 graphs, namely graphs that describe the change in loss per epoch during the training process and graphs that describe the increase in accuracy during the training process.

Figure 6. Model Loss per Epoch

Analysis based on the model loss per epoch graph shows an increase in model accuracy as the training time progresses. The loss on the training data shows a consistent decrease as the number of epochs increases. The final Mean Squared Error (MSE) value reaches 0.11079745451460128, indicating that the model has successfully optimized its performance on the training data. The final Root Mean Squared Error (RMSE) value is 0.3328625159350348. This value indicates that the LSTM model is able to provide predictions that are closer to the true value on the training dataset. This result shows that the LSTM model has learned well from the training data and is able to minimize the prediction error.

Furthermore, the evaluation of training and testing accuracy levels can be seen in the following graph. This graph will provide a clearer picture of how the model performs during the training and testing process. By comparing the accuracy rate between training and testing data.

Figure 7. Training and Validation Accuracy per Epoch

The accuracy graph visualizes the performance of the model on the training and validation sets of each epoch. The increase in training accuracy over time reflects the improvement of the model's precision during the training process. There is a positive trend in the accuracy graph, indicating that the model continues to improve in understanding the patterns in the data. Overall, the model performed well, achieving an accuracy rate of 84.33%.

3.4 Comparison of ARIMA and LSTM

The comparison is done by comparing the performance evaluation results of the Autoregressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM) models. Model performance evaluation is performed using the Mean Squared Error (MSE) and Root Mean Squared Error (RMSE) metrics. The results of the performance evaluation of the two models can be found in Table 4.

(8)

Table 4. Comparation MSE, RMSE between ARIMA and LSTM (Lower is Better)

MSE RMSE

ARIMA 0.23965755663597266 0.48954831900025214 LSTM 0.11079745451460128 0.3328625159350348

From the comparison results table, it can be seen that the LSTM model has better performance than the ARIMA model. This can be observed from the lower Mean Squared Error (MSE) and Root Mean Squared Error (RMSE) values in the LSTM model (MSE=0.1108, RMSE=0.3329) compared to the ARIMA model (MSE=0.2397, RMSE=0.4895). The lower MSE and RMSE values indicate that the LSTM model provides more accurate predictions and is closer to the true value than the ARIMA model.

4. CONCLUSION

This research conducted a comparison between two models, namely Autoregressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM), in estimating the level of physical fatigue based on heart measurement data. From the performance evaluation results, it can be concluded that the LSTM model shows superior performance compared to the ARIMA model. First of all, the LSTM model shows a Mean Squared Error (MSE) value of 0.1108 and a Root Mean Squared Error (RMSE) of 0.3329. On the other hand, the ARIMA model shows an MSE value of 0.2397 and an RMSE of 0.4895. These results indicate that the LSTM model provides predictions that are more accurate and closer to the actual values, with significantly lower MSE and RMSE values compared to the ARIMA model. Further analysis suggests that the superiority of the LSTM model may stem from its ability to capture complex patterns in cardiac measurement time data. LSTM can effectively model and predict the variability of physical fatigue levels based on heart rate signals, which may be difficult to capture by simpler ARIMA models. Thus, these findings support the use of LSTM models as a more effective choice in this context, especially when dealing with complex temporal data.

ACKNOWLEDGMENT

The author would like to express his deepest gratitude to Telkom University for their invaluable support and facilitation during the implementation of this research. Their commitment and assistance played an important role in the successful implementation of this research. The authors highly appreciate their contributions and collaborative efforts, which have enriched the research process significantly.

REFERENCES

[1] G. P. Humairoh, R. Dani, and E. Putra, “Analisis Faktor-Faktor Yang Berhubungan Dengan Kelelahan Fisik Karyawan (Studi Kasus PT. X),” Serambi Engineering, vol. 5, no. 3, pp. 1177–1187, 2020.

[2] S. R. Aghdam, S. Alizadeh, Y. Rasoulzadeh, and A. Safaiyan, “Fatigue Assessment Scales: A Comprehensive Literature Review Background,” Journal Homepage: Rahimian Aghdam S, Alizadeh SS, Rasoulzadeh Y, Safaiyan A. Fatigue Assessment Scales: A Comprehensive Literature Review. Arch Hyg Sci, vol. 8, no. 3, pp.

145–153, 2019.

[3] J. Fan and A. P. Smith, “The impact of workload and fatigue on performance,” Communications in Computer and Information Science, vol. 726, pp. 90–105, 2017, doi: 10.1007/978-3-319-61061-0_6.

[4] H. M. Abd-Elfattah, F. H. Abdelazeim, and S. Elshennawy, “Physical and cognitive consequences of fatigue:

A review,” J Adv Res, vol. 6, no. 3, pp. 351–358, May 2015, doi: 10.1016/j.jare.2015.01.011.

[5] S. A. Useche, V. G. Ortiz, and B. E. Cendales, “Stress-related psychosocial factors at work, fatigue, and risky driving behavior in bus rapid transport (BRT) drivers,” Accid Anal Prev, vol. 104, pp. 106–114, Jul. 2017, doi: 10.1016/j.aap.2017.04.023.

[6] L. A. Ratna Winanda, T. Wahyu Adi, and N. Anwar, “MODEL PREDIKSI KELELAHAN PEKERJA KONSTRUKSI DI LOKASI PROYEK,” A SCIENTIFIC JOURNAL OF CIVIL ENGINEERING, vol. 21, no. 2, pp. 99–109, 2017.

[7] Z. Ni, F. Sun, and Y. Li, “Heart Rate Variability-Based Subjective Physical Fatigue Assessment,” Sensors, vol. 22, no. 9, May 2022, doi: 10.3390/s22093199.

[8] M. Oyeleye, T. Chen, S. Titarenko, and G. Antoniou, “A Predictive Analysis of Heart Rates Using Machine Learning Techniques,” Int J Environ Res Public Health, vol. 19, no. 4, Feb. 2022, doi:

10.3390/ijerph19042417.

[9] P. T. Yamak, L. Yujian, and P. K. Gadosey, “A comparison between ARIMA, LSTM, and GRU for time series forecasting,” ACM International Conference Proceeding Series, pp. 49–55, Dec. 2019, doi:

10.1145/3377713.3377722.

(9)

[10] D. Utomo, T.-H. Yang, D. Thi Thanh, and P.-A. Hsiung, “Driver Fatigue Prediction Using Different Sensor Data with Deep Learning,” 2019 IEEE International Conference on Industrial Cyber Physical Systems (ICPS), pp. 242–247, 2019, doi: 10.1109/ICPHYS.2019.8780225.

[11] G. Abdoli, M. Mehrara, and M. E. Ardalani, “COMPARING THE PREDICTION ACCURACY OF LSTM AND ARIMA MODELS FOR TIME-SERIES WITH PERMANENT FLUCTUATION,” vol. 9, no. 2, pp.

314–339, 2020.

[12] A. Harmain, Paiman, H. Kurniawan, kusrini, and D. Maulina, “DATA NORMALIZATION FOR K-MEANS EFFICIENCY ON GROUPS OF AREAS WITH POTENTIAL FORES AND /LAND FIRE BASED ON HEAT SPOTS DISTRIBUTION,” TEKNIMEDIA, vol. 2, no. 2, pp. 83–89, 2021.

[13] S. Khan and H. Alghulaiakh, “ARIMA Model for Accurate Time Series Stocks Forecasting,” IJACSA) International Journal of Advanced Computer Science and Applications, vol. 11, no. 7, pp. 524–528, 2020.

[14] A. A. Adebiyi, A. O. Adewumi, and C. K. Ayo, “Stock price prediction using the ARIMA model,” in Proceedings - UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSim 2014, Institute of Electrical and Electronics Engineers Inc., 2014, pp. 106–112. doi: 10.1109/UKSim.2014.67.

[15] B. P. Yafitra, Indwiarti, and A. R. Atiqi, “PERBANDINGAN PREDIKSI HARGA SAHAM DENGAN MODEL ARIMA DAN ARTIFICIAL NEURAL NETWORK,” Ind. Journal on Computing, vol. 4, no. 2, pp.

189–198, 2019, doi: 10.21108/indojc.2019.4.2.344.

[16] G. Van Houdt, C. Mosquera, and G. Nápoles, “A review on the long short-term memory model,” Artif Intell Rev, vol. 53, no. 8, pp. 5929–5955, Dec. 2020, doi: 10.1007/s10462-020-09838-1.

[17] F. Shahid, A. Zameer, and M. Muneeb, “Predictions for COVID-19 with deep learning models of LSTM, GRU and Bi-LSTM,” Chaos Solitons Fractals, vol. 140, Nov. 2020, doi: 10.1016/j.chaos.2020.110212.

[18] G. Bathla, “Stock price prediction using LSTM and SVR,” in PDGC 2020 - 2020 6th International Conference on Parallel, Distributed and Grid Computing, Institute of Electrical and Electronics Engineers Inc., Nov. 2020, pp. 211–214. doi: 10.1109/PDGC50313.2020.9315800.

[19] Y. Hua, Z. Zhao, R. Li, X. Chen, Z. Liu, and H. Zhang, “Deep Learning with Long Short-Term Memory for Time Series Prediction,” IEEE Communications Magazine, vol. 57, no. 6, pp. 114–119, Oct. 2018, doi:

10.1109/MCOM.2019.1800155.

[20] A. Khumaidi, R. Raafi, I. Permana Solihin, and J. Rs Fatmawati, “Pengujian Algoritma Long Short Term Memory untuk Prediksi Kualitas Udara dan Suhu Kota Bandung,” Jurnal Telematika, vol. 15, no. 1, 2019.

[21] M. Ayitey Junior, P. Appiahene, and O. Appiah, “Forex market forecasting with two-layer stacked Long Short-Term Memory neural network (LSTM) and correlation analysis,” Journal of Electrical Systems and Information Technology, vol. 9, no. 1, Dec. 2022, doi: 10.1186/s43067-022-00054-1.