The six models were trained by fine-tuning on five datasets taken from the same BreaKHis database: we split the data by magnification factor and also used the full set. With an 80% training set and 20% test set, all layers of every model were fine-tuned to classify tumors as malignant or benign.
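The dataset preparation described above can be sketched as follows; this is a minimal illustration, not the authors' code, and the filenames and magnification labels are hypothetical placeholders.

```python
import random
from collections import defaultdict

def split_by_magnification(samples, train_frac=0.8, seed=42):
    """Group (filepath, label, magnification) records by magnification
    factor and split each group 80/20 into train and test sets.
    Returns {magnification: (train_list, test_list)}."""
    groups = defaultdict(list)
    for path, label, mag in samples:
        groups[mag].append((path, label))
    splits = {}
    rng = random.Random(seed)
    for mag, items in groups.items():
        rng.shuffle(items)                     # randomize before splitting
        cut = int(len(items) * train_frac)     # 80% boundary
        splits[mag] = (items[:cut], items[cut:])
    return splits

# Hypothetical records in BreaKHis style: (file, class, magnification)
data = [(f"img_{mag}_{i}.png", "benign" if i % 2 else "malignant", mag)
        for mag in ("40X", "100X", "200X", "400X") for i in range(10)]
splits = split_by_magnification(data)
```

Training on the full dataset simply merges all four groups before splitting.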
Several conventional performance metrics were extracted from confusion matrices, including true positive (TP), false positive (FP), true negative (TN), false negative (FN), accuracy, precision, recall, and F-1 score.
\[
\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{1}
\]
\[
\text{Precision} = \frac{TP}{TP + FP} \tag{2}
\]
\[
\text{Recall} = \frac{TP}{TP + FN} \tag{3}
\]
\[
\text{F-1 Score} = 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}} \tag{4}
\]
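Equations (1)-(4) can be computed directly from the four confusion-matrix counts; a minimal sketch (the counts used in the example are illustrative, not taken from the paper's confusion matrix):

```python
def classification_metrics(tp, fp, tn, fn):
    """Compute accuracy, precision, recall and F-1 score from
    confusion-matrix counts, following equations (1)-(4)."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)   # eq. (1)
    precision = tp / (tp + fp)                   # eq. (2)
    recall = tp / (tp + fn)                      # eq. (3)
    f1 = 2 * precision * recall / (precision + recall)  # eq. (4)
    return accuracy, precision, recall, f1

# Illustrative counts only:
acc, prec, rec, f1 = classification_metrics(tp=90, fp=10, tn=85, fn=15)
```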
Models                Accuracy   Precision   Recall   F-1 Score
Efficientnetv2-b0     0.945      0.95        0.945    0.945
Mobilenet-v2          0.9349     0.935       0.935    0.9349
Resnetv2-50           0.94075    0.94        0.94     0.94075
Inception-v3          0.8395     0.855       0.84     0.8395
Inception-Resnet-v2   0.8468     0.845       0.845    0.8468
VGG16                 0.9378     0.938       0.938    0.9378

Table 4.1: The results of the models on the overall BreaKHis data
Figure 4-2: The graphical representation of the model results in Table 4.1
Figure 4-3: Confusion Matrix of Efficientnetv2-b0
According to Table 4.1, the strongest results were produced by Efficientnetv2, Mobilenet-v2, Resnetv2-50, and VGG16, with Efficientnetv2 performing best at 94.5% accuracy. Overall, Efficientnetv2 worked very well and achieved top results compared with the other CNN architectures, although on the data with a magnification factor of 400x it performed below expectation (Table 4.2). Figure 4-4 shows the model execution and the variation of training and validation accuracy. For images captured at 400x microscope magnification, the best results were achieved by the pre-trained Inception-Resnet-v2 architecture with 88.5% accuracy. Based on the experiments performed, Efficientnetv2-b0 appears to be a suitable model for the analysis of histopathological images. The results also show that the magnification factor affects model performance: the characteristics of the magnified images differ from what the pre-trained models expect, and evaluation scores decrease accordingly. For example, at a magnification factor of 400x, almost all models show relatively low accuracy, since histopathological images magnified 400 times contain many features that do not match the models' expectations.
Figure 4-4: Model Execution and Plot of Training Accuracy by Validation Accuracy

The best model, Efficientnetv2-b0, was modified by adding 3 layers and updating all weights of the pre-trained deep CNN architecture, and was trained for 12 epochs with cross-validation and a batch size of 64. Validation accuracy ranged between 92% and 99%, while on the test data it reached 94.5%, varying between 94% and 97% across runs, which is a high result. Figure 4-3 shows the confusion matrix summarizing the prediction results of the improved Efficientnetv2-b0 model.
Papers                      Methods                          40x     100x    200x    400x
Fabio et al. [38]           PFTAS + SVM                      0.816   0.799   0.851   0.823
Han et al. [39]             AlexNet + Aug                    0.856   0.835   0.831   0.808
Farjana Parvin et al. [24]  pre-trained CNN                  0.89    0.92    0.94    0.9
Our proposed study          pre-trained CNN + Aug + Opt Par  0.955   0.93    0.98    0.885

Table 4.3: The comparison of previous studies with our proposed study
Table 4.3 shows the accuracy results of previous papers on the classification of breast cancer type in histopathological images. All of this research was done on the BreaKHis dataset, and our proposed work achieves higher accuracy than the other state-of-the-art models. Han et al. [39] also achieved an average of 93.8% accuracy with the CSDCNN model, which is still below the results of our proposed study. We can therefore state that our model is an excellent methodology for the detection of breast cancer.
Models                Magnification Factor   Accuracy   Precision   Recall   F-1 Score
Efficientnetv2-b0     40x                    0.955      0.955       0.95     0.95
                      100x                   0.93       0.93        0.93     0.93
                      200x                   0.98       0.98        0.98     0.98
                      400x                   0.81       0.81        0.845    0.81
Mobilenet-v2          40x                    0.645      0.645       0.645    0.645
                      100x                   0.755      0.755       0.805    0.755
                      200x                   0.84       0.83999     0.85     0.84
                      400x                   0.87       0.87        0.86     0.86
Resnet-50             40x                    0.87       0.87        0.86     0.86
                      100x                   0.805      0.805       0.85     0.805
                      200x                   0.84       0.84        0.845    0.84
                      400x                   0.74       0.74        0.75     0.74
Inception-v3          40x                    0.71       0.71        0.715    0.72
                      100x                   0.67       0.67        0.785    0.67
                      200x                   0.715      0.715       0.8      0.715
                      400x                   0.595      0.595       0.745    0.595
Inception-Resnet-v2   40x                    0.925      0.925       0.925    0.925
                      100x                   0.67       0.67        0.785    0.67
                      200x                   0.945      0.945       0.95     0.945
                      400x                   0.885      0.885       0.89     0.885
VGG16                 40x                    0.925      0.925       0.95     0.9
                      100x                   0.905      0.905       0.92     0.89
                      200x                   0.93       0.93        0.97     0.925
                      400x                   0.825      0.825       0.845    0.82

Table 4.2: The results of the models according to magnification factors
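The per-model average accuracy across the four magnification factors in Table 4.2 can be checked with a few lines; Efficientnetv2-b0 averages (0.955 + 0.93 + 0.98 + 0.81) / 4 = 0.91875, which matches the 91.87% figure quoted in the conclusion.

```python
# Accuracy per magnification factor (40x, 100x, 200x, 400x), from Table 4.2.
accuracy = {
    "Efficientnetv2-b0":   [0.955, 0.93, 0.98, 0.81],
    "Mobilenet-v2":        [0.645, 0.755, 0.84, 0.87],
    "Resnet-50":           [0.87, 0.805, 0.84, 0.74],
    "Inception-v3":        [0.71, 0.67, 0.715, 0.595],
    "Inception-Resnet-v2": [0.925, 0.67, 0.945, 0.885],
    "VGG16":               [0.925, 0.905, 0.93, 0.825],
}

# Mean accuracy over the four magnification factors for each model.
averages = {model: sum(vals) / len(vals) for model, vals in accuracy.items()}
best = max(averages, key=averages.get)  # model with the highest mean accuracy
```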
Chapter 5 Conclusion
Breast cancer is the most common type of cancer among women, and it is the single most common cause of death among women aged 35 to 54. Nowadays, with the help of advanced technologies and the development of machine learning, tumor type detection is becoming fast and accessible. This paper compared and modified several pre-trained deep models by changing their parameters and adding extra layers, and found that pre-trained CNN architectures do not always deliver the desired results on their own: it is essential to prepare the data well and shape it to suit these architectures. After all the experiments, we estimate that data quality accounts for 60-70% of a model's success. Our initial task was to compare and modify the models, select suitable parameters for them, and improve them using optimization methods, fine-tuning, and appropriate data augmentation. Based on the BreaKHis data, a total of 30 models were trained, of which, on average, we achieved 90% accuracy and above.

The best result was shown by EfficientNet with 94.5% accuracy on the full data; averaged over the four magnification factors, it reached 91.87% accuracy on the test data, which is higher than comparable state-of-the-art studies. For future work, we will implement a data-fusion methodology that compares the results of the best models to help improve accuracy, and add a generative adversarial network (GAN) technique to enrich complex images and improve model quality. In conclusion, as discussed earlier, detecting malignant and benign breast tumors is a complex procedure; we expect our methods to work as intended, and these long processes can help to identify the type of tumor, helping to save lives.
Bibliography
[1] Wei Wang, Yutao Li, Ting Zou, Xin Wang, Jieyu You, and Yanhong Luo. A novel image classification approach via dense-mobilenet models. Mobile Information Systems, 2020, 2020.
[2] Timothy J Key, Pia K Verkasalo, and Emily Banks. Epidemiology of breast cancer. The lancet oncology, 2(3):133–140, 2001.
[3] Ainur Orazayeva, DA Tusupov, SV Pavlov, and GB Abdikerimova. Efficiency of breast cancer biomedical image processing using filters. Proceedings of the National Academy of Sciences of the Republic of Kazakhstan. Series of physics and information technologies, (1):69–76, 2022.
[4] Christiane K Kuhl, Simone Schrading, Claudia C Leutner, Nuschin Morakkabati-Spitz, Eva Wardelmann, Rolf Fimmers, Walther Kuhn, and Hans H Schild. Mammography, breast ultrasound, and magnetic resonance imaging for surveillance of women at high familial risk for breast cancer. Journal of clinical oncology, 23(33):8469–8476, 2005.
[5] Gary H Lyman, Armando E Giuliano, Mark R Somerfield, Al B Benson, Diane C Bodurka, Harold J Burstein, Alistair J Cochran, Hiram S Cody, Stephen B Edge, Sharon Galper, et al. American society of clinical oncology guideline recommendations for sentinel lymph node biopsy in early-stage breast cancer. Journal of clinical oncology, 23(30):7703–7720, 2005.
[6] Dirk M Elston, Erik J Stratman, and Stanley J Miller. Skin biopsy: Biopsy issues in specific diseases. Journal of the American Academy of Dermatology, 74(1):1–16, 2016.
[7] Umberto Veronesi, Giovanni Paganelli, Giuseppe Viale, Alberto Luini, Stefano Zurrida, Viviana Galimberti, Mattia Intra, Paolo Veronesi, Chris Robertson, Patrick Maisonneuve, et al. A randomized comparison of sentinel-node biopsy with routine axillary dissection in breast cancer. New England Journal of Medicine, 349(6):546–553, 2003.
[8] Sunil R Lakhani, Ian O Ellis, Stuart Schnitt, Puay Hoon Tan, and Marc van de Vijver. WHO classification of tumours of the breast. 2012.
[9] Walaa N Ismail, Mohammad Mehedi Hassan, Hessah A Alsalamah, and Giancarlo Fortino. Cnn-based health model for regular health factors analysis in internet-of-medical things environment. IEEE Access, 8:52541–52549, 2020.
[10] Mitko Veta, Josien PW Pluim, Paul J Van Diest, and Max A Viergever. Breast cancer histopathology image analysis: A review. IEEE transactions on biomedical engineering, 61(5):1400–1411, 2014.
[11] Metin N Gurcan, Laura E Boucheron, Ali Can, Anant Madabhushi, Nasir M Rajpoot, and Bulent Yener. Histopathological image analysis: A review. IEEE reviews in biomedical engineering, 2:147–171, 2009.
[12] Muhammad Khalid Khan Niazi, Anil V Parwani, and Metin N Gurcan. Digital pathology and artificial intelligence. The lancet oncology, 20(5):e253–e261, 2019.
[13] Anant Madabhushi and George Lee. Image analysis and machine learning in digital pathology: Challenges and opportunities. Medical image analysis, 33:170–175, 2016.
[14] Jun Cheng, Yuting Liu, Wei Huang, Wenhui Hong, Lingling Wang, Xiaohui Zhan, Zhi Han, Dong Ni, Kun Huang, and Jie Zhang. Computational image analysis identifies histopathological image features associated with somatic mutations and patient survival in gastric adenocarcinoma. Frontiers in Oncology, 11:1058, 2021.
[15] Jordan T Ash, Gregory Darnell, Daniel Munro, and Barbara E Engelhardt. Joint analysis of expression levels and histological images identifies genes associated with tissue morphology. Nature communications, 12(1):1–12, 2021.
[16] Fitzgerald CE Rohde GK. Shifat-E-Rabbi M, Yin X. Cell image classification:
A comparative overview. Cytometry A., 97(4):347β362, 2020.
[17] Baris Kayalibay, Grady Jensen, and Patrick van der Smagt. Cnn-based segmen- tation of medical imaging data. arXiv preprint arXiv:1701.03056, 2017.
[18] Jun Xu, Lei Xiang, Qingshan Liu, Hannah Gilmore, Jianzhong Wu, Jinghai Tang, and Anant Madabhushi. Stacked sparse autoencoder (ssae) for nuclei detection on breast cancer histopathology images. IEEE transactions on medical imaging, 35(1):119β130, 2015.
[19] Nicolas Coudray, Paolo Santiago Ocampo, Theodore Sakellaropoulos, Navneet Narula, Matija Snuderl, David FenyΓΆ, Andre L Moreira, Narges Razavian, and Aristotelis Tsirigos. Classification and mutation prediction from nonβsmall cell lung cancer histopathology images using deep learning. Nature medicine, 24(10):1559β1567, 2018.
[20] AD Belsare and MM Mushrif. Histopathological image analysis using image processing techniques: An overview. Signal & Image Processing, 3(4):23, 2012.
[21] Sonal Kothari, John H Phan, Todd H Stokes, and May D Wang. Pathology imaging informatics for quantitative analysis of whole-slide images. Journal of the American Medical Informatics Association, 20(6):1099–1108, 2013.
[22] Marek Kowal, Paweł Filipczuk, Andrzej Obuchowicz, Józef Korbicz, and Roman Monczak. Computer-aided diagnosis of breast cancer based on fine needle biopsy microscopic images. Computers in biology and medicine, 43(10):1563–1572, 2013.
[23] Fabio Alexandre Spanhol, Luiz S Oliveira, Caroline Petitjean, and Laurent Heutte. Breast cancer histopathological image classification using convolutional neural networks. In 2016 international joint conference on neural networks (IJCNN), pages 2560–2567. IEEE, 2016.
[24] Farjana Parvin and Md Al Mehedi Hasan. A comparative study of different types of convolutional neural networks for breast cancer histopathological image classification. In 2020 IEEE Region 10 Symposium (TENSYMP), pages 945–948. IEEE, 2020.
[25] Neslihan Bayramoglu, Juho Kannala, and Janne Heikkilä. Deep learning for magnification independent breast cancer histopathology image classification. In 2016 23rd International conference on pattern recognition (ICPR), pages 2440–2445. IEEE, 2016.
[26] Paweł Filipczuk, Thomas Fevens, Adam Krzyżak, and Roman Monczak. Computer-aided breast cancer diagnosis based on the analysis of cytological images of fine needle biopsies. IEEE transactions on medical imaging, 32(12):2169–2178, 2013.
[27] Yungang Zhang, Bailing Zhang, Frans Coenen, and Wenjin Lu. Breast cancer diagnosis from biopsy images with highly reliable random subspace classifier ensembles. Machine vision and applications, 24(7):1405–1420, 2013.
[28] Juan Zuluaga-Gomez, Zeina Al Masry, Khaled Benaggoune, Safa Meraghni, and Nourredine Zerhouni. A cnn-based methodology for breast cancer diagnosis using thermal images. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 9(2):131–145, 2021.
[29] Fabio A Spanhol, Luiz S Oliveira, Caroline Petitjean, and Laurent Heutte. A dataset for breast cancer histopathological image classification. IEEE transactions on biomedical engineering, 63(7):1455–1462, 2015.
[30] Luis Perez and Jason Wang. The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621, 2017.
[31] Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4510–4520, 2018.