Home > Published Issues > 2023 > Volume 14, No. 6, 2023 >
JAIT 2023 Vol.14(6): 1390-1402
doi: 10.12720/jait.14.6.1390-1402

An Efficient CSPK-FCM Explainable Artificial Intelligence Model on COVID-19 Data to Predict the Emotion Using Topic Modeling

Priya C. and Durai Raj Vincent P. M. *
School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, India
Email: priya.2017@vitstudent.ac.in (P.C.)
*Correspondence: pmvincent@vit.ac.in (D.R.V.P.M.)

Manuscript received June 19, 2023; revised August 12, 2023; accepted August 21, 2023; published December 14, 2023.

Abstract—Incessant COVID-19 pandemic negatively impacts nations throughout the globe. It is necessary to determine how people react to public health interventions and understand their concerns. Twitter is a social media platform that has emerged as a tool for disseminating information, debating concepts, and reviewing or commenting on global issues. This study applies Explainable Artificial Intelligence (XAI) methods, like Cosine Similarity and Polynomial Kernel-centered Fuzzy C-Means (CSPK-FCM) centered topic modeling and Fuzzy Logic with Improved Long Short-Term Memory (FL-ILSTM) centered Sentiment Analysis to COVID-19 data on Twitter. The proposed model has five major steps: preprocessing, feature extraction, term weighting, topic modeling (clustering), and classification. Twitter comments relating to the COVID-19 pandemic are initially collected from publicly accessible websites. The collected data are then preprocessed to remove irrelevant information, namely, noises. The Feature Extraction phase is then performed by extracting emoticon and non-emoticon features. The extracted feature dataset is scored: the Term Frequency Inverse Document Frequency-Chi-Square (TFIDF-CHI) method is utilized for non-emoticon, and the score for the emoticon is assigned based on a few criteria. For Topic modeling, the TFIDF-CHI scores are provided to the CSPK-FCM clustering algorithm, which groups the most frequently discussed topics throughout COVID-19. FL-ILSTM executes the Sentiment analysis of clustered topics and emoticon features. It has extraordinary performance when compared to other methodologies.
 
Keywords—COVID-19, topic modeling, sentiment analysis, twitter sentiment analysis, Explainable Artificial Intelligence (XAI), fuzzy logic, Long Short-Term Memory (LSTM)

Cite: Priya C. and Durai Raj Vincent P. M., "An Efficient CSPK-FCM Explainable Artificial Intelligence Model on COVID-19 Data to Predict the Emotion Using Topic Modeling," Journal of Advances in Information Technology, Vol. 14, No. 6, pp. 1390-1402, 2023.

Copyright © 2023 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.