Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3607720.3607796acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnissConference Proceedingsconference-collections
research-article

Tweets similarity classification based on Machine Learning Algorithms, TF-IDF and the Dynamic Case Based Reasoning

Published: 13 November 2023 Publication History

Abstract

The research on the field of Twitter sentiment analysis, which aims to extract users’ sentiments through their public opinion about a given topic, has been increased and grown rapidly across a broad range of disciplines in the last decade. In this article, we propose a hybrid approach for Tweets similarity classification Based on Dynamic Case Based Reasoning approach, machine learning algorithms and Multi-Agent System. Our approach proposes a multi-agent adaptive system for Tweets similarity classification. It combines the Dynamic Case-Based Reasoning approach with the scientific measurement of keyword weight (Term Frequency- Inverse Document Frequency, TF-IDF). It consists of gathering and pre-processing tweets about a given topic and use a feature extraction to extract useful features. Machine Learning algorithms are then used for similarity content-based classification. Our approach is general and can be used to follow users’ tweets traces to predict their sentiments and provide them with an individualized content. In this study, Covid-19 tweets have been taken as an example.

References

[1]
GUPTA, Bhumika, NEGI, Monika, VISHWAKARMA, Kanika, et al. Study of Twitter sentiment analysis using machine learning algorithms on Python. International Journal of Computer Applications, 2017, vol. 165, no 9, p. 29-34.
[2]
Kumar, A., & Jaiswal, A. (2019). Systematic literature review of sentiment analysis on Twitter using soft computing techniques. Concurrency and Computation: Practice and Experience, e5107. 
[3]
MÜLLER, Martin, SALATHÉ, Marcel, et KUMMERVOLD, Per E. Covid-twitter-bert: A natural language processing model to analyse covid-19 content on twitter. arXiv preprint arXiv:2005.07503, 2020.
[4]
Go, Alec, Richa Bhayani, and Lei Huang. "Twitter sentiment classification using distant supervision." CS224N project report, Stanford 1.12 (2009): 2009.
[5]
Elbagir, Shihab, and Jing Yang. "Twitter sentiment analysis using natural language toolkit and VADER sentiment." Proceedings of the international multiconference of engineers and computer scientists. Vol. 122. 2019.
[6]
S. E. Saad and J. Yang, "Twitter Sentiment Analysis Based on Ordinal Regression," in IEEE Access, vol. 7, pp. 163677-163685, 2019.
[7]
RACHMAN, Fika Hastarita, et al. Twitter sentiment analysis of Covid-19 using term weighting TF-IDF and logistic regresion. In : 2020 6th Information Technology International Seminar (ITIS). IEEE, 2020. p. 238-242.
[8]
Le, D.VK., Chen, Z., Wong, Y.W. et al. A complete online-SVM pipeline for case-based reasoning system: a study on pipe defect detection system. Soft Comput 24, 16917–16933 (2020). https://doi.org/10.1007/s00500-020-04985-7
[9]
Kouissi, Mohamed, El Mokhtar En-Naimi, and Abdelhamid Zouhair. 2022. “Hybrid Approach for Wind Turbines Power Curve Modeling Founded on Multi-Agent System and Two Machine Learning Algorithms, K-Means Method and the K-Nearest Neighbors, in the Retrieve Phase of the Dynamic Case Based Reasoning”. International Journal of Online and Biomedical Engineering (iJOE) 18 (06):pp. 110-122. https://doi.org/10.3991/ijoe.v18i06.29565.
[10]
Kolodner J (1996) in Case-Based Reasoning: Experiences, Lessons, and Future Directions. AAAI Press, USA, pp 349–370
[11]
AAMODT A. & PLAZA E. Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches. AI Communications. IOS Press, Vol. 7: 1, pp. 39-59, 1994.
[12]
CORDIER, Amélie, FUCHS, Béatrice, et MILLE, Alain. Engineering and learning of adaptation knowledge in case-based reasoning. In : Managing Knowledge in a World of Networks: 15th International Conference, EKAW 2006, Poděbrady, Czech Republic, October 2-6, 2006. Proceedings 15. Springer Berlin Heidelberg, 2006. p. 303-317.
[13]
RASOVSKA, Ivana. Contribution à une méthodologie de capitalisation des connaissances basée sur le raisonnement à partir de cas: Application au diagnostic dans une plateforme d'e-maintenance. 2006. Thèse de doctorat. Université de Franche-Comté.
[14]
E. M. En-Naimi and A. Zouhair, “Intelligent dynamic case-based reasoning using multiagents system in adaptive e-service, ecommerce and e-learning systems,” Int. J. of Knowledge and Learning (IJKL), vol. 11, pp. 42-57, 2016.
[15]
A. Zouhair, “Raisonnement à Partir de Cas Dynamique Multi-Agents: application à un système de tuteur intelligent»,” PhD in computer science, in Cotutelle between the Faculty of Sciences and Technologies of Tangier (Morocco) and the University of Le Havre (France), 2014.
[16]
Anthony Jnr., B., Abdul Majid, M. i Romli, A. (2019). Hybrid Multi-Agents and Case Based Reasoning for Aiding Green Practice in Institutions of Higher Learning. Tehnički vjesnik, 26 (1), 13-21. https://doi.org/10.17559/TV-20170301074502
[17]
Kalra, M., Lal, N., & Qamar, S. (2017). K-Mean Clustering Algorithm Approach for Data Mining of Heterogeneous Data. Lecture Notes in Networks and Systems, 61 70. https:// doi.org/10.1007/978-981-10-3920-1_7
[18]
F.-Z. Hibbi, O. Abdoun, and E. K. Haimoudi, “Smart Tutoring System: A Predictive Personalized Feedback in a Pedagogical Sequence”, Int. J. Emerg. Technol. Learn., vol. 16, no. 20, pp. pp. 263–268, Oct. 2021. https://doi.org/10.3991/ijet.v16i20.24783
[19]
HARAKAWA, Ryosuke, TAKIMURA, Shoji, OGAWA, Takahiro, et al. Consensus clustering of tweet networks via semantic and sentiment similarity estimation. IEEE Access, 2019, vol. 7, p. 116207-116217.
[20]
Joshi K D and Nalwade P S 2013 Modified K-Means for Better Initial Cluster Centres. International Journal of Computer Science and Mobile Computing II 7 p 2.
[21]
Bain K K, Firli I, And Tri S 2016 Genetic Algorithm For Optimized Initial Centers K-Means Clusterin
[22]
ADDIGA, Akash et BAGUI, Sikha. Sentiment Analysis on Twitter Data Using Term Frequency-Inverse Document Frequency. Journal of Computer and Communications, 2022, vol. 10, no 8, p. 117-128.
[23]
GUPTA, Bhumika, NEGI, Monika, VISHWAKARMA, Kanika, et al. Study of Twitter sentiment analysis using machine learning algorithms on Python. International Journal of Computer Applications, 2017, vol. 165, no 9, p. 29-34.
[24]
https://wordcloud.app/
[25]
YUAN, Chunhui et YANG, Haitao. Research on K-value selection method of K-means clustering algorithm. J—Multidisciplinary Scientific Journal, 2019, vol. 2, no 2, p. 226-235.
[26]
NG, S. C. Principal component analysis to reduce dimension on digital image. Procedia computer science, 2017, vol. 111, p. 113-119.
[27]
https://matplotlib.org/ ; https://www.python.org/

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
NISS '23: Proceedings of the 6th International Conference on Networking, Intelligent Systems & Security
May 2023
451 pages
ISBN:9798400700194
DOI:10.1145/3607720
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 November 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Dynamic Case Based Reasoning (DCBR)
  2. Machine Learning
  3. Multi Agents System
  4. Term Frequency-Inverse Document Frequency (TF-IDF)
  5. Tweets similarity

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

NISS 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 15
    Total Downloads
  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)1
Reflects downloads up to 26 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media