Structural analysis of chat messages for topic detection
Abstract
Purpose
The purpose of this research is to study the characteristics of chat messages from analysing a collection of 33,121 sample messages gathered from 1,700 sessions of conversations of 72 pairs of MSN Messenger users over a four month duration from June to September of 2005. The primary objective of chat message characterization is to understand the properties of chat messages for effective message analysis, such as message topic detection.
Design/methodology/approach
From the study on chat message characteristics, an indicative term‐based categorization approach for chat topic detection is proposed. In the proposed approach, different techniques such as sessionalisation of chat messages and extraction of features from icon texts and URLs are incorporated for message pre‐processing. Naïve Bayes, Associative Classification, and Support Vector Machine are employed as classifiers for categorizing topics from chat sessions.
Findings
Indicative term‐based approach is superior to the traditional document frequency based approach, for feature selection in chat topic categorization.
Originality/value
This paper studies the characteristics of chat messages and proposes an indicative term‐based categorization approach for chat topic detection.
Keywords
Citation
Dong, H., Cheung Hui, S. and He, Y. (2006), "Structural analysis of chat messages for topic detection", Online Information Review, Vol. 30 No. 5, pp. 496-516. https://doi.org/10.1108/14684520610706398
Publisher
:Emerald Group Publishing Limited
Copyright © 2006, Emerald Group Publishing Limited