Structural analysis of chat messages for topic detection

Haichao Dong (School of Computer Engineering, Nanyang Technological University, Singapore)
Siu Cheung Hui (School of Computer Engineering, Nanyang Technological University, Singapore)
Yulan He (School of Computer Engineering, Nanyang Technological University, Singapore)

Online Information Review

ISSN: 1468-4527

Article publication date: 1 September 2006

Abstract

Purpose

The purpose of this research is to study the characteristics of chat messages by analysing a collection of 33,121 sample messages gathered from 1,700 conversation sessions between 72 pairs of MSN Messenger users over a four‐month period from June to September 2005. The primary objective of chat message characterization is to understand the properties of chat messages in order to support effective message analysis, such as message topic detection.

Design/methodology/approach

Based on the study of chat message characteristics, an indicative term‐based categorization approach for chat topic detection is proposed. The approach pre‐processes messages using techniques such as sessionalisation of chat messages and extraction of features from icon texts and URLs. Naïve Bayes, Associative Classification, and Support Vector Machine classifiers are then used to categorize the topics of chat sessions.
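The pipeline described above (sessionalisation, URL feature extraction, classification) can be illustrated with a minimal sketch. It is not the authors' implementation: the time-gap rule for splitting sessions, the `SESSION_GAP_SECONDS` value, the tokenisation, and every function and class name are assumptions made for illustration, and icon-text handling is omitted because the abstract does not describe it. Only the Naïve Bayes classifier is shown, in its multinomial form with add-one smoothing.

```python
import math
import re
from collections import Counter, defaultdict
from urllib.parse import urlparse

SESSION_GAP_SECONDS = 600  # assumed threshold; the abstract gives no value
URL_RE = re.compile(r"https?://\S+")


def sessionalise(messages, gap=SESSION_GAP_SECONDS):
    """Group (timestamp_seconds, text) pairs into sessions, splitting at long pauses."""
    sessions, current, last_ts = [], [], None
    for ts, text in sorted(messages):
        if last_ts is not None and ts - last_ts > gap and current:
            sessions.append(current)
            current = []
        current.append(text)
        last_ts = ts
    if current:
        sessions.append(current)
    return sessions


def extract_features(text):
    """Return tokens from the host and path of any URL, then lowercased word tokens."""
    tokens = []
    for url in URL_RE.findall(text):
        parsed = urlparse(url)
        host_and_path = (parsed.netloc + parsed.path).lower()
        tokens += [t for t in re.split(r"[^a-z0-9]+", host_and_path) if t]
    plain = URL_RE.sub(" ", text).lower()
    tokens += re.findall(r"[a-z0-9]+", plain)
    return tokens


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.term_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.term_counts[label].update(tokens)
        self.vocab = {t for counts in self.term_counts.values() for t in counts}
        return self

    def predict(self, tokens):
        total = sum(self.class_counts.values())
        best, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.term_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)
            for t in tokens:
                if t in self.vocab:  # ignore terms never seen in training
                    score += math.log((counts[t] + 1) / denom)
            if score > best_score:
                best, best_score = label, score
        return best
```

In this sketch, each session returned by `sessionalise` would be joined into one text, passed through `extract_features`, and the resulting token list given to `NaiveBayes.predict`. Treating a whole session rather than a single short message as the unit of classification is the point of the sessionalisation step.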

Findings

The indicative term‐based approach is superior to the traditional document frequency‐based approach for feature selection in chat topic categorization.
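For reference, the document frequency baseline mentioned above can be sketched as follows. This shows only the conventional baseline, under the assumption that each chat session is treated as one document; the indicative term‐based selection is not specified in this abstract and is therefore not sketched. The function name and the alphabetical tie-breaking rule are illustrative choices.

```python
from collections import Counter


def select_by_document_frequency(sessions, k):
    """Return the k terms that occur in the most sessions (ties broken alphabetically)."""
    df = Counter()
    for tokens in sessions:
        df.update(set(tokens))  # count each term at most once per session
    ranked = sorted(df.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:k]]
```

A call such as `select_by_document_frequency(tokenised_sessions, 500)` would keep the 500 most widespread terms as features, where `tokenised_sessions` is a hypothetical list of token lists, one per session.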

Originality/value

This paper studies the characteristics of chat messages and proposes an indicative term‐based categorization approach for chat topic detection.

Citation

Dong, H., Hui, S.C. and He, Y. (2006), "Structural analysis of chat messages for topic detection", Online Information Review, Vol. 30 No. 5, pp. 496-516. https://doi.org/10.1108/14684520610706398

Publisher

Emerald Group Publishing Limited

Copyright © 2006, Emerald Group Publishing Limited
