Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1183614.1183728acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Adapting association patterns for text categorization: weaknesses and enhancements

Published: 06 November 2006 Publication History

Abstract

The use of association patterns for text categorization has attracted great interest and a variety of useful methods have been developed. However, the key characteristics of pattern-based text categorization remain unclear. Indeed, there are still no concrete answers for the following two questions: First, what kind of association patterns are the best candidate for pattern-based text categorization? Second, what is the most desirable way to use patterns for text categorization? In this paper, we focus on answering the above two questions. Specifically, we show that hyperclique patterns are more desirable than frequent patterns for text categorization. Along this line, we develop an algorithm for text categorization using hyperclique patterns. The experimental results show that our method provides better performance than state-of-the-art methods in terms of both computational performance and classification accuracy.

References

[1]
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In SIGMOD, 1993.
[2]
C.-C. Chang and C.-J. Lin. Libsvm. In http://www.csie.ntu.edu.tw/ cjlin/libsvm/.
[3]
S. T. Dumais, J. Platt, D. Heckerman, and M. Sahami. Inductive learning algorithms and representations for text categorization. In CIKM, 1998.
[4]
J. Feng, H. Liu, and J. Zou. Moderate itemset fittest for text classification. In WWW, 2005.
[5]
T. Joachims. A probabilistic analysis of the rocchio algorithm with tfidf for text categorization. In ICML, 1997.
[6]
W. Li, J. Han, and J. Pei. Cmar: Accurate and efficient classification based on multiple class-association rules. In ICDM, 2001.
[7]
B. Liu, W. Hsu, and Y. Ma. Integrating classification and association rule mining. In KDD, 1998.
[8]
J. Wang and G. Karypis. Harmony: Efficiently mining the best rules for classification. In SDM, 2005.
[9]
H. Xiong, P. Tan, and V. Kumar. Mining strong affinity association patterns in data sets with skewed support distribution. In ICDM, 2003.
[10]
J. Yan, N. Liu, B. Zhang, S. Yan, and et al. Ocfs: Optimal orthogonal centroid feature selection for text categorization. In SIGIR, 2005.

Index Terms

  1. Adapting association patterns for text categorization: weaknesses and enhancements

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
    November 2006
    916 pages
    ISBN:1595934332
    DOI:10.1145/1183614
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 November 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. hyperclique patterns
    2. text categorization

    Qualifiers

    • Article

    Conference

    CIKM06
    CIKM06: Conference on Information and Knowledge Management
    November 6 - 11, 2006
    Virginia, Arlington, USA

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 312
      Total Downloads
    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media