Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1151736.1151752dlproceedingsArticle/Chapter ViewAbstractPublication PagesadcConference Proceedingsconference-collections
Article
Free access

A new approach to intelligent text filtering based on novelty detection

Published: 01 January 2006 Publication History

Abstract

This paper presents an original approach to modelling user's information need in text filtering environment. This approach relies on a specific novelty detection model which allows both accurate learning of user's profile and evaluation of the coherency of user's behaviour during his interaction with the system. Thanks to an online learning algorithm, the novelty detection model is also able to track changes in user's interests over time.The proposed approach has been successfully tested on the Reuters-21578 benchmark. The experimental results prove that this approach signicantly outperforms the well-known Rocchio's learning algorithm.

References

[1]
Ault, T. & Yang, Y. (2001), kNN, Rocchio and Metrics for Information Filtering at TREC-10, in 'The Tenth Text REtrieval Conference (TREC 10)'
[2]
Dumais, S., Platt, J., Heckerman, D.,& Sahami, M. (1998), Inductive Learning Algorithms and Representations for Text Categorization, in 'Proceedings of the Seventh International Conference on Information and Knowledge Management CIKM', pp. 148-155.
[3]
Kassab, R., Lamirel, J.C., & Nauer, E. (2005), Novelty Detection for Modeling User's Profile, in 'Proceedings of the 18th International Florida Artificial Intelligence Research Society Conference (FLAIRS 05)', Clearwater Beach, Florida, AAAI Press. pp. 830-831.
[4]
Kassab, R., Lamirel, J.C., & Nauer, E. (2005), Une nouvelle approche pour la modelisation du profil de l'utilisateur dans les systèèmes de filtrage d'information : le modèle de filtre détecteur de nouveauté, in 'The Deuxième Conference en Recherche d'information et Applications, CORIA'05', France, pp. 185-200.
[5]
Kohonen, T. (1984), Self organisation and associative memory, Springer Verlag, New York, USA.
[6]
Lamirel, J.C., & Crééhange, M. (1994), Application of a Symbolico-Connectionist Approach for the Design of a Highly Interactive Documentary Database Interrogation System with On-Line Learning Capabilities, in 'Proceedings of the Third International Conference on Information and Knowledge Management (CIKM 94)', pp. 155-163.
[7]
Liu, T., Liu, S., Chen, Z. & Ma, W. (2003), An Evaluation on Feature Selection for Text Clustering, in 'Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003)'.
[8]
Robertson, S.E. & Soboroff, I. (2001), The TREC-9 Filtering Track Final Report, in 'Proceedings of the 9th Text REtrieval Conference (TREC 9)', pp. 25-40.
[9]
Rocchio, J. J. (1971), Relevance feedback in information retrieval, In The SMART Retrieval System : Experiments in Automatic Document Processing , Prentice Hall Inc., Englewood Cliffs, New Jersey.
[10]
Salton, G. & Buckley, C. (1988), Term weighting approaches in automatic text retrieval, in 'Information Processing and Management', 24(5), 513-523.
[11]
Schapire, R., Singer, Y. & Singhal, A. (1998), Boosting and Rocchio Applied to Text Filtering, in 'Proceedings of the Twenty-first Annual International ACM SIGIR Conference on Research and Development in Information Retrieval', pp. 215-223.
[12]
Schutze, H., Hull, David A. & Pedersen, Jan O. (1995), A Comparison of Classifiers and Document Representations for the Routing Problem, in 'Proceedings of SIGIR'95, the International Conference on Research and Development in Information Retrieval (1995)', ACM Press, pp. 229-237.
[13]
Shankar, S. & Karypis, George(2000), Weight Adjustment Schemes for a Centroid Based Classifier, 'Computer Science Technical Report (TR00-035)', Department of Computer Science, University of Minnesota, Minneapolis, Minnesota.
[14]
Singhal, A., Mitra, M. & Buckley, C. (1997), Learning routing queries in a query zone, in 'Proceedings SIGIR'97, 20th ACM International Conference on Research and Development in Information Retrieval', pp. 25-32.
[15]
Yang, Y. & Pedersen, Jan O. (1997), A Comparative Study on Feature Selection in Text Categorization, in 'Proceedings of ICML-97, 14th International Conference on Machine Learning', pp. 412-420.
[16]
Zhang, Y., Callan, J. & Minka, T. (2002), Novelty and redundancy detection in adaptive filtering, in 'Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval', pp. 81-88.

Cited By

View all
  • (2018)Network-based approach to detect novelty of scholarly literatureInformation Sciences: an International Journal10.1016/j.ins.2017.09.037422:C(542-557)Online publication date: 1-Jan-2018
  • (2007)Towards a synthetic analysis of user's information need for more effective personalized filtering servicesProceedings of the 2007 ACM symposium on Applied computing10.1145/1244002.1244190(852-859)Online publication date: 11-Mar-2007

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ADC '06: Proceedings of the 17th Australasian Database Conference - Volume 49
January 2006
202 pages
ISBN:1920682317

Publisher

Australian Computer Society, Inc.

Australia

Publication History

Published: 01 January 2006

Author Tags

  1. information filtering
  2. novelty detection
  3. online learning
  4. personalization
  5. profile

Qualifiers

  • Article

Conference

ADC '06
January 16 - 19, 2006
Hobart, Australia

Acceptance Rates

Overall Acceptance Rate 98 of 224 submissions, 44%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)5
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Network-based approach to detect novelty of scholarly literatureInformation Sciences: an International Journal10.1016/j.ins.2017.09.037422:C(542-557)Online publication date: 1-Jan-2018
  • (2007)Towards a synthetic analysis of user's information need for more effective personalized filtering servicesProceedings of the 2007 ACM symposium on Applied computing10.1145/1244002.1244190(852-859)Online publication date: 11-Mar-2007

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media