DOI: 10.5555/188490.188585

Improving text retrieval for the routing problem using latent semantic indexing

Published: 01 August 1994
Reviews

Alan F. Smeaton

Routing is an information retrieval application based on static user queries or profiles matched against an incoming stream of documents. Hull applies the latent semantic indexing (LSI) technique to the routing problem. Information retrieval research is dominated by work based on the vector space model, in which index terms are represented as orthogonal base vectors and documents and queries are represented as vectors in this vector space. One of the drawbacks of this model is that, in reality, index terms are not independent but have varying degrees of term associations. LSI is a statistical technique that transforms the search space into a space with much-reduced dimensionality that builds in term associations, albeit at the expense of computational overhead. Using a small test collection, this paper reports marginal improvements in the effectiveness of routing with LSI over that of the vector space model. The author then combines LSI with statistical classification techniques based on discriminant analysis and obtains greater improvements in effectiveness. Because of the small size of the document set used, this work can be described as exploratory; the interesting work has still to come when the technique is scaled up to much larger collections.
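
The contrast the review draws between the vector space model and LSI can be illustrated with a toy sketch. This is not Hull's implementation: the five-term vocabulary, the four training documents, and the helper names (`top_eigenvectors`, `project`) are all hypothetical, and a small power iteration stands in for a production SVD routine. The projection also omits any scaling by singular values, which is a simplifying assumption.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    # Assumes v is non-zero, which holds for the toy data below.
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na > 0 and nb > 0 else 0.0

def top_eigenvectors(matrix, k, iters=200):
    """Top-k eigenvectors of a symmetric PSD matrix (power iteration + deflation).

    Applied to the term-term matrix A A^T, these are the leading left
    singular vectors of the term-document matrix A, i.e. the LSI basis.
    """
    m = [row[:] for row in matrix]
    n = len(m)
    basis = []
    for _ in range(k):
        v = normalize([1.0] * n)
        for _ in range(iters):
            v = normalize([dot(row, v) for row in m])
        eigenvalue = dot(v, [dot(row, v) for row in m])
        basis.append(v)
        # Deflate so the next pass converges to the next eigenvector.
        for i in range(n):
            for j in range(n):
                m[i][j] -= eigenvalue * v[i] * v[j]
    return basis

# Hypothetical vocabulary and training collection (term counts per document).
TERMS = ["car", "automobile", "engine", "flower", "petal"]
TRAINING_DOCS = [
    [1, 0, 1, 0, 0],  # car engine
    [0, 1, 1, 0, 0],  # automobile engine
    [0, 0, 0, 1, 1],  # flower petal
    [0, 0, 0, 1, 1],  # flower petal
]

n_terms = len(TERMS)
term_term = [[sum(d[i] * d[j] for d in TRAINING_DOCS) for j in range(n_terms)]
             for i in range(n_terms)]
basis = top_eigenvectors(term_term, k=2)  # reduce 5 term dimensions to 2

def project(vec):
    """Map a term-space vector into the reduced LSI space."""
    return [dot(u, vec) for u in basis]

profile = [1, 0, 0, 0, 0]     # static routing profile: "car"
doc_auto = [0, 1, 0, 0, 0]    # incoming document: "automobile"
doc_flower = [0, 0, 0, 1, 0]  # incoming document: "flower"

vsm_score = cosine(profile, doc_auto)                    # orthogonal terms
lsi_score = cosine(project(profile), project(doc_auto))  # shared context
lsi_off_topic = cosine(project(profile), project(doc_flower))
```

In the raw term space the profile and the "automobile" document share no term, so their cosine is zero. In the reduced space the two terms are pulled together because both co-occur with "engine" in the training documents, while the "flower" document stays unrelated to the profile. On a collection this small the effect is exaggerated; it only shows the mechanism, not the size of improvement reported in the paper.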

Published In

SIGIR '94: Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
August 1994
363 pages
ISBN: 038719889X

Publisher

Springer-Verlag

Berlin, Heidelberg

Qualifiers

  • Article

Conference

SIGIR94
Sponsor:
  • AICA
  • Irish Comp Soc
  • SIGIR
  • BCS-IRSG
  • BCS-IRSB
  • Dublin City University

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Cited By

  • (2018) Implementing an individualized recommendation system using latent semantic analysis. Proceedings of the 6th International Conference on Information and Education Technology, pp. 239-243. DOI: 10.1145/3178158.3178163. Online publication date: 6-Jan-2018.
  • (2016) Bayesian Performance Comparison of Text Classifiers. Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pp. 15-24. DOI: 10.1145/2911451.2911547. Online publication date: 7-Jul-2016.
  • (2014) Dynamic Clustering Based on Minimum Spanning Tree and Context Similarity for Enhancing Document Classification. International Journal of Information Retrieval Research, 4:1, pp. 46-60. DOI: 10.4018/ijirr.2014010103. Online publication date: 1-Jan-2014.
  • (2012) FSKNN. Expert Systems with Applications: An International Journal, 39:3, pp. 2813-2821. DOI: 10.1016/j.eswa.2011.08.141. Online publication date: 1-Feb-2012.
  • (2009) Text and hypertext categorization. Artificial intelligence, pp. 11-38. DOI: 10.5555/1793943.1793945. Online publication date: 1-Jan-2009.
  • (2009) Wikipedia-based semantic interpretation for natural language processing. Journal of Artificial Intelligence Research, 34:1, pp. 443-498. DOI: 10.5555/1622716.1622728. Online publication date: 1-Mar-2009.
  • (2009) An analysis of latent semantic term self-correlation. ACM Transactions on Information Systems, 27:2, pp. 1-35. DOI: 10.1145/1462198.1462200. Online publication date: 9-Mar-2009.
  • (2008) An empirical study of required dimensionality for large-scale latent semantic indexing applications. Proceedings of the 17th ACM conference on Information and knowledge management, pp. 153-162. DOI: 10.1145/1458082.1458105. Online publication date: 26-Oct-2008.
  • (2007) Semi-supervised single-label text categorization using centroid-based classifiers. Proceedings of the 2007 ACM symposium on Applied computing, pp. 844-851. DOI: 10.1145/1244002.1244189. Online publication date: 11-Mar-2007.
  • (2007) Text classification based on partial least square analysis. Proceedings of the 2007 ACM symposium on Applied computing, pp. 834-838. DOI: 10.1145/1244002.1244187. Online publication date: 11-Mar-2007.
