Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1620163.1620205guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Single document keyphrase extraction using neighborhood knowledge

Published: 13 July 2008 Publication History

Abstract

Existing methods for single document keyphrase extraction usually make use of only the information contained in the specified document. This paper proposes to use a small number of nearest neighbor documents to provide more knowledge to improve single document keyphrase extraction. A specified document is expanded to a small document set by adding a few neighbor documents close to the document, and the graph-based ranking algorithm is then applied on the expanded document set to make use of both the local information in the specified document and the global information in the neighbor documents. Experimental results demonstrate the good effectiveness and robustness of our proposed approach.

References

[1]
Berger, A., and Mittal, V. 2000. OCELOT: A system for summarizing Web Pages. In Proceedings of SIGIR2000.
[2]
Barker, K., and Cornacchia, N. 2000. Using nounphrase heads to extract document keyphrases. In Canadian Conference on AI.
[3]
Böhm, C., and Berchtold, S. 2001. Searching in high-dimensional spaces-index structures for improving the performance of multimedia databases. ACM Computing Surveys, 33(3): 322-373.
[4]
Frank, E.; Paynter, G. W.; Witten, I. H.; Gutwin, C.; and Nevill-Manning, C. G. 1999. Domain-specific keyphrase extraction. Proceedings of IJCAI-99, pp. 668-673.
[5]
Gutwin, C.; Paynter, G. W.; Witten, I. H.; Nevill-Manning, C. G.; and Frank, E. 1999. Improving browsing in digital libraries with keyphrase indexes. Journal of Decision Support Systems, 27, 81-104.
[6]
Hammouda, K. M.; Matute, D. N.; and Kamel, M. S. 2005. CorePhrase: keyphrase extraction for document clustering. In Proceedings of MLDM2005.
[7]
Hulth, A. 2003. Improved automatic keyword extraction given more linguistic knowledge. In Proceedings of EMNLP2003.
[8]
Kelleher, D., and Luz, S. 2005. Automatic hypertext keyphrase detection. In Proceedings of IJCAI2005.
[9]
Krulwich, B., and Burkey, C. 1996. Learning user information interests through the extraction of semantically significant phrases. In AAAI 1996 Spring Symposium on Machine Learning in Information Access.
[10]
Medelyan, O., and Witten, I. H. 2006. Thesaurus based automatic keyphrase indexing. In Proceedings of JCDL2006.
[11]
Mihalcea, R., and Tarau, P. 2004. TextRank: Bringing order into texts. In Proceedings of EMNLP2004.
[12]
Muñoz, A. 1996. Compound key word generation from document databases using a hierarchical clustering ART model. Intelligent Data Analysis, 1(1).
[13]
Nguyen, T. D., and Kan, M.-Y. 2007. Keyphrase extraction in scientific publications. In Proceedings of ICADL2007.
[14]
Over, P. 2001. Introduction to DUC-2001: an intrinsic evaluation of generic news text summarization systems. In Proceedings of DUC2001.
[15]
Page, L.; Brin, S.; Motwani, R.; and Winograd, T. 1998. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Libraries.
[16]
Song, M.; Song, I.-Y.; and Hu, X. 2003. KPSpotter: a flexible information gain-based keyphrase extraction system. In Proceedings of WIDM2003.
[17]
Steier, A. M., and Belew, R. K. 1993. Exporting phrases: A statistical analysis of topical language. In Proceedings of Second Symposium on Document Analysis and Information Retrieval, pp. 179-190.
[18]
Tomokiyo, T., and Hurst, M. 2003. A language model approach to keyphrase extraction. In Proceedings of ACL Workshop on Multiword Expressions.
[19]
Toutanova, K., and Manning, C. D. 2000. Enriching the knowledge sources used in a maximum entropy Part-of-Speech tagger. In Proceedings of EMNLP/VLC-2000.
[20]
Turney, P. D. 2000. Learning algorithms for keyphrase extraction. Information Retrieval, 2:303-336.
[21]
Turney, P. D. 2003. Coherent keyphrase extraction via web mining. In Proc. of IJCAI-03, pages 434-439.
[22]
Wan, X.; Yang, J.; and Xiao, J. 2007. Single document summarization with document expansion. In Proceedings of AAAI2007.
[23]
Witten, I. H.; Paynter, G. W.; Prank, E.; Gutwin, C.; and Nevill-Manning, C. G. 1999. KEA: Practical automatic keyphrase extraction. Proceedings of Digital Libraries 99 (DL'99), pp. 254-256.
[24]
Wong, T.-L.; Lam, W.; and Chan, S.-K. 2006. Collaborative information extraction and mining from multiple web documents. In Proceedings of SDM2006.
[25]
Xue, G.-R.; Lin, C.; Yang, Q.; Xi, W.; Zeng, H.-J.; Yu, Y.; and Chen, Z. 2005. Scalable collaborative filtering using cluster-based smoothing. In Proceedings of SIGIR2005.
[26]
Yih, W.-T.; Goodman, J.; and Carvalho, V. R. 2006. Finding advertising keywords on web pages. In Proceedings of WWW2006.

Cited By

View all
  • (2023)From statistical methods to deep learning, automatic keyphrase predictionInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10338260:4Online publication date: 1-Jul-2023
  • (2022)A new dataset for French and multilingual keyphrase generationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3603027(38046-38059)Online publication date: 28-Nov-2022
  • (2022)Domain-Specific Keyword Extraction Using Joint Modeling of Local and Global Contextual SemanticsACM Transactions on Knowledge Discovery from Data10.1145/349456016:4(1-30)Online publication date: 8-Jan-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
AAAI'08: Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
July 2008
1266 pages
ISBN:9781577353683

Sponsors

  • Association for the Advancement of Artificial Intelligence

Publisher

AAAI Press

Publication History

Published: 13 July 2008

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)From statistical methods to deep learning, automatic keyphrase predictionInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10338260:4Online publication date: 1-Jul-2023
  • (2022)A new dataset for French and multilingual keyphrase generationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3603027(38046-38059)Online publication date: 28-Nov-2022
  • (2022)Domain-Specific Keyword Extraction Using Joint Modeling of Local and Global Contextual SemanticsACM Transactions on Knowledge Discovery from Data10.1145/349456016:4(1-30)Online publication date: 8-Jan-2022
  • (2021)Unsupervised Keyword Combination Query Generation from Online Health Related Content for Evidence-Based Fact CheckingThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487701(267-277)Online publication date: 29-Nov-2021
  • (2021)Attention-based Unsupervised Keyphrase Extraction and Phrase Graph for COVID-19 Medical Literature RetrievalACM Transactions on Computing for Healthcare10.1145/34739393:1(1-16)Online publication date: 15-Oct-2021
  • (2021)An Interactive Neural Network Approach to Keyphrase Extraction in Talent RecruitmentProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482319(2383-2393)Online publication date: 26-Oct-2021
  • (2021)CrowdTC: Crowd-powered Learning for Text ClassificationACM Transactions on Knowledge Discovery from Data10.1145/345721616:1(1-23)Online publication date: 20-Jul-2021
  • (2021)Web Document Encoding for Structure-Aware Keyphrase ExtractionProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3463067(1823-1827)Online publication date: 11-Jul-2021
  • (2021)A survey on different dimensions for graphical keyword extraction techniquesArtificial Intelligence Review10.1007/s10462-021-10010-654:6(4731-4770)Online publication date: 1-Aug-2021
  • (2021)A novel cluster-based approach for keyphrase extraction from MOOC video lecturesKnowledge and Information Systems10.1007/s10115-021-01568-263:7(1663-1686)Online publication date: 1-Jul-2021
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media