Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3133811.3133829acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicibeConference Proceedingsconference-collections
research-article

Detect Incorrect Triples in Knowledge Base Based on Triple Confidence Evaluation

Published: 17 August 2017 Publication History

Abstract

The knowledge base is an important form of data storage and organization in the fields of knowledge service, and it is the basis of knowledge representation learning. The accuracy of the contents in the knowledge base determines the effectiveness of knowledge service applications. This study proposes a generic computational methodology to evaluate the confidence level of triples in knowledge bases and detect potentially incorrect ones for further verification. In our methodology, the confidence of a triple is evaluated based on weighted feature words that are able to characterize the subject-object relation embedded in the triple, and the feature words are extracted from a corpus of natural language sentences using statistical and natural language processing techniques. Based on the calculated confidence values of triples, incorrect triples are detected using machine-learning-based classification. An experiment on a data set of industry applications has been conducted to demonstrate the workflow of evaluating triple confidence and detecting in-correct triples using our methodology.

References

[1]
Linda A. Macaulay, Ian Miles, Jennifer Wilby, Yin Leng Tan, Liping Zhao, Babis Theodoulidis. 2013. Case Studies in Service Innovation (Service Science: Research and Innovations in the Service Economy). Springer Verlag. ISBN-13: 978-1489996862.
[2]
Wang Yan, Cheng Gang. On Service Innovation of Science and Technology Knowledge Service Enterprises from the Perspective of "Internet Plus". Journal of Intelligence, Vol.34, No.10, Oct. 2015, Pages 183--188.
[3]
Parikshit Sondhi, ChengXiang Zhai. 2014. Mining Semi-Structured Online Knowledge Bases to Answer Natural Language Questions on Community QA Websites. Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (CIKM 2014), Pages 341--350.
[4]
Zhuoyu Wei, Jun Zhao, Kang Liu, Zhenyu Qi, Zhengya Sun, Guanhua Tian. Large-scale Knowledge Base Completion: Inferring via Grounding Network Sampling over Selected Instances. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM 2015), Pages 1331--1340.
[5]
Maximilian Nickel, Kevin Murphy, Volker Tresp, Evgeniy Gabrilovich. A Review of Relational Machine Learning for Knowledge Graphs. Proceedings of the IEEE, Dec. 2015, Volume: 104, Issue: 1, Pages: 11--33.
[6]
Maximilian Nickel, Volker Tresp, Hans-Peter Kriegel. A Three-Way Model for Collective Learning on Multi-Relational Data. Proceedings of ICML 2011.
[7]
W. Wong, W. Liu, M. Bennamoun. Acquiring Semantic Relations using the Web for Constructing Lightweight Ontologies. Proceeding of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2009).
[8]
Fabian M. Suchanek, Sebastian Riedel, Sameer Singh, Partha P. Talukdar. AKBC 2013: third workshop on automated knowledge base construction. Proceedings of the 22nd ACM international conference on Information & Knowledge Management (CIKM 2013), Pages 2539--2540.
[9]
Akerkar RA and Sajja Priti Srinivas. 2012. Knowledge-based systems. Jones & Bartlett Publishers, Sudbury, MA, USA. ISBN 9780763776473.
[10]
Mohamed Yahya, Klaus Berberich, Shady Elbassuoni, Gerhard Weikum. Robust question answering over the web of linked data. Proceedings of the 22nd ACM international conference on Information & Knowledge Management (CIKM 2013), Pages 1107--1116.
[11]
Xiaojiang Liu, Nenghai Yu. Multi-Type Web Relation Extraction Based on Bootstrapping. 2010 WASE International Conference on Information Engineering (ICIE), Pages: 24--27.
[12]
E Agichtein, L Gravano. Snowball: Extracting Relations from Large Plain-text Collections. Proceedings of the fifth ACM conference on Digital libraries (DL 2010), Pages: 85--94.
[13]
Xin Wang, Chenggui Han. Knowledge-Based Data Check. Computer Science, 2004, Vo1. 31, No. 10.
[14]
Youngjoong Ko. A study of term weighting schemes using class information for text classification. Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2012), Pages: 1029--1030.
[15]
Grigori Sidorov, Alexander Gelbukh, Helena Gómez-Adorno, David Pinto. 2014. Soft Similarity and Soft Cosine Measure: Similarity of Features in Vector Space Model. Computación y Sistemas (2014), Vol 18, No 3.
[16]
Stanford POS Tagger. http://nlp.stanford.edu/software/tagger.shtml. Retrieved Jan 2016.
[17]
H. C. Wu, R. W. P. Luk, K. F. Wong, K. L. Kwok. Interpreting TF-IDF term weights as making relevance decisions. ACM Transactions on Information Systems (TOIS), Volume 26 Issue 3, June 2008.
[18]
Alberto Barron-Cedeno, Andreas Eiselt, Paolo Rosso. Monolingual Text Similarity Measures: A Comparison of Models over Wikipedia Articles Revisions. Cedeno, 2009.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIBE '17: Proceedings of the 3rd International Conference on Industrial and Business Engineering
August 2017
107 pages
ISBN:9781450353519
DOI:10.1145/3133811
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Waseda University: Waseda University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Confidence Evaluation
  2. Knowledge Base
  3. Knowledge Service
  4. Relation Extraction
  5. Triple

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICIBE 2017

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 54
    Total Downloads
  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media