research-article

Detect Incorrect Triples in Knowledge Base Based on Triple Confidence Evaluation

Authors:

Xiaojun Huang

ICIBE '17: Proceedings of the 3rd International Conference on Industrial and Business Engineering

Pages 93 - 101

https://doi.org/10.1145/3133811.3133829

Published: 17 August 2017

Abstract

The knowledge base is an important form of data storage and organization in the field of knowledge services, and it is the basis of knowledge representation learning. The accuracy of the contents of a knowledge base determines the effectiveness of knowledge service applications. This study proposes a generic computational methodology to evaluate the confidence level of triples in knowledge bases and to detect potentially incorrect ones for further verification. In our methodology, the confidence of a triple is evaluated based on weighted feature words that characterize the subject-object relation embedded in the triple; the feature words are extracted from a corpus of natural language sentences using statistical and natural language processing techniques. Based on the calculated confidence values, incorrect triples are detected using machine-learning-based classification. An experiment on a data set from industry applications demonstrates the workflow of evaluating triple confidence and detecting incorrect triples with our methodology.
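The workflow described in the abstract can be illustrated with a minimal sketch. The abstract does not give the weighting formula, the confidence measure, or the classifier, so everything below is an assumption made for illustration: `feature_weights` uses a TF-IDF-style score, `triple_confidence` uses cosine similarity between the relation's weighted feature words and the words surrounding the subject and object in the corpus, and `flag_incorrect` substitutes a fixed threshold for the machine-learning-based classification. All function names and the threshold value are hypothetical.

```python
import math
from collections import Counter


def feature_weights(relation_sentences, background_sentences):
    """Weight words seen with a relation using a TF-IDF-style score (assumed scheme)."""
    tf = Counter(w for s in relation_sentences for w in s.lower().split())
    docs = [set(s.lower().split()) for s in relation_sentences + background_sentences]
    n = len(docs)
    weights = {}
    for word, count in tf.items():
        df = sum(1 for doc in docs if word in doc)
        # Words that occur in every sentence get weight 0.
        weights[word] = count * math.log((1 + n) / (1 + df))
    return weights


def triple_confidence(subject, obj, corpus, weights):
    """Cosine similarity between the relation's feature weights and the
    context words of sentences that mention both subject and object."""
    subject, obj = subject.lower(), obj.lower()
    context = Counter()
    for sentence in corpus:
        tokens = sentence.lower().split()
        if subject in tokens and obj in tokens:
            context.update(t for t in tokens if t not in (subject, obj))
    dot = sum(weights.get(w, 0.0) * c for w, c in context.items())
    norm_w = math.sqrt(sum(v * v for v in weights.values()))
    norm_c = math.sqrt(sum(c * c for c in context.values()))
    if norm_w == 0 or norm_c == 0:
        return 0.0  # no evidence in the corpus
    return dot / (norm_w * norm_c)


def flag_incorrect(triples, corpus, weights, threshold=0.2):
    """Stand-in for the classifier: flag triples whose confidence is below
    a hypothetical threshold so they can be verified manually."""
    return [
        t for t in triples
        if triple_confidence(t[0], t[2], corpus, weights) < threshold
    ]
```

In practice the threshold would be replaced by a classifier trained on labelled triples, with the confidence value as one of its input features. Flagged triples are candidates for further verification, not confirmed errors.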

Index Terms

Detect Incorrect Triples in Knowledge Base Based on Triple Confidence Evaluation
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
      1. Ontology engineering
2. Information systems
  1. Data management systems
    1. Database design and models
      1. Data model extensions
        Inconsistent data
      2. Entity relationship models

Published In

ICIBE '17: Proceedings of the 3rd International Conference on Industrial and Business Engineering

August 2017

107 pages

ISBN: 9781450353519

DOI: 10.1145/3133811

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Waseda University

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICIBE 2017

ICIBE 2017: 2017 3rd International Conference on Industrial and Business Engineering

August 17 - 19, 2017

Sapporo, Japan
