Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3148011.3148033acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

Detection of Relation Assertion Errors in Knowledge Graphs

Published: 04 December 2017 Publication History

Abstract

Although the link prediction problem, where missing relation assertions are predicted, has been widely researched, error detection did not receive as much attention. In this paper, we investigate the problem of error detection in relation assertions of knowledge graphs, and we propose an error detection method which relies on path and type features used by a classifier for every relation in the graph exploiting local feature selection. We perform an extensive evaluation on a variety of datasets, backed by a manual evaluation on DBpedia and NELL, and we propose and evaluate heuristics for the selection of relevant graph paths to be used as features in our method.

References

[1]
Antoine Bordes, Xavier Glorot, Jason Weston, and Yoshua Bengio. 2012. Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing. In Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2012, La Palma, Canary Islands, April 21-23, 2012. 127--135. http://jmlr.csail.mit.edu/proceedings/papers/v22/bordes12.html
[2]
Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, and Oksana Yakhnenko. {n. d.}. Translating Embeddings for Modeling Multi-relational Data. In Advances in Neural Information Processing Systems 26.
[3]
Antoine Bordes, Jason Weston, Ronan Collobert, and Yoshua Bengio. 2011. Learning Structured Embeddings of Knowledge Bases. In Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2011, San Francisco, California, USA, August 7-11, 2011. http://www.aaai.org/ocs/index.php/AAAI/AAAI11/paper/view/3659
[4]
Leo Breiman. 2001. Random Forests. Machine Learning 45, 1 (01 Oct 2001), 5--32.
[5]
Kai-Wei Chang, Scott Wen-tau Yih, Bishan Yang, and Chris Meek. 2014. Typed Tensor Decomposition of Knowledge Bases for Relation Extraction, In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. https://www.microsoft.com/en-us/research/publication/typed-tensor-decomposition-of-knowledge-bases-for-relation-extraction/
[6]
Corinna Cortes and Vladimir Vapnik. 1995. Support-Vector Networks. Mach. Learn. 20, 3 (Sept. 1995), 273--297.
[7]
Jeremy Debattista, Christoph Lange, and Sören Auer. 2016. A Preliminary Investigation Towards Improving Linked Data Quality Using Distance-Based Outlier Detection. In Semantic Technology - 6th Joint International Conference, JIST 2016, Singapore, Singapore, November 2-4, 2016, Revised Selected Papers. 116--124.
[8]
Matt Gardner and Tom M. Mitchell. 2015. Efficient and Expressive Knowledge Base Completion Using Subgraph Feature Extraction. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, Portugal, September 17-21, 2015. 1488--1498. http://aclweb.org/anthology/D/D15/D15-1173.pdf
[9]
Rodolphe Jenatton, Nicolas L. Roux, Antoine Bordes, and Guillaume R Obozinski. {n. d.}. A latent factor model for highly multi-relational data. In Advances in Neural Information Processing Systems 25.
[10]
Rudolf Kadlec, Ondrej Bajgar, and Jan Kleindienst. 2017. Knowledge Base Completion: Baselines Strike Back. CoRR abs/1705.10744 (2017). http://arxiv.org/abs/1705.10744
[11]
Ni Lao and William W. Cohen. 2010. Relational Retrieval Using a Combination of Path-constrained Random Walks. Mach. Learn. 81, 1 (Oct. 2010), 53--67.
[12]
Ni Lao and William W. Cohen. 2010. Relational retrieval using a combination of path-constrained random walks. Machine Learning 81, 1 (01 Oct 2010), 53--67.
[13]
Ni Lao, Tom Mitchell, and William W. Cohen. 2011. Random Walk Inference and Learning in a Large Scale Knowledge Base. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 529--539. http://dl.acm.org/citation.cfm?id=2145432.2145494
[14]
Yankai Lin, Zhiyuan Liu, and Maosong Sun. 2015. Modeling Relation Paths for Representation Learning of Knowledge Bases. CoRR abs/1506.00379 (2015). http://arxiv.org/abs/1506.00379
[15]
Yankai Lin, Zhiyuan Liu, Maosong Sun, Yang Liu, and Xuan Zhu. 2015. Learning Entity and Relation Embeddings for Knowledge Graph Completion. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI'15). AAAI Press, 2181--2187. http://dl.acm.org/citation.cfm?id=2886521.2886624
[16]
André Melo and Heiko Paulheim. 2017. Local and global feature selection for multilabel classification with binary relevance. Artificial Intelligence Review (2017), 1--28.
[17]
André Melo, Heiko Paulheim, and Johanna Völker. 2016. Type Prediction in RDF Knowledge Bases Using Hierarchical Multilabel Classification. In Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics (WIMS '16). ACM, New York, NY, USA, Article 14, 10 pages.
[18]
Simone Paolo Ponzetto Heiko Paulheim Michael Cochez, Petar Ristoski. 2017. Global RDF Vector Space Embeddings. In International Semantic Web Conference. to appear.
[19]
Maximilian Nickel, Kevin Murphy, Volker Tresp, and Evgeniy Gabrilovich. 2016. A Review of Relational Machine Learning for Knowledge Graphs. Proc. IEEE 104, 1 (2016), 11--33.
[20]
Maximilian Nickel, Lorenzo Rosasco, and Tomaso A. Poggio. 2015. Holographic Embeddings of Knowledge Graphs. CoRR abs/1510.04935 (2015). http://arxiv.org/abs/1510.04935
[21]
Maximilian Nickel, Volker Tresp, and Hans peter Kriegel. 2011. A Three-Way Model for Collective Learning on Multi-Relational Data. In Proceedings of the 28th International Conference on Machine Learning (ICML-11). ACM. http://www.icml-2011.org/papers/438_icmlpaper.pdf
[22]
Heiko Paulheim. 2017. Data-driven joint debugging of the DBpedia mappings and ontology. In European Semantic Web Conference. Springer, 404--418.
[23]
Heiko Paulheim. 2017. Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic Web 8, 3 (2017), 489--508.
[24]
Heiko Paulheim and Christian Bizer. {n. d.}.
[25]
Heiko Paulheim and Christian Bizer. 2014. Improving the Quality of Linked Data Using Statistical Distributions. Int. J. Semant. Web Inf. Syst. 10, 2 (April 2014), 63--86.
[26]
Heiko Paulheim and Aldo Gangemi. 2015. Serving DBpedia with DOLCE--more than just adding a cherry on top. In International Semantic Web Conference. Springer, 180--196.
[27]
Daniel Ringler and Heiko Paulheim. 2017. One Knowledge Graph to Rule them All? Analyzing the Differences between DBpedia, YAGO, Wikidata & co. In 40th German Conference on Artificial Intelligence. to appear.
[28]
Petar Ristoski and Heiko Paulheim. {n. d.}.
[29]
Baoxu Shi and Tim Weninger. 2017. ProjE: Embedding Projection for Knowledge Graph Completion. (2017). https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14279
[30]
Richard Socher, Danqi Chen, Christopher D Manning, and Andrew Ng. 2013. Reasoning With Neural Tensor Networks for Knowledge Base Completion. In Advances in Neural Information Processing Systems 26. Curran Associates, Inc., 926--934. http://papers.nips.cc/paper/5028-reasoning-with-neural-tensor-networks-for-knowledge-base-completion.pdf
[31]
Kristina Toutanova and Danqi Chen. 2015. Observed Versus Latent Features for Knowledge Base and Text Inference, In 3rd Workshop on Continuous Vector Space Models and Their Compositionality. https://www.microsoft.com/en-us/research/publication/observed-versus-latent-features-for-knowledge-base-and-text-inference/
[32]
Théo Trouillon, Johannes Welbl, Sebastian Riedel, Éric Gaussier, and Guillaume Bouchard. 2016. Complex Embeddings for Simple Link Prediction. CoRR abs/1606.06357 (2016). http://arxiv.org/abs/1606.06357
[33]
Chengyu Wang, Rong Zhang, Xiaofeng He, and Aoying Zhou. 2016. Error Link Detection and Correction in Wikipedia. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM '16). ACM, New York, NY, USA, 307--316.
[34]
Q. Wang, Z. Mao, B. Wang, and L. Guo. 2017. Knowledge Graph Embedding: A Survey of Approaches and Applications. IEEE Transactions on Knowledge and Data Engineering PP, 99 (2017), 1--1.
[35]
Zhen Wang, Jianwen Zhang, Jianlin Feng, and Zheng Chen. {n. d.}. In AAAI.
[36]
Gabriel Weaver, Barbara Strickland, and Gregory Crane. 2006. Quantifying the accuracy of relational statements in Wikipedia: a methodology. 2006 IEEE/ACM 6th Joint Conference on Digital Libraries 00 (2006), 358.
[37]
Han Xiao, Minlie Huang, Yu Hao, and Xiaoyan Zhu. 2015. TransG: A Generative Mixture Model for Knowledge Graph Embedding. CoRR abs/1509.05488 (2015). http://arxiv.org/abs/1509.05488
[38]
Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, and Li Deng. 2014. Learning Multi-Relational Semantics Using Neural-Embedding Models. CoRR abs/1411.4072 (2014). http://arxiv.org/abs/1411.4072

Cited By

View all
  • (2024)Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic LabelBioengineering10.3390/bioengineering1103022511:3(225)Online publication date: 27-Feb-2024
  • (2024)Towards Knowledge Graph Refinement: Misdirected Triple IdentificationAdjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization10.1145/3631700.3665235(460-466)Online publication date: 27-Jun-2024
  • (2024)Knowledge graph error detection with unsupervised triplet networkSeventh Global Intelligent Industry Conference (GIIC 2024)10.1117/12.3032265(36)Online publication date: 11-Oct-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
K-CAP '17: Proceedings of the 9th Knowledge Capture Conference
December 2017
271 pages
ISBN:9781450355537
DOI:10.1145/3148011
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 December 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Knowledge graphs refinement
  2. data cleansing
  3. error detection

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

K-CAP 2017
Sponsor:
K-CAP 2017: Knowledge Capture Conference
December 4 - 6, 2017
TX, Austin, USA

Acceptance Rates

Overall Acceptance Rate 55 of 198 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)42
  • Downloads (Last 6 weeks)4
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Enhancing Error Detection on Medical Knowledge Graphs via Intrinsic LabelBioengineering10.3390/bioengineering1103022511:3(225)Online publication date: 27-Feb-2024
  • (2024)Towards Knowledge Graph Refinement: Misdirected Triple IdentificationAdjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization10.1145/3631700.3665235(460-466)Online publication date: 27-Jun-2024
  • (2024)Knowledge graph error detection with unsupervised triplet networkSeventh Global Intelligent Industry Conference (GIIC 2024)10.1117/12.3032265(36)Online publication date: 11-Oct-2024
  • (2024)Quality Evaluation of Triples in Knowledge Graph by Incorporating Internal With External ConsistencyIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.318603335:2(1980-1992)Online publication date: Feb-2024
  • (2024)Integrating Entity Attributes for Error-Aware Knowledge Graph EmbeddingIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.331014936:4(1667-1682)Online publication date: Apr-2024
  • (2024)ProMvSDInformation Processing and Management: an International Journal10.1016/j.ipm.2024.10370561:4Online publication date: 1-Jul-2024
  • (2024)Can data improve knowledge graph?Memetic Computing10.1007/s12293-024-00429-z16:3(403-413)Online publication date: 12-Aug-2024
  • (2023)Knowledge graphs for enhancing transparency in health data ecosystems1Semantic Web10.3233/SW-22329414:5(943-976)Online publication date: 8-May-2023
  • (2023)Error Detection on Knowledge Graphs with Triple Embedding2023 31st European Signal Processing Conference (EUSIPCO)10.23919/EUSIPCO58844.2023.10289852(1604-1608)Online publication date: 4-Sep-2023
  • (2023)Automatic Extraction of Effective Relations in Knowledge Graph for a Recommendation Explanation SystemProceedings of the 38th ACM/SIGAPP Symposium on Applied Computing10.1145/3555776.3577732(1754-1761)Online publication date: 27-Mar-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media