Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2912845.2912848acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

How Can Ontologies Give You Clue for Truth-Discovery? An Exploratory Study

Published: 13 June 2016 Publication History

Abstract

The main aim of truth-finding methods is to identify the most reliable and trustworthy data among a set of facts. Since existing methods assume a single true value, they cannot deal with numerous real-world use cases in which a set of true values exists for a given fact, even for functional predicate (e.g. Picasso is born in Màlaga and in Spain). This paper studies how traditional truth-finding methods can be adapted to this setting. After introducing a new definition of true value and discussing associated implications, we propose an approach that can be used to identify true values among a set of non-conflicting claims; it takes advantage of belief functions to incorporate knowledge about value relationships in the form of a partial ordering of claimed values. By reducing the error rate up to 30% adapting classical approaches, the effectiveness and suitability of our proposal is clearly highlighted through empirical experiments performed on DBpedia.

References

[1]
S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, and Z. Ives. The Semantic Web: 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007 + ASWC 2007, Busan, Korea, November 11-15, 2007. Proceedings, chapter DBpedia: A Nucleus for a Web of Open Data, pages 722--735. Springer Berlin Heidelberg, Berlin, Heidelberg, 2007.
[2]
L. Berti-Equille, A. D. Sarma, Xin, Dong, A. Marian, and D. Srivastava. Sailing the Information Ocean with Awareness of Currents: Discovery and Application of Source Dependence. Cidr, 2009.
[3]
L. Blanco, V. Crescenzi, P. Merialdo, and P. Papotti. Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources. Proceedings of the 22Nd International Conference on Advanced Information Systems Engineering, pages 83--97, 2010.
[4]
S. Brin and L. Page. The Anatomy of a Search Engine. Proceedings of the 7th International World-Wide Web Conference, page 20, 1998.
[5]
X. L. Dong, L. Berti-Equille, Y. Hu, and D. Srivastava. Global detection of complex copying relationships between sources. Proceedings of the VLDB Endowment, 3(1):1358--1369, 2010.
[6]
X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: the role of source dependence. Proceedings of the VLDB Endowment, 2(1):550--561, 2009.
[7]
X. L. Dong, L. Berti-Equille, and D. Srivastava. Truth Discovery and Copying Detection in a Dynamic World. Vldb, 2(1):562--573, 2009.
[8]
X. L. Dong, E. Gabrilovich, K. Murphy, V. Dang, W. Horn, C. Lugaresi, S. Sun, and W. Zhang. Knowledge-based trust: Estimating the trustworthiness of web sources. Proc. VLDB Endow., 8(9):938--949, May 2015.
[9]
B. J. Fogg, H. Tseng, C. Hall, and H. Drive. Elements of computer credibility. Conference on Human Factors in Computing Systems - Proceedings, (May):80--87, 1999.
[10]
A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating Information from Disagreeing Views. WSDM '10 Proceedings of the third ACM international conference on Web search and data mining, pages 131--140, 2010.
[11]
Y. Gil and D. Artz. Towards content trust of web resources. Web Semantics: Science, Services and Agents on the World Wide Web, 5(4):227--239, 2007.
[12]
M. Gupta, Y. Sun, and J. Han. Trust analysis with clustering. In Proceedings of the 20th International Conference Companion on World Wide Web, WWW '11, pages 53--54, New York, NY, USA, 2011. ACM.
[13]
S. Harispe, A. Imoussaten, F. Trousset, and J. Montmain. On the consideration of a bring-to-mind model for computing the information content of concepts defined into ontologies. In Fuzzy Systems (FUZZ-IEEE), 2015 IEEE International Conference on, pages 1--8, Aug 2015.
[14]
S. Harispe, S. Ranwez, S. Janaqi, and J. Montmain. The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies. Bioinformatics, 30(5):740--742, 2014.
[15]
S. Harispe, S. Ranwez, S. Janaqi, and J. Montmain. Semantic similarity from natural language and ontology analysis. Synthesis Lectures on Human Language Technologies, 8(1):1--254, 2015.
[16]
K. Kelton, K. R. Fleischmann, and W. A. Wallace. Trust in digital information. J. Am. Soc. Inf. Sci. Technol., 59(3):363--374, Feb. 2008.
[17]
J. M. Kleinberg. Authoritative Sources in a Hyperlinked Environment. Journal of the ACM, 46(May 1997):668--677, 1999.
[18]
Y. Li, J. Gao, C. Meng, Q. Li, L. Su, B. Zhao, W. Fan, and J. Han. A survey on truth discovery. CoRR, abs/1505.02463, 2015.
[19]
D. Lin. An information-theoretic definition of similarity. In 15th International Conference of Machine Learning (ICML 1998), volume 98, pages 296--304, 1998.
[20]
J. Pasternack and D. Roth. Knowing what to believe (when you already know something). In Proceedings of the 23rd International Conference on Computational Linguistics, COLING '10, pages 877--885, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics.
[21]
J. Pasternack and D. Roth. Making better informed trust decisions with generalized fact-finding. IJCAI International Joint Conference on Artificial Intelligence, pages 2324--2329, 2011.
[22]
R. Pochampally, A. Das Sarma, X. L. Dong, A. Meliou, and D. Srivastava. Fusing data with correlations. Proceedings of the 2014 ACM SIGMOD international conference on Management of data - SIGMOD '14, pages 433--444, 2014.
[23]
G.-J. Qi, C. C. Aggarwal, J. Han, and T. Huang. Mining collective intelligence in diverse groups. In Proceedings of the 22Nd International Conference on World Wide Web, WWW '13, pages 1041--1052, Switzerland, 2013. International World Wide Web Conferences Steering Committee.
[24]
D. Sánchez and M. Batet. Semantic similarity estimation in the biomedical domain: An ontology-based information-theoretic perspective. Journal of biomedical informatics, 44(5):749--759, 2011.
[25]
G. Shafer et al. A mathematical theory of evidence, volume 1. Princeton university press Princeton, 1976.
[26]
D. A. Waguih and L. Berti-Equille. Truth discovery algorithms: An experimental evaluation. CoRR, abs/1409.6428, 2014.
[27]
D. Wang, T. Abdelzaher, L. Kaplan, and C. Aggarwal. Recursive fact-finding: A streaming approach to truth estimation in crowdsourcing applications. In Distributed Computing Systems (ICDCS), 2013 IEEE 33rd International Conference on, pages 530--539, July 2013.
[28]
D. Wang, L. Kaplan, H. Le, and T. Abdelzaher. On truth discovery in social sensing: A maximum likelihood estimation approach. In Proceedings of the 11th International Conference on Information Processing in Sensor Networks, IPSN '12, pages 233--244, New York, NY, USA, 2012. ACM.
[29]
X. Wang, Q. Z. Sheng, X. S. Fang, L. Yao, X. Xu, and X. Li. An integrated bayesian approach for effective multi-truth discovery. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM '15, pages 493--502, New York, NY, USA, 2015. ACM.
[30]
X. Yin, J. Han, and P. S. Yu. Truth discovery with multiple conflicting information providers on the Web. IEEE Transactions on Knowledge and Data Engineering, 20(6):796--808, 2008.
[31]
B. Zhao, B. I. P. Rubinstein, J. Gemmell, and J. Han. A bayesian approach to discovering truth from conflicting sources for data integration. Proc. VLDB Endow., 5(6):550--561, Feb. 2012.

Cited By

View all
  • (2018)Leveraging Data Relationships to Resolve Conflicts from Disparate Data SourcesDatabase and Expert Systems Applications10.1007/978-3-319-98812-2_7(99-115)Online publication date: 3-Sep-2018
  • (2018)Combining Truth Discovery and RDF Knowledge Bases to Their Mutual AdvantageThe Semantic Web – ISWC 201810.1007/978-3-030-00671-6_38(652-668)Online publication date: 8-Oct-2018

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WIMS '16: Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics
June 2016
309 pages
© 2016 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 June 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Belief Functions
  2. Ontology
  3. Source Trustworthiness
  4. Truth Discovery

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WIMS '16

Acceptance Rates

WIMS '16 Paper Acceptance Rate 36 of 53 submissions, 68%;
Overall Acceptance Rate 140 of 278 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)3
Reflects downloads up to 19 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Leveraging Data Relationships to Resolve Conflicts from Disparate Data SourcesDatabase and Expert Systems Applications10.1007/978-3-319-98812-2_7(99-115)Online publication date: 3-Sep-2018
  • (2018)Combining Truth Discovery and RDF Knowledge Bases to Their Mutual AdvantageThe Semantic Web – ISWC 201810.1007/978-3-030-00671-6_38(652-668)Online publication date: 8-Oct-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media