Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1007/978-3-319-18818-8_33guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Ranking Entities in the Age of Two Webs, an Application to Semantic Snippets

Published: 31 May 2015 Publication History

Abstract

The advances of the Linked Open Data LOD initiative are giving rise to a more structured Web of data. Indeed, a few datasets act as hubs e.g., DBpedia connecting many other datasets. They also made possible new Web services for entity detection inside plain text e.g.,ï źDBpedia Spotlight, thus allowing for new applications that can benefit from a combination of the Web of documents and the Web of data. To ease the emergence of these new applications, we propose a query-biased algorithm LDRANK for the ranking of web of data resources with associated textual data. Our algorithm combines link analysis with dimensionality reduction. We use crowdsourcing for building a publicly available and reusable dataset for the evaluation of query-biased ranking of Web of data resources detected in Web pages. We show that, on this dataset, LDRANK outperforms the state of the art. Finally, we use this algorithm for the construction of semantic snippets of which we evaluate the usefulness with a crowdsourcing-based approach.

References

[1]
Alonso, O., Marshall, C., Najork, M.: Crowdsourcing a subjective labeling task: a human-centered framework to ensure reliable results. Technical report, MSR-TR-2014-91. http://research.microsoft.com/apps/pubs/default.aspx
[2]
Bai, X., Delbru, R., Tummarello, G.: RDF snippets for semantic web search engines. In: Meersman, R., Tari, Z. eds. OTM 2008, Part II. LNCS, vol. 5332, pp. 1304---1318. Springer, Heidelberg 2008
[3]
Berry, M.W.: Large-scale sparse singular value computations. Int. J. Supercomput. Appl. 61, 13---49 1992
[4]
Bizer, C., Eckert, K., Meusel, R., Mühleisen, H., Schuhmacher, M., Völker, J.: Deployment of RDFa, microdata, and microformats on the web --- a quantitative analysis. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. eds. ISWC 2013, Part II. LNCS, vol. 8219, pp. 17---32. Springer, Heidelberg 2013
[5]
Carvalho, A., Larson, K.: A consensual linear opinion pool. In: Proceedings of the Twenty-Third international Joint Conference on Artificial Intelligence, pp. 2518---2524. AAAI Press 2013
[6]
Dali, L., Fortuna, B., Duc, T.T., Mladenić, D.: Query-independent learning to rank for RDF entity search. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. eds. ESWC 2012. LNCS, vol. 7295, pp. 484---498. Springer, Heidelberg 2012
[7]
Demartini, G., Difallah, D.E., Cudré-Mauroux, P.: Zencrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking. In: Proceedings of the 21st International Conference on World Wide Web, pp. 469---478. ACM 2012
[8]
Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R.S., Peng, Y., Reddivari, P., Doshi, V., Sachs, J.: Swoogle: a search and metadata engine for the semantic web. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 652---659. ACM 2004
[9]
Fafalios, P., Tzitzikas, Y.: Post-analysis of keyword-based search results using entity mining, linked data, and link analysis at query time 2014
[10]
Franz, T., Schultz, A., Sizov, S., Staab, S.: Triplerank: ranking semantic web data by tensor decomposition. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. eds. ISWC 2009. LNCS, vol. 5823, pp. 213---228. Springer, Heidelberg 2009
[11]
Ge, W., Cheng, G., Li, H., Qu, Y.: Incorporating compactness to generate term-association view snippets for ontology search. Inf. Process. Manage. 49, 513---528 2013
[12]
Haas, K., Mika, P., Tarjan, P., Blanco, R.: Enhanced results for web search. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 725---734. ACM 2011
[13]
Järvelin, K., Kekäläinen, J.: Ir evaluation methods for retrieving highly relevant documents. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 41---48. ACM 2000
[14]
Jeong, J.W., Morris, M.R., Teevan, J., Liebling, D.J.: A crowd-powered socially embedded search engine. In: ICWSM 2013
[15]
Jindal, V., Bawa, S., Batra, S.: A review of ranking approaches for semantic search on web. Inf. Process. Manage. 502, 416---425 2014
[16]
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM JACM 465, 604---632 1999
[17]
Kohlschütter, C., Fankhauser, P., Nejdl, W.: Boilerplate detection using shallow text features. In: Proceedings of the Third ACM International Conference on Web search and Data Mining, pp. 441---450. ACM 2010
[18]
Krippendorff, K.: Content analysis: An introduction to Its Methodology. Sage, Thousand Oaks 2012
[19]
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33, 159---174 1977
[20]
Lempel, R., Moran, S.: Salsa: the stochastic approach for link-structure analysis. ACM Trans. Inf. Syst. TOIS 192, 131---160 2001
[21]
Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: Dbpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, pp. 1---8. I-Semantics 2011, ACM 2011
[22]
Nie, Z., Zhang, Y., Wen, J.R., Ma, W.Y.: Object-level ranking: bringing order to web objects. In: Proceedings of the 14th International Conference on World Wide Web, pp. 567---574. ACM 2005
[23]
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web 1999
[24]
Penin, T., Wang, H., Tran, T., Yu, Y.: Snippet generation for semantic web search engines. In: Domingue, J., Anutariya, C. eds. ASWC 2008. LNCS, vol. 5367, pp. 493---507. Springer, Heidelberg 2008
[25]
Roa-Valverde, A.J., Sicilia, M.A.: A survey of approaches for ranking on the web of data. Inf. Retrieval 17, 1---31 2014
[26]
Steiner, T., Troncy, R., Hausenblas, M.: How google is using linked data today and vision for tomorrow. In: Proceedings of Linked Data in the Future Internet 700 2010
[27]
Tonon, A., Catasta, M., Demartini, G., Cudré-Mauroux, P., Aberer, K.: TRank: ranking entity types using the web of data. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. eds. ISWC 2013, Part I. LNCS, vol. 8218, pp. 640---656. Springer, Heidelberg 2013
[28]
Wei, W., Barnaghi, P., Bargiela, A.: Rational research model for ranking semantic entities. Inf. Sci. 18113, 2823---2840 2011

Cited By

View all
  • (2016)DBtrendsProceedings of the 12th International Conference on Semantic Systems10.1145/2993318.2993322(9-16)Online publication date: 12-Sep-2016
  • (2016)Automated extraction of concept matcher thesaurus from semi-structured catalogue-like sources of data on the webProceedings of the 18th Conference of Open Innovations Association FRUCT10.1109/FRUCT-ISPIT.2016.7561521(153-160)Online publication date: 25-Apr-2016

Index Terms

  1. Ranking Entities in the Age of Two Webs, an Application to Semantic Snippets

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    Proceedings of the 12th European Semantic Web Conference on The Semantic Web. Latest Advances and New Domains - Volume 9088
    May 2015
    797 pages
    ISBN:9783319188171

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 31 May 2015

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 08 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)DBtrendsProceedings of the 12th International Conference on Semantic Systems10.1145/2993318.2993322(9-16)Online publication date: 12-Sep-2016
    • (2016)Automated extraction of concept matcher thesaurus from semi-structured catalogue-like sources of data on the webProceedings of the 18th Conference of Open Innovations Association FRUCT10.1109/FRUCT-ISPIT.2016.7561521(153-160)Online publication date: 25-Apr-2016

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media