Nothing Special   »   [go: up one dir, main page]

Skip to content
BY 4.0 license Open Access Published by De Gruyter Open Access November 4, 2020

Linking Web Resources in Web of Data to Encyclopedic Knowledge Base

  • Farzam Matinfar EMAIL logo
From the journal Open Computer Science

Abstract

This paper introduces Wikipedia as an extensive knowledge base which provides additional information about a great number of web resources in the semantic web, and shows how RDF web resources in the web of data can be linked to this encyclopedia. Given an input web resource, the designed system identifies the topic of the web resource and links it to the corresponding Wikipedia article. To perform this task, we use the core labeling properties in web of data to specify the candidate Wikipedia articles for a web resource. Finally, a knowledge based approach is used to identify the most appropriate article in Wikipedia database. Evaluation of the system shows the high performance of the designed system.

References

[1] Bizer, C., Heath, T., and Berners-Lee, T.: ‘The Story So Far’, International Journal on semantic Web and Information Systems, 2009, 5, (3), pp. 1-2210.4018/jswis.2009081901Search in Google Scholar

[2] http://www.w3.org/DesignIssues/LinkedData.htmlSearch in Google Scholar

[3] Venkatesan, A., Ngompe, G.T., Hassouni, N.E., Chentli, I., Guignon, V., Jonquet, C., Ruiz, M., and Larmande, P.: ‘Agronomic Linked Data (AgroLD): a Knowledge-based System to Enable Integrative Biology in Agronomy’, PLOS ONE’, 2018, 13, (11)10.1371/journal.pone.0198270Search in Google Scholar PubMed PubMed Central

[4] Knoblock, C.A., Szekely, P., Fink, E., Degler, D., Newbury, D., Sanderson, R., Blanch, K., Snyder, S., Chheda, N., Jain, N., Krishna, R.R., Sreekanth, N.B., and Yao, Y.: ‘Lessons Learned in Building Linked Data for the American Art Collaborative’, in Editor (Ed.)^(Eds.): ‘Book Lessons Learned in Building Linked Data for the American Art Collaborative’ (2017, edn.), pp. 263-27910.1007/978-3-319-68204-4_26Search in Google Scholar

[5] Bechhofer, S., Buchan, I., Roure, D.D., Missier, P., Ainsworth, J., Bhagat, J., Couch, P., Cruickshank, D., Delderfield, M., Dunlop, I., Gamble, M., Michaelides, D., Owen, S., Newman, D., Sufi, S., and Goble, C.: ‘Why linked data is not enough for scientists, Future Generation Computer Systems’, Future Generation Computer Systems, 2013, 29, (2), pp. 599-61110.1016/j.future.2011.08.004Search in Google Scholar

[6] Tummarello, G., Delbru, R., and Oren, E.: ‘Sindice.com: weaving the open linked data’, in Editor (Ed.)^(Eds.): ‘Book Sindice.com: weaving the open linked data’ (2007, edn.), pp. 552-56510.1007/978-3-540-76298-0_40Search in Google Scholar

[7] Klimek, J., Skoda, P., and Necasky, M.: ‘Survey of Tools for Linked Data Consumption’, Semantic Web journal, 2019, 10, (4), pp. 665-72010.3233/SW-180316Search in Google Scholar

[8] http://wifo5-03.informatik.uni-mannheim.de/bizer/pub/LinkedDataTutorial/Search in Google Scholar

[9] Matinfar, F., Nematbakhsh, M.A., and Lausen, G.: ‘Discovery of RDFS: SeeAlso Patterns in Semantic Web’, International Journal of Pattern Recognition and Artificial Intelligence, 2014, 28, (2)10.1142/S0218001414500037Search in Google Scholar

[10] Bunescu, R., and Pasca, M.: ‘Using Encyclopedic Knowledge for Named entity Disambiguation’, in Editor (Ed.)^(Eds.): ‘Book Using Encyclopedic Knowledge for Named entity Disambiguation’ (2006, edn.), pp. 9-16Search in Google Scholar

[11] Nguyen, H., and Cao, T.H.: ‘Named entity disambiguation on an ontology enriched by Wikipedia’, in Editor (Ed.)^(Eds.): ‘Book Named entity disambiguation on an ontology enriched by Wikipedia’ (2008, edn.), pp. 247 – 25410.1109/RIVF.2008.4586363Search in Google Scholar

[12] Klang, M., and Nugues, P.: ‘Linking, Searching, and Visualizing Entities inWikipedia’, in Editor (Ed.)^(Eds.): ‘Book Linking, Searching, and Visualizing Entities inWikipedia’ (2018, edn.), pp. 3426 - 3432Search in Google Scholar

[13] Mehler, A., Baumartz, D., Henlein, A., and Hemati, W.: ‘fast-Sense: An EfficientWord Sense Disambiguation Classifier’, in Editor (Ed.)^(Eds.): ‘Book fastSense: An EfficientWord Sense Disambiguation Classifier’ (2018, edn.), pp.Search in Google Scholar

[14] Kazama, J.i., and Torisawa, K.: ‘Exploiting Wikipedia as External Knowledge for Named Entity Recognition’. Proc. Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, {Prague, Czech Republic2007 pp. PagesSearch in Google Scholar

[15] Moro, A., Raganato, A., and Navigli, R.: ‘Entity Linking meetsWord Sense Disambiguation: a Unified Approach’, Transactions of the Association for Computational Linguistics, 2014, 2, (1), pp. 231-24410.1162/tacl_a_00179Search in Google Scholar

[16] Aggarwal, N., and Buitelaar, P.: ‘Wikipedia-Based Distributional Semantics for Entity Relatedness’, in Editor (Ed.)^(Eds.): ‘Book Wikipedia-Based Distributional Semantics for Entity Relatedness’ (2014, edn.), pp.Search in Google Scholar

[17] Syed, Z., Finin, T., and Joshi, A.: ‘Wikipedia as an Ontology for Describing Documents’, in Editor (Ed.)^(Eds.): ‘Book Wikipedia as an Ontology for Describing Documents’ (2008, edn.), pp.Search in Google Scholar

[18] Schonhofen, P.: ‘Identifying document topics using the Wikipedia category network’, Journal of Web Intelligence and Agent Systems, 2009, 7, (2), pp. 195-20710.3233/WIA-2009-0162Search in Google Scholar

[19] Medelyan, O., Milne, D.N., Legg, C., and Witten, I.H.: ‘Mining meaning from Wikipedia’, International Journal of Human-Computer Studies, 2009, 67, (9), pp. 716-75810.1016/j.ijhcs.2009.05.004Search in Google Scholar

[20] McCrae, J.: ‘MappingWordNet Instances to Wikipedia’, in Editor (Ed.)^(Eds.): ‘Book MappingWordNet Instances to Wikipedia’ (2018, edn.), pp.Search in Google Scholar

[21] AbdulgabbarSaif, Omar, N., Zainodin, U.Z., and Aziz, M.J.A.: ‘Building Sense Tagged Corpus Using Wikipedia for Supervised Word Sense Disambiguation’, Procedia Computer Science, 2018, 123, pp. 403-41210.1016/j.procs.2018.01.062Search in Google Scholar

[22] Galárraga, L., Symeonidou, D., and Moissinac, J.-C.: ‘Rule Mining for Semantifying Wikilinks’. Proc. Linked Data on the Web (LDOW2015), Florence, Italy2015 pp. PagesSearch in Google Scholar

[23] Roberto Navigli, P.V.: ‘Structural semantic interconnections a knowledge-based approach to word sense disambiguation’, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2005, 27, (7), pp. 1075 - 108610.1109/TPAMI.2005.149Search in Google Scholar PubMed

[24] Lesk, M.: ‘Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone’, in Editor (Ed.)^(Eds.): ‘Book Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone’ (1986, edn.), pp. 24-2610.1145/318723.318728Search in Google Scholar

[25] Araujo, S., Houben, G.-J., and Schwabe, D.: ‘Linkator: Enriching Web Pages by Automatically Adding Dereferenceable Semantic Annotations’. Proc. International Conference on Web Engineering2010 pp. Pages10.1007/978-3-642-13911-6_24Search in Google Scholar

[26] Agrawal, R., Gollapudi, S., Kannan, A., and Kenthapadi, K.: ‘Identifying enrichment candidates in textbooks’, in Editor (Ed.)^(Eds.): ‘Book Identifying enrichment candidates in textbooks’ (2011, edn.), pp. 483-49210.1145/1963192.1963362Search in Google Scholar

[27] Klímek, J., Škoda, P., and Něcaský, M.: ‘Survey of tools for Linked Data consumption’, Semantic Web, 2018, pp. 1-57Search in Google Scholar

[28] Nentwig, M., Hartung, M., Ngomo, A.-C.N., and Rahm, E.: ‘A survey of current Link Discovery frameworks’, Semantic Web, 2017, 8, (3), pp. 419-43610.3233/SW-150210Search in Google Scholar

[29] Ngomo, A.-C.N., and Auer, S.: ‘LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data’, in Editor (Ed.)^(Eds.): ‘Book LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data’ (2011, edn.), pp.Search in Google Scholar

[30] Achichi, M., Bellahsene, Z., Ellefi, M.B., and Todorov, K.: ‘Linking and disambiguating entities across heterogeneous RDF graphs’, Journal of Web Semantics, 2019, 55, pp. 108-12110.1016/j.websem.2018.12.003Search in Google Scholar

[31] Volz, J., Bizer, C., Berlin, F.U., Gaedke, M., and Kobilarov, G.: ‘Silk – A Link Discovery Framework for the Web of Data’, in Editor (Ed.)^(Eds.): ‘Book Silk – A Link Discovery Framework for the Web of Data’ (2009, edn.), pp.Search in Google Scholar

[32] Scharffe, F., Liu, Y., and Zhou, C.: ‘RDF-AI: an Architecture for RDF Datasets Matching, Fusion and Interlink’, in Editor (Ed.)^(Eds.): ‘Book RDF-AI: an Architecture for RDF Datasets Matching, Fusion and Interlink’ (2009, edn.), pp.Search in Google Scholar

[33] Nikolov, A., Uren, V., Motta, E., and Roeck, A.d.: ‘Integration of semantically annotated data by the KnoFuss architecture’, in Editor (Ed.)^(Eds.): ‘Book Integration of semantically annotated data by the KnoFuss architecture’ (2008, edn.), pp.10.1007/978-3-540-87696-0_24Search in Google Scholar

[34] Hassanzadeh, O., and Consens, M.P.: ‘Linked Movie Database’, in Editor (Ed.)^(Eds.): ‘Book Linked Movie Database’ (2009, edn.), pp.Search in Google Scholar

[35] Raimond, Y., Sutton, C., and Sandler, M.: ‘Automatic Interlinking of Music Datasets on the Semantic Web’, in Editor (Ed.)^(Eds.): ‘Book Automatic Interlinking of Music Datasets on the Semantic Web’ (2008, edn.), pp.10.1109/MMUL.2009.29Search in Google Scholar

[36] Olfa Ben Said, A.W., Adel M. Alimi: ‘ Interlinking video programs with Linked Open Data ’. Proc. 15th International Conference on Intelligent Systems Design and Applications (ISDA)2015 pp. Pages10.1109/ISDA.2015.7489159Search in Google Scholar

[37] Hausenblas, M., Troncy, R., Raimond, Y., and Bürger, T.: ‘Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia Fragments’, in Editor (Ed.)^(Eds.): ‘Book Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia Fragments’ (2009, edn.), pp.Search in Google Scholar

[38] Rajabi, E., and Greller, W.: ‘Exposing Social Data as Linked Data in Education’, International Journal on Semantic Web and Information Systems 2019, 15, (2), pp. 92-10610.4018/IJSWIS.2019040105Search in Google Scholar

[39] Dezhao Song, Y.L., Jeff Heflin: ‘Linking Heterogeneous Data in the Semantic Web Using Scalable and Domain-Independent Candidate Selection’, IEEE Transactions on Knowledge and Data Engineering, 2016, (99)10.1109/TKDE.2016.2606399Search in Google Scholar

[40] Hausenblas, M., Halb, W., and Raimond, Y.: ‘Scripting User Contributed Interlinking’, in Editor (Ed.)^(Eds.): ‘Book Scripting User Contributed Interlinking’ (2008, edn.), pp.Search in Google Scholar

[41] James N. K. Liu, Y.-L.H., Edward H. Y. Lim, Xi-Zhao Wang: ‘A New Method for Knowledge and Information Management Domain Ontology Graph Model’, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2013, 43, (1), pp. 115 - 12710.1109/TSMCA.2012.2196431Search in Google Scholar

[42] Sun, H., Fan, W., Shen, W., and Xiao, T.: ‘Ontology Fusion in High-Level-Architecture-Based Collaborative Engineering Environments’, IEEE Transactions on Systems, Man, and Cybernetics, 2013, 43, (1), pp. 2 - 1310.1109/TSMCA.2012.2190138Search in Google Scholar

[43] Zhao, L., and Ichise, R.: ‘Ontology Integration for Linked Data’, Journal on Data Semantics, 2014, 3, (4), pp. 237–25410.1007/s13740-014-0041-9Search in Google Scholar

[44] Matinfar, F., Nematbakhsh, M., and Lausen, G.: ‘Web Resource Sense Disambiguation in Web of Data’, Journal of Universal Computer Science, 2013, 19, (13), pp. 1871-1891Search in Google Scholar

[45] Bloehdorn, S., Hotho, A., and Staab, S.: ‘An Ontology-based framework for text mining’, GLDV Journal for computational linguistics and language technology, 2004, 20, (1), pp. 87-11210.21248/jlcl.20.2005.70Search in Google Scholar

[46] Rajpathak, D.G., and Singh, S.: ‘An Ontology-Based Text Mining Method to Develop D-Matrix From Unstructured Text’, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2014, 44, (7), pp. 966 - 97710.1109/TSMC.2013.2281963Search in Google Scholar

[47] Lund, K., and Burgess, C.: ‘Producing high-dimensional semantic spaces from lexical co-occurrence’, Behavior Research Methods, Instruments, & Computers, 1996, 28, (2), pp. 203–20810.3758/BF03204766Search in Google Scholar

[48] Ponzetto, S.P., and Navigli, R.: ‘Knowledge-rich Word Sense Disambiguation rivaling supervised systems’, in Editor (Ed.)^(Eds.): ‘Book Knowledge-rich Word Sense Disambiguation rivaling supervised systems’ (2010, edn.), pp. 1522-1531Search in Google Scholar

[49] Fogarolli, A.: ‘Word Sense Disambiguation Based on Wikipedia Link Structure’. Proc. IEEE International Conference on Semantic Computing2009 pp. Pages10.1109/ICSC.2009.7Search in Google Scholar

[50] Mihalcea, R.: ‘Using Wikipedia for Automatic Word Sense Disambiguation’, in Editor (Ed.)^(Eds.): ‘Book Using Wikipedia for Automatic Word Sense Disambiguation’ (2007, edn.), pp.Search in Google Scholar

[51] Strube, M., and Ponzetto, S.P.: ‘WikiRelate! computing semantic relatedness using Wikipedia’, in Editor (Ed.)^(Eds.): ‘Book WikiRelate! computing semantic relatedness using Wikipedia’ (2006, edn.), pp. 1419-1424Search in Google Scholar

[52] Csomai, A., and Mihalcea, R.: ‘Linking documents to encyclopedic knowledge’, IEEE Intelligent Systems, 2008, 23, (5), pp. 34-4110.1109/MIS.2008.86Search in Google Scholar

[53] David Tomýs, Y.G., Francisco Agullý: ‘Entity linking in media content and user comments: Connecting data to wikipedia and other knowledge bases’. Proc. eChallenges e-2015 Conference2015 pp. Pages10.1109/eCHALLENGES.2015.7441053Search in Google Scholar

[54] Zesch, T., Müller, C., and Gurevych, I.: ‘Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary’, in Editor (Ed.)^(Eds.): ‘Book Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary’ (2008, edn.), pp. 1646–1652Search in Google Scholar

Received: 2019-10-11
Accepted: 2020-03-24
Published Online: 2020-11-04

© 2020 Farzam Matinfar, published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 3.10.2024 from https://www.degruyter.com/document/doi/10.1515/comp-2020-0102/html
Scroll to top button