Htab2RDF: Mapping HTML Tables to RDF Triples

Authors

  • Djelloul Bouchiha, EEDIS Laboratory, Djillali Liabes University of Sidi Bel Abbes
  • Mimoun Malki, EEDIS Laboratory, Djillali Liabes University of Sidi Bel Abbes
  • Abdullah Alghamdi, College of Computer and Information Sciences, KSU, Riyadh
  • Khalid Alnafjan, College of Computer and Information Sciences, KSU, Riyadh

Keywords:

HTML tables, RDF, relational databases, Linked Data, domain ontology, WordNet

Abstract

The Web has become a vast data source hidden beneath linked documents. A significant number of Web documents include HTML tables generated dynamically from relational databases, and often there is no direct public access to the databases themselves. RDF (Resource Description Framework), on the other hand, provides an efficient mechanism for representing data directly on the Web, based on a Web-scalable architecture for the identification and interpretation of terms. This leads to the concept of Linked Data on the Web. To allow direct access to Web data as Linked Data, this paper proposes an approach for transforming HTML tables into RDF triples. It consists of three main phases: refining, pre-treatment and mapping. The whole process is assisted by a domain ontology and the WordNet lexical database. A tool called Htab2RDF has been implemented, and experiments have been carried out to evaluate the proposed approach and demonstrate its efficiency.
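
To make the idea of turning table rows into triples concrete, the following minimal Python sketch illustrates only a naive version of the final mapping step. It is not the Htab2RDF implementation: the refining and pre-treatment phases, the domain ontology and WordNet are not modeled. It assumes the first table row is a header. The names `TableParser` and `table_to_ntriples`, the `http://example.org/` base URI, and the sample table are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import quote


class TableParser(HTMLParser):
    """Collect the cell text of every <tr> row found in an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # row being built, or None outside <tr>
        self._cell = None   # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("th", "td") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def table_to_ntriples(html, base="http://example.org/"):
    """Map each data row to one subject and each column to one predicate.

    Assumption: the first row holds the column names. The base URI is a
    placeholder, not a vocabulary taken from the paper.
    """
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *records = parser.rows
    triples = []
    for index, record in enumerate(records, start=1):
        subject = f"<{base}row/{index}>"
        for column, value in zip(header, record):
            name = quote(column.lower().replace(" ", "_"))
            predicate = f"<{base}prop/{name}>"
            # Escape characters that are special inside an N-Triples literal.
            literal = (value.replace("\\", "\\\\").replace('"', '\\"')
                            .replace("\n", "\\n").replace("\r", "\\r"))
            triples.append(f'{subject} {predicate} "{literal}" .')
    return triples


# Hypothetical sample table, used only for illustration.
SAMPLE = """
<table>
  <tr><th>Name</th><th>City</th></tr>
  <tr><td>Alice</td><td>Oran</td></tr>
  <tr><td>Bob</td><td>Riyadh</td></tr>
</table>
"""

if __name__ == "__main__":
    for line in table_to_ntriples(SAMPLE):
        print(line)
```

In this sketch every cell becomes a plain literal. An ontology-assisted mapping, as the abstract describes, would presumably choose predicates and classes from the domain ontology instead of deriving them from the header text.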

Published

2018-02-09

How to Cite

Bouchiha, D., Malki, M., Alghamdi, A., & Alnafjan, K. (2018). Htab2RDF: Mapping HTML Tables to RDF Triples. Computing and Informatics, 36(6), 1467–1491. Retrieved from https://www.cai.sk/ojs/index.php/cai/article/view/2017_6_1467
