Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A semi-automated framework for semantically annotating web content

Published: 01 April 2018 Publication History

Abstract

Todays web is growing very fast and having a strong online presence is becoming critical for businesses. In contrast, websites in search results using traditional Search Engine Optimization (SEO) techniques became less effective in achieving the desired visibility. To overcome the limitations of traditional SEO, some recent trends are adopting the technologies of Semantic Web to annotate web content with Semantic Markup that can be understood by search engines. However, the balance between the accuracy of annotations and the automation level still under investigation. This research proposes a semi-automated framework that provides a high level of accurate annotations with minimum user interaction, based on Schema.org; a well-accepted ontology for ordinary things in life. The proposed framework aims to analyze the contents of web documents and extract the unique keywords and key phrases that best describe that content. Then, annotate those keywords and key phrases with the appropriate Schema.org vocabularies. This will reflect on the understanding level of the search engines to the web contents and accordingly a better visibility in the search results. SMAF is a semi-automated framework that provides accurate annotations with minimum user interaction.SMAF analyzes web pages, extracts unique keywords and annotates them using Schema.org vocabularies.Constructing accumulated SMAF owned database for keywords and key phrases.SMAF improves the visibility of web contents in search engines by making them understandable.

References

[1]
J. Brocker, G. Janvan Ahee, 2008.
[2]
M. Naeem, O. Bashi, Malm University, 2011.
[3]
L.F. Sikos, Introduction to the semantic web, in: Mastering Struct. Data Semant. Web, Apress, Berkeley, CA, 2015, pp. 111.
[4]
T. Berners-Lee, J. Hendler, O. Lassila, The semantic web, Sci. Am., 284 (2001) 34-43.
[5]
V.N. Gudivada, D. Rao, J. Paris, Understanding search-engine optimization, Comput. (Long. Beach. Calif), 48 (2015) 43-52.
[6]
T. Mavridis, A.L. Symeonidis, Identifying valid search engine ranking factors in a web 2.0 and web 3.0 context for building efficient SEO mechanisms, Eng. Appl. Artif. Intell., 41 (2015) 75-91.
[7]
N. Ambiah, D. Lukose, Enriching webpages with semantic information, in: Int Conf. Dublin Core Metadata Appl. 2012, pp. 111, http://dcpapers.dublincore.org/pubs/article/view/3663.
[8]
A. Aldaej, University of Surrey, 2015.
[9]
L. Yu, Springer Berlin Heidelberg, Berlin, Heidelberg, 2014.
[10]
P. Kogut, W. Holmes, AeroDAML: Applying information extraction to generate DAML annotations from web pages, in: First Int. Conf. Knowl. Capture KCAP 2001 Work. Knowl. Markup Semant. Annot. Vol. 21, 2001, p. 3.
[11]
J. Krutil, M. Kudelka, V. Snasel, Web page classification based on Schema.org collection, Comput. Asp. Soc. Networks, in: CASoN, 2012 Fourth Int. Conf. 2012, pp. 356360,
[12]
JSON-LD - JSON for Linking Data, 2013. http://json-ld.org/(accessed 18.05.16).
[13]
M. Antoniazzi, Mapping the TORCH Ontology to Schema.org, Oslo and Akershus University College of Applied Sciences, 2015.
[14]
R.V. Guha, D. Brickley, S. Macbeth, Schema.org, Commun. ACM, 59 (2016) 44-51.
[15]
P. Barker, L.M. Campbell, What is schema.org? Cetis Brief. Pap. 2014, pp. 113, http://publications.cetis.ac.uk/wp-content/uploads/2014/06/schemaBriefing.pdf.
[16]
R.V. Guha, D. Brickley, S. Macbeth, Schema.org: Evolution of structured data on the web, Commun. ACM, 59 (2016) 44-51.
[17]
P. Mika, On Schema.org and why it matters for the web, Int. Comput. IEEE, 19 (2015) 52-55.
[18]
C. Veres, E. Elseth, Schema.org for the semantic web with MaDaME, in: CEUR Workshop Proc. Vol. 1026, 2013, pp. 1115.
[19]
A. Khalili, S. Auer, WYSIWYM Authoring of Structured Content Based on Schema.org, in: Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics) LNCS, vol. 8181, 2013, pp. 425-438.
[20]
A. Tort, A. Oliv, A computer-guided approach to website Schema.org design, in: Concept. Model. 33rd Int. Conf. ER 2014, Atlanta, GA, USA, Oct. 27-29, 2014. Proc., Springer International Publishing, Cham, 2014, pp. 28-42.
[21]
S. Homoceanu, F. Geilert, C. Pek, W.-T. Balke, Any suggestions? Active schema support for structuring web information, in: Database Syst. Adv. Appl. Vol. 8422, 2014, pp. 251-265.
[22]
I. Toma, C. Stanciu, A. Fensel, I. Stavrakantonakis, D. Fensel, Improving the online visibility of touristic service providers by using semantic annotations, in: Semant. Web ESWC 2014 Satell. Events ESWC 2014 Satell. Events, Anissaras, Crete, Greece, May 25-29, 2014, Revis. Sel. Pap., Springer International Publishing, Cham, 2014, pp. 259-262.
[23]
C. Manning, P. Raghavan, H. Schutze, Cambridge University Press, Cambridge, 2008.
[24]
V. Uren, P. Cimiano, J. Iria, S. Handschuh, M. Vargas-Vera, E. Motta, F. Ciravegna, Semantic annotation for knowledge management: Requirements and a survey of the state of the art, Web Semant., 4 (2006) 14-28.
[25]
L. Marujo, A. Gershman, J. Carbonell, R. Frederking, Frederking, Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization, Lr. 2012, 2011, pp. 399403, http://www.lrec-conf.org/proceedings/lrec2012/pdf/672_Paper.pdf.
[26]
L. Jean-Louis, A. Zouaq, M. Gagnon, F. Ensan, An assessment of online semantic annotators for the keyword extraction task, in: Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), Springer International Publishing, Cham, 2014, pp. 548-560.
  1. A semi-automated framework for semantically annotating web content

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Future Generation Computer Systems
    Future Generation Computer Systems  Volume 81, Issue C
    April 2018
    580 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 01 April 2018

    Author Tags

    1. JSON-LD
    2. Schema.org
    3. Semantic Markup
    4. Semantic annotation

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media