Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2611040.2611052acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

Linked Enterprise Data for Fine Grained Named Entity Linking and Web Intelligence

Published: 02 June 2014 Publication History

Abstract

To identify trends and assign metadata elements such as location and sentiment to the correct entities, Web intelligence applications require methods for linking named entities and revealing relations between organizations, persons and products. For this purpose we introduce Recognyze, a named entity linking component that uses background knowledge obtained from linked data repositories. This paper outlines the underlying methods, provides insights into the migration of proprietary knowledge sources to linked enterprise data, and discusses the lessons learned from adapting linked data for named entity linking. A large dataset obtained from Orell Füssli, the largest Swiss business information provider, serves as the main showcase. This dataset includes more than nine million triples on companies, their contact information, management, products and brands. We identify major challenges towards applying this data for named entity linking and conduct a comprehensive evaluation based on several news corpora to illustrate how Recognyze helps address them, and how it improves the performance of named entity linking components drawing upon linked data rather than machine learning techniques.

References

[1]
E. Amitay, N. Har'El, R. Sivan, and A. Soffer. Web-a-where: geotagging web content. In SIGIR '04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pages 273--280, New York, NY, USA, 2004. ACM.
[2]
C. Bizer, J. Lehmann, G. Kobilarov, S. Auer, C. Becker, R. Cyganiak, and S. Hellmann. DBpedia - a crystallization point for the web of data. Journal of Web Semantics: Science, Services and Agents on the World Wide Web, 7(3):154--165, 2009.
[3]
S. Chaudhuri, U. Dayal, and V. Narasayya. An overview of business intelligence technology. Communications of the ACM, 54(8):88--98, Aug. 2011.
[4]
H. Chen. Business and market intelligence 2.0. IEEE Intelligent Systems, 25(1):68--83, 2010.
[5]
A. Das and B. Gambäck. Sentimantics: conceptual spaces for lexical sentiment polarity representation with contextuality. In Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, WASSA '12, page 38--46, Stroudsburg, PA, USA, 2012. Association for Computational Linguistics.
[6]
N. Fernández, J. Arias Fisteus, L. Sánchez, and G. López. IdentityRank: named entity disambiguation in the news domain. Expert Systems with Applications, 39(10):9207--9221, 2012.
[7]
A. Gangemi. A comparison of knowledge extraction tools for the semantic web. In P. Cimiano, O. Corcho, V. Presutti, L. Hollink, and S. Rudolph, editors, The Semantic Web: Semantics and Big Data, number 7882 in Lecture Notes in Computer Science, pages 351--366. Springer Berlin Heidelberg, Jan. 2013.
[8]
B. Hachey, W. Radford, J. Nothman, M. Honnibal, and J. R. Curran. Evaluating entity linking with wikipedia. Artificial Intelligence, 194:130--150, 2013.
[9]
X. Han and J. Zhao. Named entity disambiguation by leveraging wikipedia semantic knowledge. In Proceedings of the 18th ACM conference on Information and knowledge management, CIKM '09, page 215--224, New York, NY, USA, 2009. ACM.
[10]
J. Hoffart, M. A. Yosef, I. Bordino, H. Fürstenau, M. Pinkal, M. Spaniol, B. Taneva, S. Thater, and G. Weikum. Robust disambiguation of named entities in text. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11, page 782--792, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics.
[11]
J. J. Jung. Online named entity recognition method for microtexts in social networking services: A case study of twitter. Expert Systems with Applications, 39(9):8066--8070, 2012.
[12]
S. S. Kataria, K. S. Kumar, R. R. Rastogi, P. Sen, and S. H. Sengamedu. Entity disambiguation with hierarchical topic models. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '11, page 1037--1045, New York, NY, USA, 2011. ACM.
[13]
H. P. Luhn. A business intelligence system. IBM Journal of Research and Development, 2(4):314--319, 1958.
[14]
S. Negash and P. Gray. Business intelligence. In F. Burstein, C. Holsapple, S. Negash, and P. Gray, editors, Handbook on Decision Support Systems 2, International Handbooks Information System, pages 175--193. Springer Berlin Heidelberg, 2008.
[15]
J. Nothman, N. Ringland, W. Radford, T. Murphy, and J. R. Curran. Learning multilingual named entity recognition from wikipedia. Artificial Intelligence, 194:151--175, 2013.
[16]
A. Pilz and G. Paaβ. From names to entities using thematic context distance. In Proceedings of the 20th ACM international conference on Information and knowledge management, CIKM '11, page 857--866, New York, NY, USA, 2011. ACM.
[17]
E. F. Tjong Kim Sang. Introduction to the CoNLL-2002 shared task: language-independent named entity recognition. In proceedings of the 6th conference on Natural language learning - Volume 20, COLING-02, page 1--4, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics.
[18]
E. F. Tjong Kim Sang and F. De Meulder. Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4, CONLL '03, page 142--147, Stroudsburg, PA, USA, 2003. Association for Computational Linguistics.
[19]
D. Urbansky, J. A. Thom, D. Schuster, and A. Schill. Training a named entity recognizer on the web. In Proceedings of the 12th international conference on Web information system engineering, WISE'11, page 87--100, Berlin, Heidelberg, 2011. Springer-Verlag.
[20]
C. Wang, K. Chakrabarti, T. Cheng, and S. Chaudhuri. Targeted disambiguation of ad-hoc, homogeneous sets of named entities. In Proceedings of the 21st international conference on World Wide Web, WWW '12, page 719--728, New York, NY, USA, 2012. ACM.
[21]
H. J. Watson and B. H. Wixom. The current state of business intelligence. Computer, 40(9):96--99, 2007.
[22]
A. Weichselbraun. A utility centered approach for evaluating and optimizing geo-tagging. In First International Conference on Knowledge Discovery and Information Retrieval (KDIR 2009), pages 134--139, Madeira, Portugal, October 2009.
[23]
A. Weichselbraun, S. Gindl, and A. Scharl. A context-dependent supervised learning approach to sentiment detection in large textual databases. Journal of Information and Data Management, 1(3):329--342, 2010.
[24]
A. Weichselbraun, S. Gindl, and A. Scharl. Extracting and grounding context-aware sentiment lexicons. IEEE Intelligent Systems, 28(2):39--46, 2013.
[25]
F. Wu and D. S. Weld. Open information extraction using wikipedia. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10, page 118--127, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics.

Cited By

View all
  • (2024)Information Extraction to Identify Novel Technologies and Trends in Renewable EnergyWorld Conference of AI-Powered Innovation and Inventive Design10.1007/978-3-031-75923-9_22(330-345)Online publication date: 29-Oct-2024
  • (2017)JRC-NamesSemantic Web10.3233/SW-1602288:2(283-295)Online publication date: 1-Jan-2017
  • (2016)Exploring a framework for identity and attribute linking across heterogeneous data systemsProceedings of the 2nd International Workshop on BIG Data Software Engineering10.1145/2896825.2896833(19-25)Online publication date: 14-May-2016
  • Show More Cited By

Index Terms

  1. Linked Enterprise Data for Fine Grained Named Entity Linking and Web Intelligence

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WIMS '14: Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14)
    June 2014
    506 pages
    ISBN:9781450325387
    DOI:10.1145/2611040
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    In-Cooperation

    • Aristotle University of Thessaloniki

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 June 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Web intelligence
    2. business news
    3. linked enterprise data
    4. linked open data
    5. named entity linking

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WIMS '14

    Acceptance Rates

    WIMS '14 Paper Acceptance Rate 41 of 90 submissions, 46%;
    Overall Acceptance Rate 140 of 278 submissions, 50%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Information Extraction to Identify Novel Technologies and Trends in Renewable EnergyWorld Conference of AI-Powered Innovation and Inventive Design10.1007/978-3-031-75923-9_22(330-345)Online publication date: 29-Oct-2024
    • (2017)JRC-NamesSemantic Web10.3233/SW-1602288:2(283-295)Online publication date: 1-Jan-2017
    • (2016)Exploring a framework for identity and attribute linking across heterogeneous data systemsProceedings of the 2nd International Workshop on BIG Data Software Engineering10.1145/2896825.2896833(19-25)Online publication date: 14-May-2016
    • (2016)Scalable Knowledge Extraction and Visualization for Web IntelligenceProceedings of the 2016 49th Hawaii International Conference on System Sciences (HICSS)10.1109/HICSS.2016.467(3749-3757)Online publication date: 5-Jan-2016
    • (2016)Analyzing the public discourse on works of fiction - Detection and visualization of emotion in online coverage about HBO's Game of ThronesInformation Processing and Management: an International Journal10.1016/j.ipm.2015.02.00352:1(129-138)Online publication date: 1-Jan-2016
    • (2014)Visualizing Contextual Information in Aggregated Web Content RepositoriesProceedings of the 2014 9th Latin American Web Congress10.1109/LAWeb.2014.18(114-118)Online publication date: 22-Oct-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media