Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2740908.2742005acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Towards a Complete Event Type Taxonomy

Published: 18 May 2015 Publication History

Abstract

We present initial results of our effort to build an extensive and complete taxonomy of events described in news articles. By crawling Wikipedia's current events portal we identified nine top-level event types. Using articles referenced by the portal we built a event type classification model for news articles using lexical and semantic features and present a small-scale manual evaluation of its results. Results show that our model can accurately distinguish between event types but its coverage could still be significantly improved.

References

[1]
X. Carreras, L. Padró, L. Zhang, A. Rettinger, Z. Li, E. García-Cuesta, v. Agić, B. Bekavec, B. Fortuna, and T.vStajner. Xlike project language analysis services. In Proceedings of the Demonstrations Session at EACL 2014, pages 9--12, Gothenburg, Sweden, April 2014. Association for Computational Linguistics.
[2]
N. Chambers. Event schema induction with a probabilistic entity-driven model. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1797--1807, 2013.
[3]
N. Chambers and D. Jurafsky. Template-Based Information Extraction without the Templates. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 976--986, 2011.
[4]
J. C. K. Cheung, H. Poon, and L. Vanderwende. Probabilistic Frame Induction. Proceedings of 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 837--846, 2013.
[5]
N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines: And Other Kernel-based Learning Methods. Cambridge University Press, New York, NY, USA, 2000.
[6]
P. Exner and P. Nugues. Using semantic role labeling to extract events from wikipedia. In Proceedings of the Workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011). Workshop in conjunction with the 10th International Semantic Web Conference, pages 23--24, 2011.
[7]
A. Ko\vsmerlj, J. Belyaeva, G. Leban, B. Fortuna, and M. Grobelnik. Crowdsourcing event extraction. In NewsKDD: Data Science for News Publishing workshop. Workshop in conjunction with KDD2014 the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2014.
[8]
E. Kuzey, J. Vreeken, and G. Weikum. A fresh look on knowledge bases: Distilling named events from news. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM '14, pages 1689--1698, New York, NY, USA, 2014. ACM.
[9]
E. Kuzey and G. Weikum. Extraction of temporal facts and events from wikipedia. In Proceedings of the 2Nd Temporal Web Analytics Workshop, TempWeb '12, pages 25--32, New York, NY, USA, 2012. ACM.
[10]
E. Kuzey and G. Weikum. Evin: Building a knowledge base of events. In Proceedings of the Companion Publication of the 23rd International Conference on World Wide Web Companion, WWW Companion '14, pages 103--106, Republic and Canton of Geneva, Switzerland, 2014. International World Wide Web Conferences Steering Committee.
[11]
G. Leban, B. Fortuna, J. Brank, and M. Grobelnik. Event registry: Learning about world events from news. In Proceedings of the Companion Publication of the 23rd International Conference on World Wide Web Companion, WWW Companion '14, pages 107--110. International World Wide Web Conferences Steering Committee, 2014.
[12]
C. D. Manning and H. Schütze. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA, USA, 1999.
[13]
R. Shaw, R. Troncy, and L. Hardman. Lode: Linking open descriptions of events. In A. Gómez-Pérez, Y. Yu, and Y. Ding, editors, The Semantic Web, Lecture Notes in Computer Science, pages 153--167. Springer Berlin Heidelberg, 2009.
[14]
M. Trampuš and B. Novak. The internals of an aggregated web news feed. In Proceedings of 15th Multiconference on Information Society 2012 (IS-2012), Ljubljana, Slovenia, 2012.

Cited By

View all
  • (2020)Multilabel graph-based classification for missing labelsInternational Journal on Digital Libraries10.1007/s00799-020-00295-3Online publication date: 12-Oct-2020
  • (2020)Feature selection for classifying multi-labeled past eventsInternational Journal on Digital Libraries10.1007/s00799-020-00293-5Online publication date: 8-Sep-2020
  • (2018)System for Category-driven Retrieval of Historical EventsProceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries10.1145/3197026.3203888(413-414)Online publication date: 23-May-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '15 Companion: Proceedings of the 24th International Conference on World Wide Web
May 2015
1602 pages
ISBN:9781450334730
DOI:10.1145/2740908

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 May 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. event extraction
  2. event type
  3. natural language processing
  4. news classification
  5. wikipedia

Qualifiers

  • Research-article

Funding Sources

  • European Union

Conference

WWW '15
Sponsor:
  • IW3C2

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)1
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Multilabel graph-based classification for missing labelsInternational Journal on Digital Libraries10.1007/s00799-020-00295-3Online publication date: 12-Oct-2020
  • (2020)Feature selection for classifying multi-labeled past eventsInternational Journal on Digital Libraries10.1007/s00799-020-00293-5Online publication date: 8-Sep-2020
  • (2018)System for Category-driven Retrieval of Historical EventsProceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries10.1145/3197026.3203888(413-414)Online publication date: 23-May-2018
  • (2018)Classifying Short Descriptions of Past EventsAdvances in Information Retrieval10.1007/978-3-319-76941-7_69(729-736)Online publication date: 1-Mar-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media