Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1871437.1871709acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Relational feature engineering of natural language processing

Published: 26 October 2010 Publication History

Abstract

We present a new framework for feature engineering of natural language processing that is based on a relational data model of text. It includes fast and flexible methods for implementing and extracting new features and thereby reduces the effort of creating an NLP system for a particular task.
In an instantiation and evaluation of the framework for the problem of coreference resolution in multiple languages, we were able to obtain competitive results in a short implementation period. This demonstrates the potential power of our framework for feature engineering.

References

[1]
A. Bagga and B. Baldwin. Algorithms for scoring coreference chains. In In The First International Conference on Language Resources and Evaluation Workshop on Linguistics Coreference, pages 563--566, 1998.
[2]
V. Bogorny, A. T. Palma, P. Engel, and L. O. Alvares. Weka-gdpm: Integrating classical data mining toolkit to geographic information systems. In WAAMD, pages 9--16. SBC, 2006.
[3]
T. Connolly and C. E. Begg. Database Systems: A Practical Approach to Design, Implementation and Management 2nd Ed. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1998.
[4]
R. G. Crawford and I. A. Macleod. A relational approach to modular information retrieval systems design. In Proceedings of the American Society for Information Systems Annual Meeting, volume 15, 1978.
[5]
D. Grossman. Using the relational model and part-of-speech tagging to implement text relevance. In ACM CIKM, 1992.
[6]
D. Grossman, O. Frieder, D. O. Holmes, and D. C. Roberts. Integrating structured data and text: A relational approach. Journal of the American Society of Information Science, 48, 1997.
[7]
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The WEKA Data Mining Software: An Update, 2009.
[8]
H. Kobdani and H. Schütze. Sucre: A modular system for coreference resolution. In Proceedings of the 5th International Workshop on Semantic Evaluation, pages 92--95, Uppsala, Sweden, July 2010. ACL.
[9]
H. Liu and H. Motoda. Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers, Norwell, MA, USA, 1998.
[10]
X. Luo. On coreference resolution performance metrics. In HLT '05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 25--32, Morristown, NJ, USA, 2005. ACL.
[11]
C. A. Lynch and M. Stonebraker. Extended user-defined indexing with application to textual databases. In Fourteenth International Conference on Very Large Data Bases, August 29 - September 1, 1988, Los Angeles, California, USA, Proceedings, pages 306--317, 1988.
[12]
T. M. Mitchell. Machine Learning. McGraw-Hill, New York, 1997.
[13]
M. Vilain, J. Burger, J. Aberdeen, D. Connolly, and L. Hirschman. A model-theoretic coreference scoring scheme. In MUC6 '95: Proceedings of the 6th conference on Message understanding, pages 45--52, Morristown, NJ, USA, 1995. ACL.

Cited By

View all
  • (2020)Approximate Decision Tree Induction over Approximately Engineered Data FeaturesRough Sets10.1007/978-3-030-52705-1_28(376-384)Online publication date: 7-Jul-2020
  • (2015)EET: Efficient event tracking over emergency-oriented web data2015 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2015.7280798(1-8)Online publication date: Jul-2015
  • (2014)FEATURE SELECTION BASED ON COMPACTNESS AND SEPARABILITYComputational Intelligence10.1111/coin.1201030:3(636-656)Online publication date: 1-Aug-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge management
October 2010
2036 pages
ISBN:9781450300995
DOI:10.1145/1871437
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. coreference resolution
  2. feature engineering
  3. natural language processing
  4. relational data model

Qualifiers

  • Poster

Conference

CIKM '10

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Approximate Decision Tree Induction over Approximately Engineered Data FeaturesRough Sets10.1007/978-3-030-52705-1_28(376-384)Online publication date: 7-Jul-2020
  • (2015)EET: Efficient event tracking over emergency-oriented web data2015 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN.2015.7280798(1-8)Online publication date: Jul-2015
  • (2014)FEATURE SELECTION BASED ON COMPACTNESS AND SEPARABILITYComputational Intelligence10.1111/coin.1201030:3(636-656)Online publication date: 1-Aug-2014
  • (2012)RDBMS Model for Scientific Articles AnalyticsIntelligent Tools for Building a Scientific Information Platform10.1007/978-3-642-24809-2_4(49-60)Online publication date: 24-Jan-2012
  • (2011)Supervised coreference resolution with SUCREProceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task10.5555/2132936.2132947(71-75)Online publication date: 23-Jun-2011
  • (2011)Self organizing maps in NLPProceedings of the 8th international conference on Advances in self-organizing maps10.5555/2026666.2026694(228-237)Online publication date: 13-Jun-2011
  • (2011)Bootstrapping coreference resolution using word associationsProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 110.5555/2002472.2002572(783-792)Online publication date: 19-Jun-2011
  • (2011)Self Organizing Maps in NLP: Exploration of Coreference Feature SpaceAdvances in Self-Organizing Maps10.1007/978-3-642-21566-7_23(228-237)Online publication date: 2011

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media