research-article

ESN-NER: entity storage network using attention mechanism for chinese NER

Authors:

Long LiAuthors Info & Claims

AIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing

Article No.: 41, Pages 1 - 8

https://doi.org/10.1145/3371425.3371436

Published: 19 December 2019 Publication History

Abstract

Chinese named entity recognition (NER) is more difficult than it in English because of the lack of nature delimiters. First, Chinese NER requires word segmentation, but word-based segmentation will generate errors due to the different granularity of the word segmentation tools. Second, most NER models heavily rely on local linguistic features, but the scope of influence provided by local linguistic features is limited, so sometimes the model will give different results to the same entity in different sentences. To address the above problems, we propose the Entity Storage Network Model called ESN Model for Chinese NER, which is a character-based model to avoid word segmentation errors. Specifically, we design an entity storage layer in this model to extract and store the entity information as a local linguistic feature, and design a position feature which is generated by four flags to enhance the learning of boundary. Then we incorporate the attention mechanism to extend the scope of the local linguistic features. The experimental results on two real-world datasets demonstrate that our model outperforms the state-of-the-art models in Chinese NER task.

References

[1]

Miwa M and Bansal M (2016). End-to-end relation extraction using lstms on sequences and tree structures. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7--12, 2016, Berlin, Germany Volume 1: Long Papers.

[2]

Gupta N, Singh S and Roth D (2017). Entity linking via joint encoding of types, descriptions, and context. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp 2681--2690.

[3]

Fragkou P (2017). Applying named entity recognition and co-reference resolution for segmenting English texts. Progress in Artificial Intelligence 6(4), 325--346.

[4]

Liu L, Shang J, Ren X, Xu FF, Gui H, Peng J and Han J (2018). Empower sequence labeling with task-aware neural language model. In Thirty-Second AAAI Conference on Artificial Intelligence.

[5]

Chen W, Zhang Y and Isahara H (2006b). Chinese named entity recognition with conditional random fields. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 118--121.

[6]

Zhang Y and Yang J (2018). Chinese ner using lattice lstm. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15--20, 2018, Volume 1: Long Papers, pp 1554--1564.

[7]

Isozaki H and Kazawa H (2002). Efficient support vector classifiers for named entity recognition. In Proceedings of the 19th international conference on Computational linguistics -Volume 1, Association for Computational Linguistics, pp 1--7.

Digital Library

[8]

Bikel DM, Miller S, Schwartz R and Weischedel R (1998). Nymble: a high-performance learning name-finder. arXiv preprint cmp-lg/9803003.

[9]

Lafferty J, McCallum A and Pereira FC (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, pp 282--289.

Digital Library

[10]

Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K and Kuksa P (2011). Natural language processing (almost) from scratch. Journal of machine learning research 12(Aug), 2493--2537.

Digital Library

[11]

Huang Z, Xu W and Yu K (2015). Bidirectional lstm-crf models for sequence tagging. arXiv preprint arXiv:150801991.

[12]

Hammerton J (2003). Named entity recognition with long short-term memory. In Proceedings of the seventh conference on Natural language learning at HLTNAACL 2003-Volume 4, Association for Computational Linguistics, pp 172--175.

Digital Library

[13]

Dong C, Zhang J, Zong C, Hattori M and Di H (2016). Character-based lstm-crf with radical-level features for Chinese named entity recognition. In Natural Language Understanding and Intelligent Applications, Springer, pp 239--250.

[14]

Luo G, Huang X, Lin CY and Nie Z (2015). Joint entity recognition and disambiguation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp 879--888.

[15]

Rei M (2017). Semi-supervised multitask learning for sequence labeling. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp 2121--2130.

[16]

Peters ME, Ammar W, Bhagavatula C and Power R (2017). Semi-supervised sequence tagging with bidirectional language models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp 1756--1765.

[17]

Mikolov T, Sutskever I, Chen K, Corrado GS and Dean J (2013). Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pp 3111--3119.

[18]

Graves A, Wayne G and Danihelka I (2014). Neural turing machines. Computer Science.

[19]

Weston J, Chopra S and Bordes A (2014). Memory networks. arXiv preprint arXiv:14103916.

[20]

Yin X, Zheng D, Lu Z and Liu R (2018). Neural entity reasoner for global consistency in ner. arXiv preprint arXiv:181000347.

[21]

Seo M, Kembhavi A, Farhadi A and Hajishirzi H (2016). Bidirectional attention flow for machine comprehension. arXiv preprint arXiv:161101603.

[22]

Rei M, Crichton GK and Pyysalo S (2016). Attending to characters in neural sequence labeling models. In COLING 2016, 26th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, December 11--16, 2016, Osaka, Japan, pp 309--318.

[23]

Ratinov L and Roth D (2009). Design challenges and misconceptions in named entity recognition. In Proceedings of the thirteenth conference on computational natural language learning, Association for Computational Linguistics, pp 147--155.

[24]

Levow GA (2006). The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 108--117.

[25]

Chen A, Peng F, Shan R and Sun G (2006a). Chinese named entity recognition with conditional probabilistic models. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 173--176.

[26]

Zhang S, Qin Y, Wen J and Wang X (2006). Word segmentation and named entity recognition for sighan bakeoff3. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 158--161.

[27]

Zhou J, Qu W and Zhang F (2013). Chinese named entity recognition via joint identification and categorization. Chinese journal of electronics, 22(2), 225--230.

[28]

Lu Y, Zhang Y and Ji D (2016). Multi-prototype Chinese character embedding. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp 855--859.

[29]

Cao P, Chen Y, Liu K, Zhao J and Liu S (2018). Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp 182--192.

[30]

Zhu Y, Wang G and Karlsson BF (2019). Can-Ner: Convolutional attention network for Chinese named entity recognition. arXiv preprint arXiv:190402141.

Cited By

Sun ZLi X(2023)Named Entity Recognition Model Based on Feature FusionInformation10.3390/info1402013314:2(133)Online publication date: 17-Feb-2023
https://doi.org/10.3390/info14020133

Index Terms

ESN-NER: entity storage network using attention mechanism for chinese NER
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction

Recommendations

Multi-task learning with helpful word selection for lexicon-enhanced Chinese NER
Abstract
Named entity recognition (NER) is a common task in the field of natural language processing, but it remains more challenging in Chinese due to the lack of natural delimiters. Recently, lots of works incorporate external lexicon into character-...
SDTCNs: A Symmetric Double Temporal Convolutional Network for Chinese NER
Wireless Algorithms, Systems, and Applications
Abstract
Chinese NER is a basic task of Chinese natural language processing. Most current models for Chinese NER can be roughly divided into two categories: character-based models and word-based models. Character-based models cannot effectively utilize the ...
MGCN: A Novel Multi-Graph Collaborative Network for Chinese NER
Natural Language Processing and Chinese Computing
Abstract
Named Entity Recognition (NER), one of the most important directions in Natural Language Processing (NLP), is an essential pre-processing step in many downstream NLP tasks. In recent years, most of the existing methods solve Chinese NER tasks by ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

AIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing

December 2019

464 pages

ISBN:9781450376334

DOI:10.1145/3371425

Conference Chairs:
João Manuel R. S. Tavares,
Zeshui Xu

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

ASciE: Association for Science and Engineering

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 December 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

the Guangxi Universities Young and Middle-aged Teacher Basic Ability Enhancement Project
Guangxi Innovation-Driven Development Project
National Natural Science Foundation of China
Natural Science Foundation of Guangxi Province

Conference

AIIPCC '19

Sponsor:

ASciE

AIIPCC '19: 2019 International Conference on Artificial Intelligence, Information Processing and Cloud Computing

December 19 - 21, 2019

Sanya, China

Acceptance Rates

AIIPCC '19 Paper Acceptance Rate 78 of 211 submissions, 37%;

Overall Acceptance Rate 78 of 211 submissions, 37%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
126
Total Downloads

Downloads (Last 12 months)4
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sun ZLi X(2023)Named Entity Recognition Model Based on Feature FusionInformation10.3390/info1402013314:2(133)Online publication date: 17-Feb-2023
https://doi.org/10.3390/info14020133

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten