Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3371425.3371436acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiipccConference Proceedingsconference-collections
research-article

ESN-NER: entity storage network using attention mechanism for chinese NER

Published: 19 December 2019 Publication History

Abstract

Chinese named entity recognition (NER) is more difficult than it in English because of the lack of nature delimiters. First, Chinese NER requires word segmentation, but word-based segmentation will generate errors due to the different granularity of the word segmentation tools. Second, most NER models heavily rely on local linguistic features, but the scope of influence provided by local linguistic features is limited, so sometimes the model will give different results to the same entity in different sentences. To address the above problems, we propose the Entity Storage Network Model called ESN Model for Chinese NER, which is a character-based model to avoid word segmentation errors. Specifically, we design an entity storage layer in this model to extract and store the entity information as a local linguistic feature, and design a position feature which is generated by four flags to enhance the learning of boundary. Then we incorporate the attention mechanism to extend the scope of the local linguistic features. The experimental results on two real-world datasets demonstrate that our model outperforms the state-of-the-art models in Chinese NER task.

References

[1]
Miwa M and Bansal M (2016). End-to-end relation extraction using lstms on sequences and tree structures. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7--12, 2016, Berlin, Germany Volume 1: Long Papers.
[2]
Gupta N, Singh S and Roth D (2017). Entity linking via joint encoding of types, descriptions, and context. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp 2681--2690.
[3]
Fragkou P (2017). Applying named entity recognition and co-reference resolution for segmenting English texts. Progress in Artificial Intelligence 6(4), 325--346.
[4]
Liu L, Shang J, Ren X, Xu FF, Gui H, Peng J and Han J (2018). Empower sequence labeling with task-aware neural language model. In Thirty-Second AAAI Conference on Artificial Intelligence.
[5]
Chen W, Zhang Y and Isahara H (2006b). Chinese named entity recognition with conditional random fields. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 118--121.
[6]
Zhang Y and Yang J (2018). Chinese ner using lattice lstm. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15--20, 2018, Volume 1: Long Papers, pp 1554--1564.
[7]
Isozaki H and Kazawa H (2002). Efficient support vector classifiers for named entity recognition. In Proceedings of the 19th international conference on Computational linguistics -Volume 1, Association for Computational Linguistics, pp 1--7.
[8]
Bikel DM, Miller S, Schwartz R and Weischedel R (1998). Nymble: a high-performance learning name-finder. arXiv preprint cmp-lg/9803003.
[9]
Lafferty J, McCallum A and Pereira FC (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, pp 282--289.
[10]
Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K and Kuksa P (2011). Natural language processing (almost) from scratch. Journal of machine learning research 12(Aug), 2493--2537.
[11]
Huang Z, Xu W and Yu K (2015). Bidirectional lstm-crf models for sequence tagging. arXiv preprint arXiv:150801991.
[12]
Hammerton J (2003). Named entity recognition with long short-term memory. In Proceedings of the seventh conference on Natural language learning at HLTNAACL 2003-Volume 4, Association for Computational Linguistics, pp 172--175.
[13]
Dong C, Zhang J, Zong C, Hattori M and Di H (2016). Character-based lstm-crf with radical-level features for Chinese named entity recognition. In Natural Language Understanding and Intelligent Applications, Springer, pp 239--250.
[14]
Luo G, Huang X, Lin CY and Nie Z (2015). Joint entity recognition and disambiguation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp 879--888.
[15]
Rei M (2017). Semi-supervised multitask learning for sequence labeling. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp 2121--2130.
[16]
Peters ME, Ammar W, Bhagavatula C and Power R (2017). Semi-supervised sequence tagging with bidirectional language models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pp 1756--1765.
[17]
Mikolov T, Sutskever I, Chen K, Corrado GS and Dean J (2013). Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pp 3111--3119.
[18]
Graves A, Wayne G and Danihelka I (2014). Neural turing machines. Computer Science.
[19]
Weston J, Chopra S and Bordes A (2014). Memory networks. arXiv preprint arXiv:14103916.
[20]
Yin X, Zheng D, Lu Z and Liu R (2018). Neural entity reasoner for global consistency in ner. arXiv preprint arXiv:181000347.
[21]
Seo M, Kembhavi A, Farhadi A and Hajishirzi H (2016). Bidirectional attention flow for machine comprehension. arXiv preprint arXiv:161101603.
[22]
Rei M, Crichton GK and Pyysalo S (2016). Attending to characters in neural sequence labeling models. In COLING 2016, 26th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, December 11--16, 2016, Osaka, Japan, pp 309--318.
[23]
Ratinov L and Roth D (2009). Design challenges and misconceptions in named entity recognition. In Proceedings of the thirteenth conference on computational natural language learning, Association for Computational Linguistics, pp 147--155.
[24]
Levow GA (2006). The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 108--117.
[25]
Chen A, Peng F, Shan R and Sun G (2006a). Chinese named entity recognition with conditional probabilistic models. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 173--176.
[26]
Zhang S, Qin Y, Wen J and Wang X (2006). Word segmentation and named entity recognition for sighan bakeoff3. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp 158--161.
[27]
Zhou J, Qu W and Zhang F (2013). Chinese named entity recognition via joint identification and categorization. Chinese journal of electronics, 22(2), 225--230.
[28]
Lu Y, Zhang Y and Ji D (2016). Multi-prototype Chinese character embedding. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp 855--859.
[29]
Cao P, Chen Y, Liu K, Zhao J and Liu S (2018). Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp 182--192.
[30]
Zhu Y, Wang G and Karlsson BF (2019). Can-Ner: Convolutional attention network for Chinese named entity recognition. arXiv preprint arXiv:190402141.

Cited By

View all
  • (2023)Named Entity Recognition Model Based on Feature FusionInformation10.3390/info1402013314:2(133)Online publication date: 17-Feb-2023

Index Terms

  1. ESN-NER: entity storage network using attention mechanism for chinese NER

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing
    December 2019
    464 pages
    ISBN:9781450376334
    DOI:10.1145/3371425
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • ASciE: Association for Science and Engineering

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 December 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. attention mechanism
    2. chinese NER
    3. entity storage network
    4. linguistic feature

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    AIIPCC '19
    Sponsor:
    • ASciE

    Acceptance Rates

    AIIPCC '19 Paper Acceptance Rate 78 of 211 submissions, 37%;
    Overall Acceptance Rate 78 of 211 submissions, 37%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Named Entity Recognition Model Based on Feature FusionInformation10.3390/info1402013314:2(133)Online publication date: 17-Feb-2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media