Abstract
In this paper, we report the results of the NLPCC 2021 shared task on Few-shot Learning for Chinese NLP. The shared task is proposed in the context of pre-trained language models, where models have access to only limited human-labeled data; its goal is to compare different learning schemes under this constraint. The task comprises nine sub-tasks covering three task forms: single-sentence classification, sentence-pair classification, and machine reading comprehension. To reflect the few-shot setting, we sampled the examples with several sampling strategies: some datasets contain 32 examples in total, while others contain 4 to 16 examples per class. Ninety teams registered for the shared task, employing a wide range of learning schemes, including data augmentation, the use of multiple templates rather than a single template, and the use of unlabeled data for pre-training or semi-supervised training. The best model achieved a mean accuracy of 65.3, compared with a human score of 83.9; this is 8 points higher than our baseline model, which uses the PET scheme. We believe our few-shot learning tasks and results demonstrate the potential of recently introduced few-shot learning methods and provide guidance and important empirical evidence for future research.
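As a rough illustration of the per-class sampling and cloze-style (PET) templating mentioned above, the following Python sketch draws k labeled examples per class and wraps each one in a single cloze pattern. It is not the organizers' exact procedure; the dataset field names, the k value, and the template wording are assumptions for illustration only.

```python
import random
from collections import defaultdict


def sample_few_shot(examples, k=16, seed=42):
    """Draw up to k examples per class from a labeled dataset.

    `examples` is assumed to be a list of dicts with "text" and "label"
    keys; the real shared-task splits may use different field names.
    """
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for ex in examples:
        by_label[ex["label"]].append(ex)

    few_shot = []
    for _, items in by_label.items():
        rng.shuffle(items)
        few_shot.extend(items[:k])
    rng.shuffle(few_shot)
    return few_shot


def cloze_template(text):
    """Wrap a sentence in a PET-style cloze pattern for a sentiment-like
    single-sentence classification task; the wording is illustrative."""
    return f"{text} 这句话的情感是[MASK]的。"  # [MASK] is filled by the MLM head


if __name__ == "__main__":
    toy_data = [
        {"text": "这家餐厅的菜很好吃", "label": "positive"},
        {"text": "服务太差了", "label": "negative"},
        {"text": "性价比很高", "label": "positive"},
        {"text": "再也不会来了", "label": "negative"},
    ]
    for ex in sample_few_shot(toy_data, k=1):
        print(cloze_template(ex["text"]), "->", ex["label"])
```

In the multi-template variants discussed in the abstract, several such cloze patterns would be defined per task and their predictions combined, rather than relying on a single pattern.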
L. Xu, X. Lu, C. Yuan, X. Zhang, H. Yuan and H. Xu—Contributed equally.
Notes
1. It is a voice assistant developed by OPPO for mobile phones and IoT devices.
References
Brown, T.B., et al.: Language models are few-shot learners (2020)
Cui, Y., Che, W., Liu, T., Qin, B., Wang, S., Hu, G.: Revisiting pre-trained models for Chinese natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, pp. 657–668. Association for Computational Linguistics, Online, November 2020. https://www.aclweb.org/anthology/2020.findings-emnlp.58
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Zhang, Y.W.E., Wang, Z.: Entailment method based on template selection for FewCLUE evaluation (2021)
Gao, T., Fisch, A., Chen, D.: Making pre-trained language models better few-shot learners. In: Association for Computational Linguistics (ACL) (2021)
Hu, H., Richardson, K., Liang, X., Lu, L., Kübler, S., Moss, L.: OCNLI: original Chinese natural language inference. In: Findings of Empirical Methods for Natural Language Processing (Findings of EMNLP) (2020)
IFLYTEK CO., LTD.: IFLYTEK: a multiple-category Chinese text classifier. Competition official website (2019). http://challenge.xfyun.cn/2019/gamelist
Zeng, J., Jiang, S.W.Y., Li, M.: Enhanced few-shot learning with multiple-pattern-exploiting training. In: CCF International Conference on Natural Language Processing and Chinese Computing (2021)
Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., Soricut, R.: ALBERT: a lite BERT for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 (2019)
Liu, X., et al.: GPT understands, too. arXiv preprint arXiv:2103.10385 (2021)
Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
Conversational-AI Center of OPPO XiaoBu: BUSTM: OPPO Xiaobu dialogue short text matching dataset (2021). https://github.com/xiaobu-coai/BUSTM
Schick, T., Schütze, H.: Exploiting cloze-questions for few-shot text classification and natural language inference. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pp. 255–269. Association for Computational Linguistics, Online, April 2021. https://aclanthology.org/2021.eacl-main.20
Schick, T., Schütze, H.: It’s not just size that matters: small language models are also few-shot learners. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 2339–2352. Association for Computational Linguistics, Online, June 2021. https://doi.org/10.18653/v1/2021.naacl-main.185, https://aclanthology.org/2021.naacl-main.185
Tam, D., Menon, R.R., Bansal, M., Srivastava, S., Raffel, C.: Improving and simplifying pattern exploiting training (2021)
Wang, S., Fang, H., Khabsa, M., Mao, H., Ma, H.: Entailment as few-shot learner (2021)
Wei, J., et al.: NEZHA: neural contextualized representation for Chinese language understanding. arXiv preprint arXiv:1909.00204 (2019)
Xu, L., et al.: CLUE: a Chinese language understanding evaluation benchmark. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 4762–4772. International Committee on Computational Linguistics, Barcelona, Spain (Online), December 2020. https://doi.org/10.18653/v1/2020.coling-main.419, https://aclanthology.org/2020.coling-main.419
Zheng, C., Huang, M., Sun, A.: ChID: a large-scale Chinese IDiom dataset for cloze test. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 778–787 (2019)
Xu, Z., et al.: When few-shot learning meets large-scale knowledge-enhanced pre-training: Alibaba at FewCLUE. In: CCF International Conference on Natural Language Processing and Chinese Computing (2021)
Acknowledgements
Many thanks to NLPCC for offering us the opportunity to organize this task, and to everyone who took part in it.
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, L., et al. (2021). Few-Shot Learning for Chinese NLP Tasks. In: Wang, L., Feng, Y., Hong, Y., He, R. (eds) Natural Language Processing and Chinese Computing. NLPCC 2021. Lecture Notes in Computer Science, vol 13029. Springer, Cham. https://doi.org/10.1007/978-3-030-88483-3_33
DOI: https://doi.org/10.1007/978-3-030-88483-3_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88482-6
Online ISBN: 978-3-030-88483-3
eBook Packages: Computer Science, Computer Science (R0)