Computer Science > Computation and Language

arXiv:2210.15147 (cs)

[Submitted on 27 Oct 2022]

Title:A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking

Authors:Zilin Yuan, Yinghui Li, Yangning Li, Rui Xie, Wei Wu, Hai-Tao Zheng

View PDF

Abstract:Text classification is a very classic NLP task, but it has two prominent shortcomings: On the one hand, text classification is deeply domain-dependent. That is, a classifier trained on the corpus of one domain may not perform so well in another domain. On the other hand, text classification models require a lot of annotated data for training. However, for some domains, there may not exist enough annotated data. Therefore, it is valuable to investigate how to efficiently utilize text data from different domains to improve the performance of models in various domains. Some multi-domain text classification models are trained by adversarial training to extract shared features among all domains and the specific features of each domain. We noted that the distinctness of the domain-specific features is different, so in this paper, we propose to use a curriculum learning strategy based on keyword weight ranking to improve the performance of multi-domain text classification models. The experimental results on the Amazon review and FDU-MTL datasets show that our curriculum learning strategy effectively improves the performance of multi-domain text classification models based on adversarial learning and outperforms state-of-the-art methods.

Comments:	Submitted to ICASSP2023 (currently under review)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.15147 [cs.CL]
	(or arXiv:2210.15147v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.15147

Submission history

From: Zilin Yuan [view email]
[v1] Thu, 27 Oct 2022 03:15:26 UTC (321 KB)

Computer Science > Computation and Language

Title:A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators