Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3487664.3487682acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

Selective Classification of Danmaku Comments Using Distributed Representations

Published: 30 December 2021 Publication History

Abstract

Danmaku commenting has become popular for co-viewing on video-sharing platforms. However, there are usually a large number of irrelevant comments, that contaminate the quality of the information provided by videos. To address this problem, this paper presents a novel approach of classifying Danmaku comments into video categories. Specifically, we use BERT as the backbone architecture to extract semantic features from comments. We introduce a loss function that has an abstention option, which enables the detection of comments that do not fall into any predefined category. The experiments that we conducted using Nicovideo data demonstrated that our selective classification approach effectively discarded those that were irrelevant to a video’s content. We also present a method for subdividing the existing video categories based on the results of Danmaku comment classification. This entails a potential application of our method in hierarchical video clustering.

References

[1]
2013. Nicovideo comment etc. data (in Japanese). https://doi.org/10.32130/idr.3.1. Last accessed: 18/04/2021.
[2]
2020. Kicking Off National Bullying Prevention Month With New Anti-Bullying Features. https://about.instagram.com/blog/announcements/national-bullying-prevention-month. Last accessed: 18/04/2021.
[3]
2020. Yahoo! News strengthens its countermeasures against comments by using the most advanced natural language processing decision model (AI) using deep learning and its own supercomputer kukai.https://about.yahoo.co.jp/pr/release/2020/03/06c/. Last accessed: 18/04/2021.
[4]
Issa Alsmadi and Gan Keng Hoon. 2019. Term weighting scheme for short-text classification: Twitter corpuses. Neural Computing and Applications 31, 8 (2019), 3819–3831.
[5]
Qingchun Bai, Kai Wei, Jie Zhou, Chao Xiong, Yuanbin Wu, Xin Lin, and Liang He. 2021. Entity-level sentiment prediction in Danmaku video interaction. The Journal of Supercomputing(2021). https://doi.org/10.1007/s11227-021-03652-4
[6]
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5 (2017), 135–146.
[7]
Yue Chen, Qin Gao, and Pei-Luen Patrick Rau. 2017. Watching a Movie Alone yet Together: Understanding Reasons for Watching Danmaku Videos. International Journal of Human–Computer Interaction 33, 9(2017), 731–743. https://doi.org/10.1080/10447318.2017.1282187
[8]
Yen-Liang Chen, Chia-Ling Chang, and Chin-Sheng Yeh. 2017. Emotion classification of YouTube videos. Decision Support Systems 101 (2017), 40–50.
[9]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1. Association for Computational Linguistics, 4171–4186.
[10]
Yonatan Geifman and Ran El-Yaniv. 2017. Selective Classification for Deep Neural Networks. In Advances in Neural Information Processing Systems, Vol. 30. Curran Associates, Inc.
[11]
Reihaneh H Hariri, Erik M Fredericks, and Kate M Bowers. 2019. Uncertainty in big data analytics: survey, opportunities, and challenges. Journal of Big Data 6, 1 (2019), 1–16.
[12]
Armand Joulin, Edouard Grave, Piotr Bojanowski, and Tomás Mikolov. 2016. Bag of Tricks for Efficient Text Classification. Computing Research Repository abs/1607.01759 (2016).
[13]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings.
[14]
Ziyin Liu, Zhikang Wang, Paul Pu Liang, Russ R Salakhutdinov, Louis-Philippe Morency, and Masahito Ueda. 2019. Deep Gamblers: Learning to Abstain with Portfolio Theory. In Advances in Neural Information Processing Systems, Vol. 32. Curran Associates, Inc.
[15]
Tomás Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In 1st International Conference on Learning Representations, ICLR 2013, Scottsdale, Arizona, USA, May 2-4, 2013, Workshop Track Proceedings.
[16]
Hamdy Mubarak, Ammar Rashed, Kareem Darwish, Younes Samih, and Ahmed Abdelali. 2021. Arabic Offensive Language on Twitter: Analysis and Experiments. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. Association for Computational Linguistics, Kyiv, Ukraine (Virtual), 126–135.
[17]
Wava Carissa Putri, Rani Aulia Hidayat, Isnaini Nurul Khasanah, and Rahmad Mahendra. 2020. ISWARA at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets using BERT and FastText Embeddings. In Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020). 491–494.
[18]
Julio Savigny and Ayu Purwarianti. 2017. Emotion classification on youtube comments using word embedding. In 2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA). IEEE, 1–5.

Cited By

View all
  • (2023)Effective Language Representations for Danmaku Comment Classification in NicovideoIEICE Transactions on Information and Systems10.1587/transinf.2022DAP0010E106.D:5(838-846)Online publication date: 1-May-2023

Index Terms

  1. Selective Classification of Danmaku Comments Using Distributed Representations
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      iiWAS2021: The 23rd International Conference on Information Integration and Web Intelligence
      November 2021
      658 pages
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 December 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. BERT
      2. Danmaku
      3. selective classification
      4. short text classification

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      • JSPS

      Conference

      iiWAS2021

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)23
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 20 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Effective Language Representations for Danmaku Comment Classification in NicovideoIEICE Transactions on Information and Systems10.1587/transinf.2022DAP0010E106.D:5(838-846)Online publication date: 1-May-2023

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media