Abstract
An overwhelming number of papers have been published about COVID-19, many of which appear on preprint servers such as arXiv before formal publication. Researchers and clinicians can stay ahead of the curve by drawing on these preprints, but how can they tell what is worth reading? Could recommendation be automated? In this paper we address this question by experimenting with SPECTER, a document-level vector embedding that builds its representations on state-of-the-art Transformer models such as SciBERT, a BERT variant tailored to scientific text. We apply the SPECTER embedding to the CORD-19 dataset.
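The recommendation step described above can be sketched as a nearest-neighbour search over SPECTER document embeddings. The snippet below is a minimal illustration, not the authors' actual pipeline: the 4-dimensional vectors are made-up stand-ins for real SPECTER embeddings (which are 768-dimensional), and the ranking simply uses scikit-learn's cosine similarity.

```python
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

# Illustrative stand-ins for SPECTER document embeddings
# (real SPECTER vectors are 768-dimensional).
corpus_embeddings = np.array([
    [0.90, 0.10, 0.00, 0.20],  # paper A
    [0.10, 0.80, 0.30, 0.00],  # paper B
    [0.85, 0.15, 0.05, 0.10],  # paper C
])
query_embedding = np.array([[0.90, 0.10, 0.00, 0.15]])  # a paper the reader liked

# Rank corpus papers by cosine similarity to the query paper,
# most similar first.
scores = cosine_similarity(query_embedding, corpus_embeddings)[0]
ranking = np.argsort(scores)[::-1]
print(ranking.tolist())  # -> [0, 2, 1]: paper A is closest, then C, then B
```

In a real setting the corpus embeddings would be precomputed once for all CORD-19 papers, so each recommendation query reduces to a single similarity computation.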
This work was supported by JST (JPMJMS2033). The last author would like to thank the Advanced Telecommunications Research Institute for hosting his research visit.
References
Beltagy, I., Lo, K., Cohan, A.: SciBERT: a pretrained language model for scientific text. arXiv preprint arXiv:1903.10676 (2019)
Chen, Q., Allot, A., Lu, Z.: Keep up with the latest coronavirus research. Nature 579(7798), 193 (2020). https://doi.org/10.1038/d41586-020-00694-1
Chen, Q., Allot, A., Lu, Z.: LitCovid: an open database of COVID-19 literature. Nucleic Acids Res. 49, D1534–D1540 (2020)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2019)
Neumann, P.M.: The Mathematical Writings of Évariste Galois. European Mathematical Society (2011)
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
SCImago: SJR - SCImago Journal & Country Rank [Portal] (2021). http://www.scimagojr.com. Accessed 29 Apr 2021
Vaswani, A., et al.: Attention is all you need. arXiv preprint arXiv:1706.03762 (2017)
Wang, L.L., Lo, K.: Text mining approaches for dealing with the rapidly expanding literature on COVID-19. Brief. Bioinform. 22(2), 781–799 (2020). https://doi.org/10.1093/bib/bbaa296
Acknowledgment
The authors are grateful to Ryohei Sasano for his help with the experimental part of this work.
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Xu, T., Hinton, N., Bennett, M.T., Maruyama, Y. (2022). Natural Language Processing for Scientific Paper Evaluation: Comparing Human and Machine Judgements. In: Stephanidis, C., Antona, M., Ntoa, S., Salvendy, G. (eds) HCI International 2022 – Late Breaking Posters. HCII 2022. Communications in Computer and Information Science, vol 1655. Springer, Cham. https://doi.org/10.1007/978-3-031-19682-9_90
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19681-2
Online ISBN: 978-3-031-19682-9