Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2661829.2662066acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

High Impact Academic Paper Prediction Using Temporal and Topological Features

Published: 03 November 2014 Publication History

Abstract

Predicting promising academic papers is useful for a variety of parties, including researchers, universities, scientific councils, and policymakers. Researchers may benefit from such data to narrow down their reading list and focus on what will be important, and policymakers may use predictions to infer rising fields for a more strategic distribution of resources. This paper proposes a novel technique to predict a paper's future impact (i.e., number of citations) by using temporal and topological features derived from citation networks. We use a behavioral modeling approach in which the temporal change in the number of citations a paper gets is clustered, and new papers are evaluated accordingly. Then, within each cluster, we model the impact prediction as a regression problem where the objective is to predict the number of citations that a paper will get in the near or far future, given the early citation performance of the paper. The results of empirical evaluations on data from several well-known citation databases show that the proposed framework performs significantly better than the state of the art approaches.

References

[1]
CiteSeerX Digital Library. http://citeseerx.ist.psu.edu/, 2008. {Online; accessed 23-March-2014}.
[2]
J. Adams. Early citation counts correlate with accumulated impact. Scientometrics, 63(3):567--581, 2005.
[3]
S. Bani-Ahmad and G. Ozsoyoglu. On popularity quality: growth and decay phases of publication popularities. In Innovations in Information Technology, 2009. IIT'09. International Conference on, pages 165--169. IEEE, 2009.
[4]
A.-L. Barabási and R. Albert. Emergence of scaling in random networks. science, 286(5439):509--512, 1999.
[5]
C. Castillo, D. Donato, and A. Gionis. Estimating number of citations using author reputation. In String processing and information retrieval, pages 107--117. Springer, 2007.
[6]
W.-Y. Chen, Y. Song, H. Bai, C.-J. Lin, and E. Y. Chang. Parallel Spectral Clustering in Distributed Systems, 2010.
[7]
E. Garfield. The history and meaning of the journal impact factor. JAMA: the journal of the American Medical Association, 295(1):90--93, 2006.
[8]
J. Gehrke, P. Ginsparg, and J. Kleinberg. Overview of the 2003 kdd cup. ACM SIGKDD Explorations Newsletter, 5(2):149--151, 2003.
[9]
J. E. Hirsch. An index to quantify an individual's scientific research output. Proceedings of the National academy of Sciences of the United States of America, 102(46):16569, 2005.
[10]
E. F. Krause. Taxicab geometry. The Mathematics Teacher, pages 695--706, 1973.
[11]
J. Leskovec, J. Kleinberg, and C. Faloutsos. Graphs over time: densification laws, shrinking diameters and possible explanations. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, pages 177--187. ACM, 2005.
[12]
D. Liben-Nowell and J. Kleinberg. The link-prediction problem for social networks. Journal of the American society for information science and technology, 58(7):1019--1031, 2007.
[13]
M. H. MacRoberts and B. R. MacRoberts. Problems of citation analysis: A critical review. Journal of the American Society for Information Science, 40(5):342--349, 1989.
[14]
J. Manjunatha, K. Sivaramakrishnan, R. K. Pandey, and M. N. Murthy. Citation prediction using time series approach kdd cup 2003 (task 1). ACM SIGKDD Explorations Newsletter, 5(2):152--153, 2003.
[15]
K. V. Mardia, J. T. Kent, and J. M. Bibby. Multivariate analysis. 1980.
[16]
W. S. McCulloch and W. Pitts. A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics, 5(4):115--133, 1943.
[17]
D. McNamara, P. Wong, P. Christen, and K. S. Ng. Predicting high impact academic papers using citation network features. In Trends and Applications in Knowledge Discovery and Data Mining, pages 14--25. Springer, 2013.
[18]
A. Y. Ng, M. I. Jordan, Y. Weiss, et al. On spectral clustering: Analysis and an algorithm. Advances in neural information processing systems, 2:849--856, 2002.
[19]
H. Sakoe and S. Chiba. Dynamic programming algorithm optimization for spoken word recognition. Acoustics, Speech and Signal Processing, IEEE Transactions on, 26(1):43--49, 1978.
[20]
X. Shi, J. Leskovec, and D. A. McFarland. Citing for high impact. In Proceedings of the 10th annual joint conference on Digital libraries, pages 49--58. ACM, 2010.
[21]
N. Shibata, Y. Kajikawa, and K. Matsushima. Topological analysis of citation networks to discover the future core articles. Journal of the American Society for Information Science and Technology, 58(6):872--882, 2007.
[22]
J. Tang, D. Zhang, and L. Yao. Social network extraction of academic researchers. In ICDM'07, pages 292--301, 2007.
[23]
V. Vapnik. The nature of statistical learning theory. springer, 2000.
[24]
R. Yan, J. Tang, X. Liu, D. Shan, and X. Li. Citation count prediction: learning to estimate future citations for literature. In Proceedings of the 20th ACM international conference on Information and knowledge management, pages 1247--1252. ACM, 2011.
[25]
D. Yogatama, M. Heilman, B. O'Connor, C. Dyer, B. R. Routledge, and N. A. Smith. Predicting a scientific community's response to an article. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 594--604. Association for Computational Linguistics, 2011.

Cited By

View all
  • (2024)Citation Count Prediction for Newly Published Papers最新論文に適用可能な被引用数予測Transactions of the Japanese Society for Artificial Intelligence10.1527/tjsai.39-5_B-O1139:5(B-O11_1-12)Online publication date: 1-Sep-2024
  • (2022)Twin PapersProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557716(4444-4448)Online publication date: 17-Oct-2022
  • (2022)A review of scientific impact prediction: tasks, features and methodsScientometrics10.1007/s11192-022-04547-8128:1(543-585)Online publication date: 26-Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
November 2014
2152 pages
ISBN:9781450325981
DOI:10.1145/2661829
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. citation count prediction
  2. clustering
  3. network analysis
  4. regression
  5. time series

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM '14
Sponsor:

Acceptance Rates

CIKM '14 Paper Acceptance Rate 175 of 838 submissions, 21%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)4
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Citation Count Prediction for Newly Published Papers最新論文に適用可能な被引用数予測Transactions of the Japanese Society for Artificial Intelligence10.1527/tjsai.39-5_B-O1139:5(B-O11_1-12)Online publication date: 1-Sep-2024
  • (2022)Twin PapersProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557716(4444-4448)Online publication date: 17-Oct-2022
  • (2022)A review of scientific impact prediction: tasks, features and methodsScientometrics10.1007/s11192-022-04547-8128:1(543-585)Online publication date: 26-Nov-2022
  • (2022)BERT-Based Scientific Paper Quality PredictionArtificial Neural Networks and Machine Learning – ICANN 202210.1007/978-3-031-15937-4_18(212-223)Online publication date: 7-Sep-2022
  • (2021)Interpretable Aspect-Aware Capsule Network for Peer Review Based Citation Count PredictionACM Transactions on Information Systems10.1145/346664040:1(1-29)Online publication date: 24-Nov-2021
  • (2021)Understanding the Inter-Domain Presence of Research Topics in the Computing DisciplineIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2018.28695569:1(366-378)Online publication date: 1-Jan-2021
  • (2021)Early indicators of scientific impact: Predicting citations with altmetricsJournal of Informetrics10.1016/j.joi.2020.10112815:2(101128)Online publication date: May-2021
  • (2021)SIMILAR – Systematic iterative multilayer literature review methodJournal of Informetrics10.1016/j.joi.2020.10111115:1(101111)Online publication date: Feb-2021
  • (2021)Evaluating BERT-based scientific relation classifiers for scholarly knowledge graph construction on digital library collectionsInternational Journal on Digital Libraries10.1007/s00799-021-00313-y23:2(197-215)Online publication date: 2-Nov-2021
  • (2020)Emerging Scientific Field Detection Using Citation Networks and Topic Models—A Case Study of the Nanocarbon FieldApplied System Innovation10.3390/asi30300403:3(40)Online publication date: 14-Sep-2020
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media