Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3400903.3400906acmotherconferencesArticle/Chapter ViewAbstractPublication PagesssdbmConference Proceedingsconference-collections
short-paper

Figure Captioning in Scholarly Literatures to Augment Search Results

Published: 30 July 2020 Publication History

Abstract

Figures convey useful information, such as trends, proportions, and values, in a concise format. People can understand these attributes at a glance, but machine process them difficultly. When searching for figures, the end-user is presented with the caption that does not contain enough information to interpret the figure. In the paper, we propose a novel end-to-end framework for scholarly figure captioning. In the figure parsing module, figures are localized, classified, and analyzed. The plotted data and its association with the legend entries are extracted. In text processing module, the figure-related sentences are identified and measured with the sentence’s relevance to the figure. The sentence subset with the optimum size is selected considering a balance between information content and the size of the generated caption. The final complete captions enable a variety of current exciting applications, such as figure search engine and figure query answering. Empirical experiments show that our proposed framework can effectively generate captions for figures under several metrics.

References

[1]
Sumit Bhatia and Prasenjit Mitra. 2012. Summarizing figures, tables, and algorithms in scientific publications to augment search results. ACM Transactions on Information Systems (TOIS) 30, 1 (2012), 1–24.
[2]
Falk Böschen, Tilman Beck, and Ansgar Scherp. 2018. Survey and empirical comparison of different approaches for text extraction from scholarly figures. Multimedia Tools and Applications 77, 22 (2018), 29475–29505.
[3]
Charles Chen, Ruiyi Zhang, Eunyee Koh, Sungchul Kim, Scott Cohen, Tong Yu, Ryan Rossi, and Razvan Bunescu. 2019. Figure Captioning with Reasoning and Sequence-Level Training. arXiv preprint arXiv:1906.02850(2019).
[4]
Christopher Andreas Clark and Santosh Divvala. 2015. Looking beyond text: Extracting figures, tables and captions from computer science papers. In Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence.
[5]
Mathieu Cliche, David Rosenberg, Dhruv Madeka, and Connie Yee. 2017. Scatteract: Automated extraction of data from scatter plots. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 135–150.
[6]
Xiaobing Han, Yanfei Zhong, Liqin Cao, and Liangpei Zhang. 2017. Pre-trained alexnet architecture with pyramid pooling and supervision for high spatial resolution remote sensing image scene classification. Remote Sensing 9, 8 (2017), 848.
[7]
Weihua Huang and Chew Lim Tan. 2007. A system for understanding imaged infographics and its applications. In Proceedings of the 2007 ACM symposium on Document engineering. 9–18.
[8]
Daekyoung Jung, Wonjae Kim, Hyunjoo Song, Jeong-in Hwang, Bongshin Lee, Bohyoung Kim, and Jinwook Seo. 2017. ChartSense: Interactive data extraction from chart images. In Proceedings of the 2017 chi conference on human factors in computing systems. 6706–6717.
[9]
Heechul Jung, Min-Kook Choi, Jihun Jung, Jin-Hee Lee, Soon Kwon, and Woo Young Jung. 2017. ResNet-based vehicle classification and localization in traffic surveillance systems. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 61–67.
[10]
David Morris, Peichen Tang, and Ralph Ewerth. 2019. A neural approach for text extraction from scholarly figures. In 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1438–1443.
[11]
Jorge Poco and Jeffrey Heer. 2017. Reverse-engineering visualizations: Recovering visual encodings from chart images. In Computer Graphics Forum, Vol. 36. Wiley Online Library, 353–363.
[12]
Noah Siegel, Zachary Horvitz, Roie Levin, Santosh Divvala, and Ali Farhadi. 2016. FigureSeer: Parsing result-figures in research papers. In European Conference on Computer Vision. Springer, 664–680.
[13]
Yunchao Wei, Wei Xia, Min Lin, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, and Shuicheng Yan. 2015. HCP: A flexible CNN framework for multi-label image classification. IEEE transactions on pattern analysis and machine intelligence 38, 9(2015), 1901–1907.
[14]
Anna Wilbik, James M Keller, and Gregory Lynn Alexander. 2011. Linguistic summarization of sensor data for eldercare. In 2011 IEEE International Conference on Systems, Man, and Cybernetics. IEEE, 2595–2599.

Cited By

View all
  • (2023)Scientific document processing: challenges for modern learning methodsInternational Journal on Digital Libraries10.1007/s00799-023-00352-724:4(283-309)Online publication date: 24-Mar-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
SSDBM '20: Proceedings of the 32nd International Conference on Scientific and Statistical Database Management
July 2020
241 pages
ISBN:9781450388146
DOI:10.1145/3400903
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 July 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Figure Captioning
  2. Figure Search Engine
  3. Figures Parsing
  4. Relevant Sentence Extraction

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

SSDBM 2020

Acceptance Rates

Overall Acceptance Rate 56 of 146 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)1
Reflects downloads up to 22 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Scientific document processing: challenges for modern learning methodsInternational Journal on Digital Libraries10.1007/s00799-023-00352-724:4(283-309)Online publication date: 24-Mar-2023

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media