short-paper

Figure Captioning in Scholarly Literatures to Augment Search Results

Authors:

Jingya Yang,

Dongdong Zhang,

Gaocai Dong,

Jing PengAuthors Info & Claims

SSDBM '20: Proceedings of the 32nd International Conference on Scientific and Statistical Database Management

Article No.: 15, Pages 1 - 4

https://doi.org/10.1145/3400903.3400906

Published: 30 July 2020 Publication History

Get Access

Abstract

Figures convey useful information, such as trends, proportions, and values, in a concise format. People can understand these attributes at a glance, but machine process them difficultly. When searching for figures, the end-user is presented with the caption that does not contain enough information to interpret the figure. In the paper, we propose a novel end-to-end framework for scholarly figure captioning. In the figure parsing module, figures are localized, classified, and analyzed. The plotted data and its association with the legend entries are extracted. In text processing module, the figure-related sentences are identified and measured with the sentence’s relevance to the figure. The sentence subset with the optimum size is selected considering a balance between information content and the size of the generated caption. The final complete captions enable a variety of current exciting applications, such as figure search engine and figure query answering. Empirical experiments show that our proposed framework can effectively generate captions for figures under several metrics.

References

[1]

Sumit Bhatia and Prasenjit Mitra. 2012. Summarizing figures, tables, and algorithms in scientific publications to augment search results. ACM Transactions on Information Systems (TOIS) 30, 1 (2012), 1–24.

Digital Library

Google Scholar

[2]

Falk Böschen, Tilman Beck, and Ansgar Scherp. 2018. Survey and empirical comparison of different approaches for text extraction from scholarly figures. Multimedia Tools and Applications 77, 22 (2018), 29475–29505.

Digital Library

Google Scholar

[3]

Charles Chen, Ruiyi Zhang, Eunyee Koh, Sungchul Kim, Scott Cohen, Tong Yu, Ryan Rossi, and Razvan Bunescu. 2019. Figure Captioning with Reasoning and Sequence-Level Training. arXiv preprint arXiv:1906.02850(2019).

Google Scholar

[4]

Christopher Andreas Clark and Santosh Divvala. 2015. Looking beyond text: Extracting figures, tables and captions from computer science papers. In Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence.

Google Scholar

[5]

Mathieu Cliche, David Rosenberg, Dhruv Madeka, and Connie Yee. 2017. Scatteract: Automated extraction of data from scatter plots. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 135–150.

Crossref

Google Scholar

[6]

Xiaobing Han, Yanfei Zhong, Liqin Cao, and Liangpei Zhang. 2017. Pre-trained alexnet architecture with pyramid pooling and supervision for high spatial resolution remote sensing image scene classification. Remote Sensing 9, 8 (2017), 848.

Crossref

Google Scholar

[7]

Weihua Huang and Chew Lim Tan. 2007. A system for understanding imaged infographics and its applications. In Proceedings of the 2007 ACM symposium on Document engineering. 9–18.

Digital Library

Google Scholar

[8]

Daekyoung Jung, Wonjae Kim, Hyunjoo Song, Jeong-in Hwang, Bongshin Lee, Bohyoung Kim, and Jinwook Seo. 2017. ChartSense: Interactive data extraction from chart images. In Proceedings of the 2017 chi conference on human factors in computing systems. 6706–6717.

Digital Library

Google Scholar

[9]

Heechul Jung, Min-Kook Choi, Jihun Jung, Jin-Hee Lee, Soon Kwon, and Woo Young Jung. 2017. ResNet-based vehicle classification and localization in traffic surveillance systems. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 61–67.

Crossref

Google Scholar

[10]

David Morris, Peichen Tang, and Ralph Ewerth. 2019. A neural approach for text extraction from scholarly figures. In 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1438–1443.

Crossref

Google Scholar

[11]

Jorge Poco and Jeffrey Heer. 2017. Reverse-engineering visualizations: Recovering visual encodings from chart images. In Computer Graphics Forum, Vol. 36. Wiley Online Library, 353–363.

Google Scholar

[12]

Noah Siegel, Zachary Horvitz, Roie Levin, Santosh Divvala, and Ali Farhadi. 2016. FigureSeer: Parsing result-figures in research papers. In European Conference on Computer Vision. Springer, 664–680.

Crossref

Google Scholar

[13]

Yunchao Wei, Wei Xia, Min Lin, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, and Shuicheng Yan. 2015. HCP: A flexible CNN framework for multi-label image classification. IEEE transactions on pattern analysis and machine intelligence 38, 9(2015), 1901–1907.

Google Scholar

[14]

Anna Wilbik, James M Keller, and Gregory Lynn Alexander. 2011. Linguistic summarization of sensor data for eldercare. In 2011 IEEE International Conference on Systems, Man, and Cybernetics. IEEE, 2595–2599.

Crossref

Google Scholar

Cited By

View all

Ramesh Kashyap AYang YKan M(2023)Scientific document processing: challenges for modern learning methodsInternational Journal on Digital Libraries10.1007/s00799-023-00352-724:4(283-309)Online publication date: 24-Mar-2023
https://doi.org/10.1007/s00799-023-00352-7

Recommendations

A Formative Study on Designing Accurate and Natural Figure Captioning Systems
CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems

Automatic figure captioning is widely useful for improving the readability and accessibility of figures. Despite recent advances in figure question answering and parsing figure elements that enable machines to accurately read information from figures, ...
Generating Accurate Caption Units for Figure Captioning
WWW '21: Proceedings of the Web Conference 2021

Scientific-style figures are commonly used on the web to present numerical information. Captions that tell accurate figure information and sound natural would significantly improve figure accessibility. In this paper, we present promising results on ...
Re-ranking search results using query logs
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management

This work addresses two common problems in search, frequently occurring with underspecified user queries: the top-ranked results for such queries may not contain documents relevant to the user's search intent, and fresh and relevant pages may not get ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

SSDBM '20: Proceedings of the 32nd International Conference on Scientific and Statistical Database Management

July 2020

241 pages

ISBN:9781450388146

DOI:10.1145/3400903

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 July 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper
Research
Refereed limited

Conference

SSDBM 2020

SSDBM 2020: 32nd International Conference on Scientific and Statistical Database Management

July 7 - 9, 2020

Vienna, Austria

Acceptance Rates

Overall Acceptance Rate 56 of 146 submissions, 38%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
66
Total Downloads

Downloads (Last 12 months)6
Downloads (Last 6 weeks)1

Reflects downloads up to 22 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Ramesh Kashyap AYang YKan M(2023)Scientific document processing: challenges for modern learning methodsInternational Journal on Digital Libraries10.1007/s00799-023-00352-724:4(283-309)Online publication date: 24-Mar-2023
https://doi.org/10.1007/s00799-023-00352-7

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Abstract

References

Cited By

Recommendations

A Formative Study on Designing Accurate and Natural Figure Captioning Systems

Generating Accurate Caption Units for Figure Captioning

Re-ranking search results using query logs

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

HTML Format

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations