Enriching textbooks with images

Published: 24 October 2011


Textbooks have a direct bearing on the quality of education imparted to students. It is therefore of paramount importance that the educational content of textbooks provide a rich learning experience. Recent studies on learning behavior suggest that incorporating digital visual material can greatly enhance learning. However, textbooks used in many developing regions are largely text-oriented and lack good visual material. We propose techniques for finding images from the web that are most relevant for augmenting a section of a textbook, while respecting the constraint that the same image is not repeated in different sections of the same chapter. We devise a rigorous formulation of the image assignment problem and present a polynomial-time algorithm that solves it optimally. We also present two image mining algorithms that utilize orthogonal signals and hence obtain different sets of relevant images. Finally, we provide an ensembling algorithm for combining the assignments. To evaluate our techniques empirically, we use a corpus of high school textbooks in use in India. Our user study on the Amazon Mechanical Turk platform indicates that the proposed techniques obtain images that can help increase understanding of the textbook material.
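The abstract does not spell out the formulation or the algorithm, so the following is only an illustrative sketch of one way such an assignment problem can be solved in polynomial time, not the authors' method. It assumes each (section, image) pair has a precomputed relevance score, that each section receives at most `per_section` images, and that each image is used at most once per chapter, and it maximizes total relevance with a small min-cost flow. The function name `assign_images`, the `per_section` parameter, and the sample scores are all hypothetical.

```python
from collections import deque

def assign_images(relevance, per_section=1):
    """Pick images for sections so that total relevance is maximal and
    no image is used by more than one section (illustrative sketch).

    relevance: dict mapping section -> dict mapping image -> positive score.
    Returns a dict mapping section -> list of assigned images.
    """
    sections = list(relevance)
    images = sorted({img for scores in relevance.values() for img in scores})
    n_sec = len(sections)
    source, sink = 0, n_sec + len(images) + 1
    img_node = {img: n_sec + 1 + j for j, img in enumerate(images)}
    # Each edge is [target, residual capacity, cost, index of reverse edge].
    graph = [[] for _ in range(sink + 1)]

    def add_edge(u, v, cap, cost):
        graph[u].append([v, cap, cost, len(graph[v])])
        graph[v].append([u, 0, -cost, len(graph[u]) - 1])

    for i, sec in enumerate(sections, start=1):
        add_edge(source, i, per_section, 0.0)       # section capacity
        for img, score in relevance[sec].items():
            add_edge(i, img_node[img], 1, -score)   # negated: min cost = max relevance
    for img in images:
        add_edge(img_node[img], sink, 1, 0.0)       # each image used at most once

    while True:
        # Queue-based Bellman-Ford: cheapest augmenting path in the residual graph.
        dist = [float("inf")] * len(graph)
        prev = [None] * len(graph)
        queued = [False] * len(graph)
        dist[source] = 0.0
        queue = deque([source])
        while queue:
            u = queue.popleft()
            queued[u] = False
            for k, (v, cap, cost, _) in enumerate(graph[u]):
                if cap > 0 and dist[u] + cost < dist[v] - 1e-12:
                    dist[v] = dist[u] + cost
                    prev[v] = (u, k)
                    if not queued[v]:
                        queue.append(v)
                        queued[v] = True
        # Stop when no path exists or the best path no longer adds relevance.
        if prev[sink] is None or dist[sink] >= 0:
            break
        v = sink
        while v != source:                          # push one unit of flow
            u, k = prev[v]
            graph[u][k][1] -= 1
            graph[v][graph[u][k][3]][1] += 1
            v = u

    result = {sec: [] for sec in sections}
    for i, sec in enumerate(sections, start=1):
        for v, cap, _, _ in graph[i]:
            if v > n_sec and cap == 0:              # saturated section -> image edge
                result[sec].append(images[v - n_sec - 1])
    return result

# Hypothetical scores: both sections prefer "leaf.jpg", but it may be used once.
relevance = {
    "1.1 Photosynthesis": {"leaf.jpg": 0.9, "chloroplast.jpg": 0.8},
    "1.2 Respiration": {"leaf.jpg": 0.7, "mitochondria.jpg": 0.2},
}
assignment = assign_images(relevance)
print(assignment)
```

In this toy input, giving each section its top-scoring image would repeat `leaf.jpg`, and a greedy fix (section 1.1 keeps it, section 1.2 falls back to `mitochondria.jpg`) yields a total of 1.1. The flow-based sketch instead assigns `chloroplast.jpg` to 1.1 and `leaf.jpg` to 1.2 for a total of 1.5, which is why a global optimization is preferable to per-section ranking under the no-repeat constraint.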



    Published In

    CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
    October 2011
    2712 pages



    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. data mining
    2. education
    3. image mining
    4. text augmentation
    5. textbooks


    • Research-article


    CIKM '11

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%
