XQM: Interactive Learning on Mobile Phones

Alexandra M. Bagi¹⁵,
Kim I. Schild¹⁵,
Omar Shahbaz Khan¹⁵,
Jan Zahálka ORCID: orcid.org/0000-0002-6743-3607¹⁶ &
…
Björn Þór Jónsson ORCID: orcid.org/0000-0003-0889-3491¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12573))

Included in the following conference series:

International Conference on Multimedia Modeling

1844 Accesses
1 Citations

Abstract

There is an increasing need for intelligent interaction with media collections, and mobile phones are gaining significant traction as the device of choice for many users. In this paper, we present XQM, a mobile approach for intelligent interaction with the user’s media on the phone, tackling the inherent challenges of the highly dynamic nature of mobile media collections and limited computational resources of the mobile device. We employ interactive learning, a method that conducts interaction rounds with the user, each consisting of the system suggesting relevant images based on its current model, the user providing relevance labels, the system’s model retraining itself based on these labels, and the system obtaining a new set of suggestions for the next round. This method is suitable for the dynamic nature of mobile media collections and the limited computational resources. We show that XQM, a full-fledged app implemented for Android, operates on 10K image collections in interactive time (less than 1.4 s per interaction round), and evaluate user experience in a user study that confirms XQM’s effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Interactive Learning for Multimedia at Large

MirBot: A Multimodal Interactive Image Retrieval System

Mobile Visual Search for Digital Heritage Applications

Notes

1.
XQM is an acronym of Exquisitor Mobile, as the design of XQM relies heavily on Exquisitor, the state-of-the-art interactive learning system [10]. The XQM app is available to the research community at www.github.com/ITU-DASYALab/XQM.
2.
In the current implementation, at least one positive and one negative example are needed; until these have been identified, random images replace the judged images.
3.
Loading random images is useful when the model is missing positive examples with concepts that have not yet been seen; in a future version we plan to implement search functionality to further help find positive examples.
4.
http://lear.inrialpes.fr/people/jegou/data.php.
5.
http://lsc.dcu.ie/.

References

Aggarwal, C., Kong, X., Gu, Q., Han, J., Yu, P.: Active learning: a survey. In: Data Classification, pp. 571–605. CRC Press (2014)
Google Scholar
Andoni, A., Indyk, P.: Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Commun. ACM 51(1), 117–122 (2008)
Article Google Scholar
Chang, C., Lin, C.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 27:1–27:27 (2011)
Article Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Datar, M., Immorlica, N., Indyk, P., Mirrokni, V.S.: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of SCG, pp. 253–262 (2004)
Google Scholar
Ensor, A., Hall, S.: GPU-based image analysis on mobile devices. CoRR abs/1112.3110 http://arxiv.org/abs/1112.3110 (2011)
Geirhos, R., Temme, C.R.M., Rauber, J., Schütt, H.H., Bethge, M., Wichmann, F.A.: Generalisation in humans and deep neural networks. In: Proceedings of NIPS, pp. 7538–7550 (2018)
Google Scholar
Guðmundsson, G.Þ., Amsaleg, L., Jónsson, B.Þ.: Impact of storage technology on the efficiency of cluster-based high-dimensional index creation. In: Proceedings of DASFAA, pp. 53–64 (2012)
Google Scholar
Jégou, H., Douze, M., Schmid, C.: Product quantization for nearest neighbor search. IEEE PAMI 33(1), 117–128 (2010)
Article Google Scholar
Khan, O.S., et al.: Interactive learning for multimedia at large. In: Jose, J.M., et al. (eds.) ECIR 2020. LNCS, vol. 12035, pp. 495–510. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-45439-5_33
Chapter Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Proceedings of NIPS, pp. 1097–1105 (2012)
Google Scholar
Nielsen, J.: 10 usability heuristics for user interface design (1995). https://www.nngroup.com/articles/ten-usability-heuristics/. Accessed 25 Mar 2020
Settles, B.: Active learning literature survey. Computer Sciences Technical Report 1648, University of Wisconsin-Madison (2009)
Google Scholar
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE PAMI 22(12), 1349–1380 (2000)
Article Google Scholar
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of CVPR, pp. 1–9 (2015)
Google Scholar
Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings of CVPR (2017)
Google Scholar
Zahálka, J., Rudinac, S., Jónsson, B., Koelma, D., Worring, M.: Blackthorn: large-scale interactive multimodal learning. IEEE TMM 20, 687–698 (2018)
Google Scholar
Zahálka, J., Worring, M.: Towards interactive, intelligent, and integrated multimedia analytics. In: Proceedings of IEEE VAST, pp. 3–12 (2014)
Google Scholar
Zhou, X., Huang, T.: Relevance feedback in image retrieval: a comprehensive review. Multimed. Syst. 8, 536–544 (2003)
Article Google Scholar

Download references

Acknowledgments

This work was supported by a PhD grant from the IT University of Copenhagen and by the European Regional Development Fund (project Robotics for Industry 4.0, CZ.02.1.01/0.0/0.0/15 003/0000470). Thanks to Dennis C. Koelma for his help with adopting the ResNext101 model.

Author information

Authors and Affiliations

IT University of Copenhagen, Copenhagen, Denmark
Alexandra M. Bagi, Kim I. Schild, Omar Shahbaz Khan & Björn Þór Jónsson
Czech Technical University, Prague, Czech Republic
Jan Zahálka

Authors

Alexandra M. Bagi
View author publications
You can also search for this author in PubMed Google Scholar
Kim I. Schild
View author publications
You can also search for this author in PubMed Google Scholar
Omar Shahbaz Khan
View author publications
You can also search for this author in PubMed Google Scholar
Jan Zahálka
View author publications
You can also search for this author in PubMed Google Scholar
Björn Þór Jónsson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Björn Þór Jónsson .

Editor information

Editors and Affiliations

Charles University, Prague, Czech Republic
Jakub Lokoč
Charles University, Prague, Czech Republic
Tomáš Skopal
Klagenfurt University, Klagenfurt, Austria
Klaus Schoeffmann
CERTH-ITI, Thessaloniki, Greece
Vasileios Mezaris
Renmin University of China, Beijing, China
Xirong Li
CERTH-ITI, Thessaloniki, Greece
Stefanos Vrochidis
Queen Mary University of London, London, UK
Ioannis Patras

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bagi, A.M., Schild, K.I., Khan, O.S., Zahálka, J., Jónsson, B.Þ. (2021). XQM: Interactive Learning on Mobile Phones. In: Lokoč, J., et al. MultiMedia Modeling. MMM 2021. Lecture Notes in Computer Science(), vol 12573. Springer, Cham. https://doi.org/10.1007/978-3-030-67835-7_24

Download citation

DOI: https://doi.org/10.1007/978-3-030-67835-7_24
Published: 21 January 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67834-0
Online ISBN: 978-3-030-67835-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

XQM: Interactive Learning on Mobile Phones

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Interactive Learning for Multimedia at Large

MirBot: A Multimodal Interactive Image Retrieval System

Mobile Visual Search for Digital Heritage Applications

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

XQM: Interactive Learning on Mobile Phones

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Interactive Learning for Multimedia at Large

MirBot: A Multimodal Interactive Image Retrieval System

Mobile Visual Search for Digital Heritage Applications

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation