Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Mobile Multi-Food Recognition Using Deep Learning

Published: 10 August 2017 Publication History

Abstract

In this article, we propose a mobile food recognition system that uses the picture of the food, taken by the user’s mobile device, to recognize multiple food items in the same meal, such as steak and potatoes on the same plate, to estimate the calorie and nutrition of the meal. To speed up and make the process more accurate, the user is asked to quickly identify the general area of the food by drawing a bounding circle on the food picture by touching the screen. The system then uses image processing and computational intelligence for food item recognition. The advantage of recognizing items, instead of the whole meal, is that the system can be trained with only single item food images. At the training stage, we first use region proposal algorithms to generate candidate regions and extract the convolutional neural network (CNN) features of all regions. Second, we perform region mining to select positive regions for each food category using maximum cover by our proposed submodular optimization method. At the testing stage, we first generate a set of candidate regions. For each region, a classification score is computed based on its extracted CNN features and predicted food names of the selected regions. Since fast response is one of the important parameters for the user who wants to eat the meal, certain heavy computational parts of the application are offloaded to the cloud. Hence, the processes of food recognition and calorie estimation are performed in cloud server. Our experiments, conducted with the FooDD dataset, show an average recall rate of 90.98%, precision rate of 93.05%, and accuracy of 94.11% compared to 50.8% to 88% accuracy of other existing food recognition systems.

References

[1]
www.obesitynetwork.ca.
[2]
http://www.who.int.
[3]
Parisa Pouladzadeh, Shervin Shirmohammadi, and Abdulsalam Yassine. 2016. You are what you eat: So, measure what you eat. IEEE Instrum. Meas. Mag. 19, 1, 9--15.
[4]
Parisa Pouladzadeh, Pallavi Kuhad, Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2016. Calorie measurement and food classification using deep learning neural network In Proceedings of the IEEE International Conference on Instrumentation and Measurement Technology (I2MTC’16).
[5]
P. Pouladzadeh, A. Yassine, and S. Shirmohammadi. 2015 FooDD: Food detection dataset for calorie measurement using food images, in new trends in image analysis and processing. In Proceedings of the ICIAP 2015 Workshops, V. Murino, E. Puppo, D. Sona, M. Cristani, and C. Sansone (eds.). Lecture Notes in Computer Science, Springer, Vol. 9281, 441--448.
[6]
Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2015. Cloud based virtualization for a calorie measurement e-health mobile application. In Proceedings of the 2015 International Conference on Multimedia and Expo Workshops (ICME’15). 1--6
[7]
Lukas Bossard, Matthieu Guillaumin, and Luc Van Gool. 2014. Food-101--mining discriminative components with random forests. In Proceeding of the European Conference on Computer Vision--(ECCV’14). 446--461.
[8]
Yuji Matsuda, Hajime Hoashi, and Keiji Yanai. 2012. Recognition of multiple-food images by detecting candidate regions. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME’12). 25--30.
[9]
Yoshihiro Kawano and Katsuki Yanai. 2013. Real-time mobile food recognition system. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’13). 1--7.
[10]
Weiyu Zhang, Qian Yu, Behjat Siddiquie, Ajay Divakaran, and Harpreet Sawhney. 2015. Food recognition and nutrition estimation on a smartphone. J. Diabetes Sci. Technol. 9, 3, 525--533.
[11]
Satoru Fujishige. 2005. Submodular Functions and Optimization, Vol. 58. Elsevier.
[12]
Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Cafe: Convolutional architecture for fast feature embedding. In Proceedings of the ACM International Conference on Multimedia. 675--678.
[13]
Laurence Awolsey. 1982. An analysis of the greedy algorithm for the submodular set covering problem. Combinatorica 2, 4, 385--393.
[14]
Fengqing Zhu, Marc Bosch, Nitin Khanna, and Carol J. Boushey. 2015. Multiple hypotheses image segmentation and classification with application to dietary assessment. IEEE J. Biomed. Health Informat. 19, 1, 377--389.
[15]
Hokuto Kagaya, Kiyoharu Aizawa, and Makoto Ogawa. 2014. Food detection and recognition using convolutional neural network. In Proceedings of the ACM International Conference on Multimedia, 1085--1088.
[16]
Kiyoharu Aizawa and Makoto Ogawa. 2015. FoodLog: Multimedia tool for healthcare applications. IEEE MultiMed. 22, 2, 4--9.
[17]
Sosuke Amano, Kiyoharu Aizawa, and Makoto Ogawa. 2015. Food category representatives: Extracting categories from meal names in food recordings and recipe data. In Proceedings of the IEEE International Conference on Multimedia Big Data. 48--55.
[18]
Colin Ware. 2008. Toward a perceptual theory of flow visualization. IEEE Comput. Graph. Appl. 28, 2 (2008), 6--11.
[19]
Xi-Jin Zhang, Yi-Fan Lu, and Song-Hai Zhang. 2016. Multi-task learning for food identification and analysis with deep convolutional neural networks. J. Comput. Sci. Technol. 31, 3, 489--500.
[20]
Morteza Akbari Fard, Hamed Haddadi, and Alireza Tavakoli Targhi. 2016. Fruits and vegetables calorie counter using, convolutional neural networks. In ACM Dig. Health, 121--122.
[21]
Ashutosh Singla, Lin Yuan, and Touradj Ebrahimi. 2016. Food/non-food image classification and food categorization using pre-trained googlenet model. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 3--11.
[22]
Wataru Shimoda Keiji Yanai. 2016. Foodness proposal for multiple food detection by training of single food images. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 13--21.
[23]
Joachim Dehais, Marios Anthimopoulos, and Stavroula Mougiakakou. 2016. Food image segmentation for dietary assessment. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 23--28.
[24]
A. Krizhevsky, I. Sutskever, and G. Hinton, 2012. Imagenet classification with deep convolutional neural networks. In Proceedings of Neural Information Processing Systems (NIPS’12).
[25]
M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean and G. E. Hinton. 2013. Onrectied linear units for speech processing. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP’13).
[26]
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. 1998. “Gradientbased learning applied to document recognition. Proc. IEEE, 86, 11: 2278--2324.
[27]
Parisa Pouladzadeh, Shervin Shirmohammadi, and Rana Almaghrabi. 2014. Measuring calorie and nutrition from food image. IEEE Trans. Instrument. Measure. 63, 8, 1947--1956.
[28]
Z. Su, Q. XU, M. Fei, and M. Dong. 2016. Game theoretic resource allocation in media cloud with mobile social users. IEEE Trans. Multimed. (TMM) 18, 8, 1650--1660.

Cited By

View all
  • (2024)A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food RecognitionNutrients10.3390/nu1602020016:2(200)Online publication date: 8-Jan-2024
  • (2024)Lightweight Food Recognition via Aggregation Block and Feature EncodingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/368028520:10(1-25)Online publication date: 22-Jul-2024
  • (2024)FIRE: Food Image to REcipe generation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00800(8169-8179)Online publication date: 3-Jan-2024
  • Show More Cited By

Index Terms

  1. Mobile Multi-Food Recognition Using Deep Learning

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 13, Issue 3s
    Special Section on Deep Learning for Mobile Multimedia and Special Section on Best Papers from ACM MMSys/NOSSDAV 2016
    August 2017
    258 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3119899
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 August 2017
    Accepted: 01 March 2017
    Revised: 01 February 2017
    Received: 01 October 2016
    Published in TOMM Volume 13, Issue 3s

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Mobile food recognition
    2. cloud computing
    3. deep learning

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)99
    • Downloads (Last 6 weeks)14
    Reflects downloads up to 26 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food RecognitionNutrients10.3390/nu1602020016:2(200)Online publication date: 8-Jan-2024
    • (2024)Lightweight Food Recognition via Aggregation Block and Feature EncodingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/368028520:10(1-25)Online publication date: 22-Jul-2024
    • (2024)FIRE: Food Image to REcipe generation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00800(8169-8179)Online publication date: 3-Jan-2024
    • (2024)Lightweight Food Image Recognition With Global Shuffle ConvolutionIEEE Transactions on AgriFood Electronics10.1109/TAFE.2024.33867132:2(392-402)Online publication date: Sep-2024
    • (2024)Preliminary results on food weight estimation with RGB-D images2024 14th International Conference on Pattern Recognition Systems (ICPRS)10.1109/ICPRS62101.2024.10677821(1-7)Online publication date: 15-Jul-2024
    • (2024)HealthMate: A Comprehensive Mobile Application for Personalized Health and Fitness Management2024 IEEE 3rd World Conference on Applied Intelligence and Computing (AIC)10.1109/AIC61668.2024.10730864(1304-1309)Online publication date: 27-Jul-2024
    • (2024)Food Recognition and Segmentation Using Detectron2 FrameworkAdvanced Technologies, Systems, and Applications IX10.1007/978-3-031-71694-2_30(409-419)Online publication date: 1-Oct-2024
    • (2023)Enhancing Object Detection for VIPs Using YOLOv4_Resnet101 and Text-to-Speech Conversion ModelMultimodal Technologies and Interaction10.3390/mti70800777:8(77)Online publication date: 2-Aug-2023
    • (2023)Smart Diet Diary: Real-Time Mobile Application for Food RecognitionApplied System Innovation10.3390/asi60200536:2(53)Online publication date: 20-Apr-2023
    • (2023)Image Processing Model to Estimate Nutritional Values in Raw and Cooked Vegetables2023 34th Conference of Open Innovations Association (FRUCT)10.23919/FRUCT60429.2023.10328158(183-191)Online publication date: 15-Nov-2023
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media