research-article

Mobile Multi-Food Recognition Using Deep Learning

Authors:

Parisa Pouladzadeh,

Shervin ShirmohammadiAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 13, Issue 3s

Article No.: 36, Pages 1 - 21

https://doi.org/10.1145/3063592

Published: 10 August 2017 Publication History

Abstract

In this article, we propose a mobile food recognition system that uses the picture of the food, taken by the user’s mobile device, to recognize multiple food items in the same meal, such as steak and potatoes on the same plate, to estimate the calorie and nutrition of the meal. To speed up and make the process more accurate, the user is asked to quickly identify the general area of the food by drawing a bounding circle on the food picture by touching the screen. The system then uses image processing and computational intelligence for food item recognition. The advantage of recognizing items, instead of the whole meal, is that the system can be trained with only single item food images. At the training stage, we first use region proposal algorithms to generate candidate regions and extract the convolutional neural network (CNN) features of all regions. Second, we perform region mining to select positive regions for each food category using maximum cover by our proposed submodular optimization method. At the testing stage, we first generate a set of candidate regions. For each region, a classification score is computed based on its extracted CNN features and predicted food names of the selected regions. Since fast response is one of the important parameters for the user who wants to eat the meal, certain heavy computational parts of the application are offloaded to the cloud. Hence, the processes of food recognition and calorie estimation are performed in cloud server. Our experiments, conducted with the FooDD dataset, show an average recall rate of 90.98%, precision rate of 93.05%, and accuracy of 94.11% compared to 50.8% to 88% accuracy of other existing food recognition systems.

References

[1]

www.obesitynetwork.ca.

[2]

http://www.who.int.

[3]

Parisa Pouladzadeh, Shervin Shirmohammadi, and Abdulsalam Yassine. 2016. You are what you eat: So, measure what you eat. IEEE Instrum. Meas. Mag. 19, 1, 9--15.

[4]

Parisa Pouladzadeh, Pallavi Kuhad, Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2016. Calorie measurement and food classification using deep learning neural network In Proceedings of the IEEE International Conference on Instrumentation and Measurement Technology (I2MTC’16).

[5]

P. Pouladzadeh, A. Yassine, and S. Shirmohammadi. 2015 FooDD: Food detection dataset for calorie measurement using food images, in new trends in image analysis and processing. In Proceedings of the ICIAP 2015 Workshops, V. Murino, E. Puppo, D. Sona, M. Cristani, and C. Sansone (eds.). Lecture Notes in Computer Science, Springer, Vol. 9281, 441--448.

Digital Library

[6]

Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2015. Cloud based virtualization for a calorie measurement e-health mobile application. In Proceedings of the 2015 International Conference on Multimedia and Expo Workshops (ICME’15). 1--6

[7]

Lukas Bossard, Matthieu Guillaumin, and Luc Van Gool. 2014. Food-101--mining discriminative components with random forests. In Proceeding of the European Conference on Computer Vision--(ECCV’14). 446--461.

[8]

Yuji Matsuda, Hajime Hoashi, and Keiji Yanai. 2012. Recognition of multiple-food images by detecting candidate regions. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME’12). 25--30.

Digital Library

[9]

Yoshihiro Kawano and Katsuki Yanai. 2013. Real-time mobile food recognition system. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW’13). 1--7.

Digital Library

[10]

Weiyu Zhang, Qian Yu, Behjat Siddiquie, Ajay Divakaran, and Harpreet Sawhney. 2015. Food recognition and nutrition estimation on a smartphone. J. Diabetes Sci. Technol. 9, 3, 525--533.

[11]

Satoru Fujishige. 2005. Submodular Functions and Optimization, Vol. 58. Elsevier.

[12]

Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Cafe: Convolutional architecture for fast feature embedding. In Proceedings of the ACM International Conference on Multimedia. 675--678.

Digital Library

[13]

Laurence Awolsey. 1982. An analysis of the greedy algorithm for the submodular set covering problem. Combinatorica 2, 4, 385--393.

[14]

Fengqing Zhu, Marc Bosch, Nitin Khanna, and Carol J. Boushey. 2015. Multiple hypotheses image segmentation and classification with application to dietary assessment. IEEE J. Biomed. Health Informat. 19, 1, 377--389.

[15]

Hokuto Kagaya, Kiyoharu Aizawa, and Makoto Ogawa. 2014. Food detection and recognition using convolutional neural network. In Proceedings of the ACM International Conference on Multimedia, 1085--1088.

Digital Library

[16]

Kiyoharu Aizawa and Makoto Ogawa. 2015. FoodLog: Multimedia tool for healthcare applications. IEEE MultiMed. 22, 2, 4--9.

[17]

Sosuke Amano, Kiyoharu Aizawa, and Makoto Ogawa. 2015. Food category representatives: Extracting categories from meal names in food recordings and recipe data. In Proceedings of the IEEE International Conference on Multimedia Big Data. 48--55.

Digital Library

[18]

Colin Ware. 2008. Toward a perceptual theory of flow visualization. IEEE Comput. Graph. Appl. 28, 2 (2008), 6--11.

Digital Library

[19]

Xi-Jin Zhang, Yi-Fan Lu, and Song-Hai Zhang. 2016. Multi-task learning for food identification and analysis with deep convolutional neural networks. J. Comput. Sci. Technol. 31, 3, 489--500.

[20]

Morteza Akbari Fard, Hamed Haddadi, and Alireza Tavakoli Targhi. 2016. Fruits and vegetables calorie counter using, convolutional neural networks. In ACM Dig. Health, 121--122.

[21]

Ashutosh Singla, Lin Yuan, and Touradj Ebrahimi. 2016. Food/non-food image classification and food categorization using pre-trained googlenet model. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 3--11.

Digital Library

[22]

Wataru Shimoda Keiji Yanai. 2016. Foodness proposal for multiple food detection by training of single food images. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 13--21.

[23]

Joachim Dehais, Marios Anthimopoulos, and Stavroula Mougiakakou. 2016. Food image segmentation for dietary assessment. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 23--28.

Digital Library

[24]

A. Krizhevsky, I. Sutskever, and G. Hinton, 2012. Imagenet classification with deep convolutional neural networks. In Proceedings of Neural Information Processing Systems (NIPS’12).

[25]

M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean and G. E. Hinton. 2013. Onrectied linear units for speech processing. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP’13).

[26]

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. 1998. “Gradientbased learning applied to document recognition. Proc. IEEE, 86, 11: 2278--2324.

[27]

Parisa Pouladzadeh, Shervin Shirmohammadi, and Rana Almaghrabi. 2014. Measuring calorie and nutrition from food image. IEEE Trans. Instrument. Measure. 63, 8, 1947--1956.

[28]

Z. Su, Q. XU, M. Fei, and M. Dong. 2016. Game theoretic resource allocation in media cloud with mobile social users. IEEE Trans. Multimed. (TMM) 18, 8, 1650--1660.

Digital Library

Cited By

Sheng GMin WZhu XXu LSun QYang YWang LJiang S(2024)A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food RecognitionNutrients10.3390/nu1602020016:2(200)Online publication date: 8-Jan-2024
https://doi.org/10.3390/nu16020200
Yang YMin WSong JSheng GWang LJiang S(2024)Lightweight Food Recognition via Aggregation Block and Feature EncodingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/368028520:10(1-25)Online publication date: 22-Jul-2024
https://dl.acm.org/doi/10.1145/3680285
Chhikara PChaurasia DJiang YMasur OIlievski F(2024)FIRE: Food Image to REcipe generation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00800(8169-8179)Online publication date: 3-Jan-2024
https://doi.org/10.1109/WACV57701.2024.00800
Show More Cited By

Index Terms

Mobile Multi-Food Recognition Using Deep Learning
1. Computer systems organization
  1. Real-time systems
    1. Real-time operating systems

Recommendations

Food Detection and Recognition Using Convolutional Neural Network
MM '14: Proceedings of the 22nd ACM international conference on Multimedia

In this paper, we apply a convolutional neural network (CNN) to the tasks of detecting and recognizing food images. Because of the wide diversity of types of food, image recognition of food items is generally very difficult. However, deep learning has ...
Food/Non-food Image Classification and Food Categorization using Pre-Trained GoogLeNet Model
MADiMa '16: Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management

Recent past has seen a lot of developments in the field of image-based dietary assessment. Food image classification and recognition are crucial steps for dietary assessment. In the last couple of years, advancements in the deep learning and ...
Smartphone-based food recognition system using multiple deep CNN models
Abstract
People with blindness or low vision utilize mobile assistive tools for various applications such as object recognition, text recognition, etc. Most of the available applications are focused on recognizing generic objects. And they have not ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 13, Issue 3s

Special Section on Deep Learning for Mobile Multimedia and Special Section on Best Papers from ACM MMSys/NOSSDAV 2016

August 2017

258 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/3119899

Editor:
Alberto Del Bimbo
University of Firenze, Italy

Issue’s Table of Contents

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 August 2017

Accepted: 01 March 2017

Revised: 01 February 2017

Received: 01 October 2016

Published in TOMM Volume 13, Issue 3s

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

52
Total Citations
View Citations
1,445
Total Downloads

Downloads (Last 12 months)99
Downloads (Last 6 weeks)12

Reflects downloads up to 28 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Sheng GMin WZhu XXu LSun QYang YWang LJiang S(2024)A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food RecognitionNutrients10.3390/nu1602020016:2(200)Online publication date: 8-Jan-2024
https://doi.org/10.3390/nu16020200
Yang YMin WSong JSheng GWang LJiang S(2024)Lightweight Food Recognition via Aggregation Block and Feature EncodingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/368028520:10(1-25)Online publication date: 22-Jul-2024
https://dl.acm.org/doi/10.1145/3680285
Chhikara PChaurasia DJiang YMasur OIlievski F(2024)FIRE: Food Image to REcipe generation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00800(8169-8179)Online publication date: 3-Jan-2024
https://doi.org/10.1109/WACV57701.2024.00800
Sheng GMin WYao TSong JYang YWang LJiang S(2024)Lightweight Food Image Recognition With Global Shuffle ConvolutionIEEE Transactions on AgriFood Electronics10.1109/TAFE.2024.33867132:2(392-402)Online publication date: Sep-2024
https://doi.org/10.1109/TAFE.2024.3386713
Gonzalez BGarcia GGecele ORamirez JVelastin SFarias G(2024)Preliminary results on food weight estimation with RGB-D images2024 14th International Conference on Pattern Recognition Systems (ICPRS)10.1109/ICPRS62101.2024.10677821(1-7)Online publication date: 15-Jul-2024
https://doi.org/10.1109/ICPRS62101.2024.10677821
Soraganvi ABembde MTodakari NBhimartwar H(2024)HealthMate: A Comprehensive Mobile Application for Personalized Health and Fitness Management2024 IEEE 3rd World Conference on Applied Intelligence and Computing (AIC)10.1109/AIC61668.2024.10730864(1304-1309)Online publication date: 27-Jul-2024
https://doi.org/10.1109/AIC61668.2024.10730864
Mujagić AMujagić AMehanović D(2024)Food Recognition and Segmentation Using Detectron2 FrameworkAdvanced Technologies, Systems, and Applications IX10.1007/978-3-031-71694-2_30(409-419)Online publication date: 1-Oct-2024
https://doi.org/10.1007/978-3-031-71694-2_30
Alahmadi TRahman AAlkahtani HKholidy H(2023)Enhancing Object Detection for VIPs Using YOLOv4_Resnet101 and Text-to-Speech Conversion ModelMultimodal Technologies and Interaction10.3390/mti70800777:8(77)Online publication date: 2-Aug-2023
https://doi.org/10.3390/mti7080077
Nadeem MShen HChoy LBarakat J(2023)Smart Diet Diary: Real-Time Mobile Application for Food RecognitionApplied System Innovation10.3390/asi60200536:2(53)Online publication date: 20-Apr-2023
https://doi.org/10.3390/asi6020053
Yen TVengusamy SCaraffini FKuhn SColreavy-Donnelly S(2023)Image Processing Model to Estimate Nutritional Values in Raw and Cooked Vegetables2023 34th Conference of Open Innovations Association (FRUCT)10.23919/FRUCT60429.2023.10328158(183-191)Online publication date: 15-Nov-2023
https://doi.org/10.23919/FRUCT60429.2023.10328158
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents