Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3576842.3582385acmconferencesArticle/Chapter ViewAbstractPublication PagesiotdiConference Proceedingsconference-collections
research-article

REACT: Streaming Video Analytics On The Edge With Asynchronous Cloud Support

Published: 09 May 2023 Publication History

Abstract

Emerging Internet of Things (IoT) and mobile computing applications are expected to support latency-sensitive deep neural network (DNN) workloads. To realize this vision, the Internet is evolving towards an edge-computing architecture, where computing infrastructure is located closer to the end device to help achieve low latency. However, edge computing may have limited resources compared to cloud environments and thus, cannot run large DNN models that often have high accuracy. In this work, we develop REACT, a framework that leverages cloud resources to execute large DNN models with higher accuracy to improve the accuracy of models running on edge devices. To do so, we propose a novel edge-cloud fusion algorithm that fuses edge and cloud predictions, achieving low latency and high accuracy. We extensively evaluate our approach and show that our approach can significantly improve the accuracy compared to baseline approaches. We focus specifically on object detection in videos (applicable in many video analytics scenarios) and show that the fused edge-cloud predictions can outperform the accuracy of edge-only and cloud-only scenarios by as much as 50%. REACT shows that for Edge AI, the choice between offloading and on-device inference is not binary — redundant execution at cloud and edge locations complement each other when carefully employed.

References

[1]
Ganesh Ananthanarayanan, Victor Bahl, Landon Cox, Alex Crown, Shadi Nogbahi, and Yuanchao Shu. 2019. Demo: Video Analytics-Killer App for Edge Computing. In Proc. ACM MobiSys.
[2]
Kittipat Apicharttrisorn, Xukan Ran, Jiasi Chen, Srikanth V Krishnamurthy, and Amit K Roy-Chowdhury. 2019. Frugal following: Power thrifty object detection and tracking for mobile augmented reality. In Proc. SenSys. 96–109.
[3]
Ashwin Ashok, Peter Steenkiste, and Fan Bai. 2015. Enabling vehicular applications using cloud services through adaptive computation offloading. In Proceedings of the 6th International Workshop on Mobile Cloud Computing and Services. 1–7.
[4]
Mohammad Farhadi Bajestani and Yezhou Yang. 2020. TKD: Temporal Knowledge Distillation for Active Perception. In Proc. WACV. 953–962.
[5]
Ravi Bhandari, Akshay Uttama Nambi, Venkata N Padmanabhan, and Bhaskaran Raman. 2018. DeepLane: camera-assisted GPS for driving lane detection. In Proc. BuildSys. 73–82.
[6]
Erik Bochinski, Volker Eiselein, and Thomas Sikora. 2017. High-speed tracking-by-detection without using image information. In 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, 1–6.
[7]
Daniel Bolya, Sean Foley, James Hays, and Judy Hoffman. 2020. Tide: A general toolbox for identifying object detection errors. In Proc. ECCV.
[8]
Rainer E Burkard and Ulrich Derigs. 1980. The linear sum assignment problem. In Assignment and Matching Problems: Solution Methods with FORTRAN-Programs. Springer, 1–15.
[9]
Zhengping Che, Guangyu Li, Tracy Li, Bo Jiang, Xuefeng Shi, Xinsheng Zhang, Ying Lu, Guobin Wu, Yan Liu, and Jieping Ye. 2019. D2-City: A Large-Scale Dashcam Video Dataset of Diverse Traffic Scenarios. arXiv preprint arXiv:1904.01975 (2019).
[10]
Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, and Dahua Lin. 2019. MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155 (2019).
[11]
Tiffany Yu-Han Chen, Lenin Ravindranath, Shuo Deng, Paramvir Bahl, and Hari Balakrishnan. 2015. Glimpse: Continuous, real-time object recognition on mobile devices. In Proc. SenSys. 155–168.
[12]
Byung-Gon Chun, Sunghwan Ihm, Petros Maniatis, Mayur Naik, and Ashwin Patti. 2011. Clonecloud: elastic execution between mobile device and cloud. In Proceedings of the sixth conference on Computer systems. 301–314.
[13]
Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. 2010. The pascal visual object classes (voc) challenge. IJCV 88, 2 (2010).
[14]
Anurag Ghosh, Akshay Nambi, Aditya Singh, Harish YVS, and Tanuja Ganu. 2021. Adaptive streaming perception using deep reinforcement learning. arXiv preprint arXiv:2106.05665 (2021).
[15]
Google. 2020. Google Coral USB Accelerator. https://coral.ai/products/accelerator.
[16]
Song Han, Huizi Mao, and William J Dally. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015).
[17]
Jonatan Heyman, Carl Byström, Joakim Hamrén, and Hugo Heyman. 2020. Locust: An Open Source Load Testing Tool. https://locust.io/
[18]
Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, 2017. Speed/accuracy trade-offs for modern convolutional object detectors. In Proc. CVPR.
[19]
Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, and Yoshua Bengio. 2017. Quantized neural networks: Training neural networks with low precision weights and activations. The Journal of Machine Learning Research 18, 1 (2017), 6869–6898.
[20]
Intel. 2020. Intel Neural Compute Stick 2. https://software.intel.com/en-us/neural-compute-stick.
[21]
Srinivasan Iyengar, Ravi Raj Saxena, Joydeep Pal, Bhawana Chhaglani, Anurag Ghosh, Venkata N Padmanabhan, and Prabhakar T Venkata. 2021. Holistic energy awareness for intelligent drones. In Proc. BuildSys.
[22]
Junchen Jiang, Ganesh Ananthanarayanan, Peter Bodik, Siddhartha Sen, and Ion Stoica. 2018. Chameleon: scalable adaptation of video analytics. In Proc. SIGCOMM. 253–266.
[23]
Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, and Lingjia Tang. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Computer Architecture News 45, 1 (2017), 615–629.
[24]
Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1-2 (1955), 83–97.
[25]
Mengtian Li, Yu-Xiong Wang, and Deva Ramanan. 2020. Towards Streaming Image Understanding. arXiv preprint arXiv:2005.10420 (2020).
[26]
Yuanqi Li, Arthi Padmanabhan, Pengzhan Zhao, Yufei Wang, Guoqing Harry Xu, and Ravi Netravali. 2020. Reducto: On-Camera Filtering for Resource-Efficient Real-Time Video Analytics. In Proc. SIGCOMM.
[27]
Robert LiKamWa, Yunhui Hou, Julian Gao, Mia Polansky, and Lin Zhong. 2016. RedEye: analog ConvNet image sensor architecture for continuous mobile vision. ACM SIGARCH Computer Architecture News 44, 3 (2016).
[28]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proc. ICCV.
[29]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Proc. ECCV.
[30]
Luyang Liu, Hongyu Li, and Marco Gruteser. 2019. Edge assisted real-time object detection for mobile augmented reality. In Proc. MobiCom. 1–16.
[31]
Alan Lukezic, Tomas Vojir, Luka Cehovin Zajc, Jiri Matas, and Matej Kristan. 2017. Discriminative correlation filter with channel and spatial reliability. In Proc. CVPR.
[32]
Ravi Netravali, Anirudh Sivaraman, Somak Das, Ameesh Goyal, Keith Winstein, James Mickens, and Hari Balakrishnan. 2015. Mahimahi: Accurate record-and-replay for { HTTP}. In USENIX ATC. 417–429.
[33]
Nvidia. 2020. Meet Jetson, the Platform for AI at the Edge.https://developer.nvidia.com/embedded-computing.
[34]
Xukan Ran, Haolianz Chen, Xiaodan Zhu, Zhenming Liu, and Jiasi Chen. 2018. Deepdecision: A mobile deep learning framework for edge video analytics. In Proc. INFOCOM.
[35]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proc. CVPR. 779–788.
[36]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Proc. NeurIPS. 91–99.
[37]
Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. 2018. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proc. CVPR.
[38]
Mahadev Satyanarayanan, Paramvir Bahl, Ramón Caceres, and Nigel Davies. 2009. The case for vm-based cloudlets in mobile computing. IEEE pervasive Computing 8, 4 (2009), 14–23.
[39]
Xiaofan Zhang, Haoming Lu, Cong Hao, Jiachen Li, Bowen Cheng, Yuhong Li, Kyle Rupnow, Jinjun Xiong, Thomas Huang, Honghui Shi, 2020. Skynet: a hardware-efficient method for object detection and tracking on embedded systems. In Proc. MLSys.
[40]
Huajun Zhou, Zechao Li, Chengcheng Ning, and Jinhui Tang. 2017. Cad: Scale invariant framework for real-time object detection. In Proc. ICCV Workshops.
[41]
Xingyi Zhou, Dequan Wang, and Philipp Krähenbühl. 2019. Objects as points. arXiv preprint arXiv:1904.07850 (2019).
[42]
Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Qinghua Hu, and Haibin Ling. 2020. Vision Meets Drones: Past, Present and Future. arXiv preprint arXiv:2001.06303 (2020).
[43]
Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Haibin Ling, Qinghua Hu, Haotian Wu, Qinqin Nie, Hao Cheng, Chenfeng Liu, 2018. Visdrone-vdt2018: The vision meets drone video detection and tracking challenge results. In Proc. ECCV Workshops.

Cited By

View all
  • (2024)FrameFeedback: A Closed-Loop Control System for Dynamic Offloading Real-Time Edge Inference2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW63119.2024.00116(584-591)Online publication date: 27-May-2024
  • (2024)VATE: Edge-Cloud System for Object Detection in Real-Time Video Streams2024 IEEE 8th International Conference on Fog and Edge Computing (ICFEC)10.1109/ICFEC61590.2024.00017(27-34)Online publication date: 6-May-2024
  • (2023)ChanakyaProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668551(55668-55680)Online publication date: 10-Dec-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
IoTDI '23: Proceedings of the 8th ACM/IEEE Conference on Internet of Things Design and Implementation
May 2023
514 pages
ISBN:9798400700378
DOI:10.1145/3576842
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 May 2023

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

IoTDI '23
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)116
  • Downloads (Last 6 weeks)8
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)FrameFeedback: A Closed-Loop Control System for Dynamic Offloading Real-Time Edge Inference2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW63119.2024.00116(584-591)Online publication date: 27-May-2024
  • (2024)VATE: Edge-Cloud System for Object Detection in Real-Time Video Streams2024 IEEE 8th International Conference on Fog and Edge Computing (ICFEC)10.1109/ICFEC61590.2024.00017(27-34)Online publication date: 6-May-2024
  • (2023)ChanakyaProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668551(55668-55680)Online publication date: 10-Dec-2023
  • (2023)Internet-of-Things Edge Computing Systems for Streaming Video Analytics: Trails Behind and the Paths AheadIoT10.3390/iot40400214:4(486-513)Online publication date: 24-Oct-2023
  • (2023)CWASI: A WebAssembly Runtime Shim for Inter-function Communication in the Serverless Edge-Cloud ContinuumProceedings of the Eighth ACM/IEEE Symposium on Edge Computing10.1145/3583740.3626611(158-170)Online publication date: 6-Dec-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media