Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Adaptive Model Scheduling for Resource-efficient Data Labeling

Published: 08 January 2022 Publication History

Abstract

Labeling data (e.g., labeling the people, objects, actions, and scene in images) comprehensively and efficiently is a widely needed but challenging task. Numerous models were proposed to label various data and many approaches were designed to enhance the ability of deep learning models or accelerate them. Unfortunately, a single machine-learning model is not powerful enough to extract various semantic information from data. Given certain applications, such as image retrieval platforms and photo album management apps, it is often required to execute a collection of models to obtain sufficient labels. With limited computing resources and stringent delay, given a data stream and a collection of applicable resource-hungry deep-learning models, we design a novel approach to adaptively schedule a subset of these models to execute on each data item, aiming to maximize the value of the model output (e.g., the number of high-confidence labels). Achieving this lofty goal is nontrivial since a model’s output on any data item is content-dependent and unknown until we execute it. To tackle this, we propose an Adaptive Model Scheduling framework, consisting of (1) a deep reinforcement learning-based approach to predict the value of unexecuted models by mining semantic relationship among diverse models, and (2) two heuristic algorithms to adaptively schedule the model execution order under a deadline or deadline-memory constraints, respectively. The proposed framework does not require any prior knowledge of the data, which works as a powerful complement to existing model optimization technologies. We conduct extensive evaluations on five diverse image datasets and 30 popular image labeling models to demonstrate the effectiveness of our design: our design could save around 53% execution time without loss of any valuable labels.

References

[1]
Brandon Amos, Bartosz Ludwiczuk, and Mahadev Satyanarayanan. 2016. OpenFace: A general-purpose face recognition library with mobile applications. Technical Report. CMU-CS-16-118, CMU School of Computer Science.
[2]
Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2017. OpenPose: Realtime multi-person 2D pose estimation using part Affinity fields. In Proceedings of The IEEE Conference on Computer Vision and Pattern Recognition. 7291–7299.
[3]
Joao Carreira and Andrew Zisserman. 2017. Quo vadis, action recognition? A new model and the kinetics dataset. In Proceedings of the IEEE CVPR. 6299–6308.
[4]
Wenlin Chen, James Wilson, Stephen Tyree, Kilian Q. Weinberger, and Yixin Chen. 2016. Compressing convolutional neural networks in the frequency domain. In Proceedings of the 22nd ACM SIGKDD. ACM, 1475–1484.
[5]
Marco Corazza and Andrea Sangalli. 2015. Q-Learning and SARSA: A comparison between two intelligent stochastic control approaches for financial trading. Research Paper Series No. 15. University Ca’Foscari of Venice, Dept. of Economics.
[6]
Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, and Andrew Zisserman. 2010. The pascal visual object classes (voc) challenge. International Journal of Computer Vision 88, 2 (2010), 303–338.
[7]
Daniel Golovin and Andreas Krause. 2011. Adaptive submodularity: Theory and applications in active learning and stochastic optimization. Journal of Artificial Intelligence Research 42 (2011), 427–486.
[8]
Ian J. Goodfellow, David Warde-Farley, Pascal Lamblin, Vincent Dumoulin, Mehdi Mirza, Razvan Pascanu, James Bergstra, Frédéric Bastien, and Yoshua Bengio. 2013. Pylearn2: A machine learning research library. arXiv preprint arXiv:1308.4214 (2013).
[9]
Song Han, Jeff Pool, John Tran, and William Dally. 2015. Learning both weights and connections for efficient neural network. In Proceedings of the Advances in Neural Information Processing Systems. 1135–1143.
[10]
Xin Han, Kazuo Iwama, and Guochuan Zhang. 2008. On two dimensional orthogonal knapsack problem. arXiv preprint arXiv:0803.4260 (2008).
[11]
Mark J. Huiskes and Michael S. Lew. 2008. The MIR flickr retrieval evaluation. In Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval. ACM, 39–43.
[12]
Junchen Jiang, Ganesh Ananthanarayanan, Peter Bodik, Siddhartha Sen, and Ion Stoica. 2018. Chameleon: Scalable adaptation of video analytics. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication. ACM, 253–266.
[13]
Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, and Jakob Uszkoreit. 2017. One model to learn them all. arXiv preprint arXiv:1706.05137 (2017).
[14]
Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao, and Li Fei-Fei. 2011. Novel dataset for fine-grained image categorization. In Proceedings of the 1st Workshop on Fine-Grained Visual Categorization.
[15]
Jens Kober, J. Andrew Bagnell, and Jan Peters. 2013. Reinforcement learning in robotics: A survey. The International Journal of Robotics Research 32, 11 (2013), 1238–1274.
[16]
Andreas Krause and Daniel Golovin. 2011. Submodular function maximization. In Tractability: Practical Approaches to Hard Problems, Lucas Bordeaux, Youssef Hamadi, and Pushmeet Kohli (Eds.). Cambridge University Press, 71–104. DOI:https://doi.org/10.1017/CBO9781139177801.004
[17]
Tong Li, Sheng Gao, and Yajing Xu. 2017. Deep multi-similarity hashing for multi-label image retrieval. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. ACM, 2159–2162.
[18]
Xiang-Yang Li, Jianwei Qian, and Xiaoyang Wang. 2018. Can China lead the development of data trading and sharing markets? Communications of the ACM 61, 11 (2018), 50–51.
[19]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Proceedings of the 2014 European Conference on Computer Vision. Springer, 740–755.
[20]
Jiaqi Ma, Zhe Zhao, Xinyang Yi, Jilin Chen, Lichan Hong, and Ed H. Chi. 2018. Modeling task relationships in multi-task learning with multi-gate mixture-of-experts. In Proceedings of the 24th ACM SIGKDD. ACM, 1930–1939.
[21]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013).
[22]
Joseph Redmon and Ali Farhadi. 2018. YOLOv3: An incremental improvement. arXiv (2018).
[23]
Sebastian Ruder. 2017. An overview of multi-task learning in deep neural networks. arXiv preprint arXiv:1706.05098 (2017).
[24]
Tomas Simon, Hanbyul Joo, Iain Matthews, and Yaser Sheikh. 2017. Hand keypoint detection in single images using multiview bootstrapping. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[25]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[26]
Maxim Sviridenko. 2004. A note on maximizing a submodular set function subject to a knapsack constraint. Operations Research Letters 32, 1 (2004), 41–43.
[27]
Hado Van Hasselt, Arthur Guez, and David Silver. 2016. Deep reinforcement learning with double q-learning. In Proceedings of the 13th AAAI Conference on Artificial Intelligenc.
[28]
Ziyu Wang, Tom Schaul, Matteo Hessel, Hado Van Hasselt, Marc Lanctot, and Nando De Freitas. 2016. Dueling network architectures for deep reinforcement learning. In Proceedings of the 33rd International Conference on International Conference on Machine Learning, Vol. 48. 1995–2003.
[29]
Bichen Wu, Forrest N. Iandola, Peter H. Jin, and Kurt Keutzer. 2017. SqueezeDet: Unified, small, low power fully convolutional neural networks for real-time object detection for autonomous driving. In Proceedings of the CVPR Workshops. 446–454.
[30]
Yuliang Xiu, Jiefeng Li, Haoyu Wang, Yinghong Fang, and Cewu Lu. 2018. Pose Flow: Efficient online pose tracking. In Proceedings of the British Machine Vision Conference.
[31]
Donna Xu, Yaxin Shi, Ivor W. Tsang, Yew-Soon Ong, Chen Gong, and Xiaobo Shen. 2020. A survey on multi-output learning. IEEE Transactions on Neural Networks and Learning Systems 31, 7 (2020), 2409–2429. DOI:https://doi.org/10.1109/TNNLS.2019.2945133
[32]
Yang Yang, Yi-Feng Wu, De-Chuan Zhan, Zhi-Bin Liu, and Yuan Jiang. 2018. Complex object classification: A multi-modal multi-instance multi-label deep network with optimal transport. In Proceedings of the 24th ACM SIGKDD. ACM, 2594–2603.
[33]
Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas Guibas, and Li Fei-Fei. 2011. Human action recognition by learning bases of action attributes and parts. In Proceedings of the International Conference on Computer Vision. IEEE, 1331–1338.
[34]
Mu Yuan, Lan Zhang, Xiang-Yang Li, and Hui Xiong. 2020. Comprehensive and efficient data labeling via adaptive model scheduling. In Proceedings of the 2020 IEEE 36th International Conference on Data Engineering (ICDE’20). 1858–1861. https://doi.org/10.1109/ICDE48307.2020.00188
[35]
Ji Zhang, Yu Liu, Ke Zhou, Guoliang Li, Zhili Xiao, Bin Cheng, Jiashu Xing, Yangtao Wang, Tianheng Cheng, Li Liu, et al. 2019. An end-to-end automatic cloud database tuning system using deep reinforcement learning. In Proceedings of the 2019 International Conference on Management of Data. ACM, 415–432. DOI:https://doi.org/10.1145/3299869.3300085
[36]
Zheng Zhang, Qin Zou, Qian Wang, Yuewei Lin, and Qingquan Li. 2018. Instance similarity deep hashing for multi-label image retrieval. arXiv preprint arXiv:1803.02987 (2018).
[37]
Fang Zhao, Yongzhen Huang, Liang Wang, and Tieniu Tan. 2015. Deep semantic ranking based hashing for multi-label image retrieval. In Proceedings of the IEEE CVPR. 1556–1564.
[38]
Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba. 2017. Places: A 10 million Image Database for Scene Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 6 (2017), 1452–1464.

Cited By

View all
  • (2024)Data-Driven Causal Effect Estimation Based on Graphical Causal Modelling: A SurveyACM Computing Surveys10.1145/363642356:5(1-37)Online publication date: 12-Jan-2024
  • (2023)Instrumental Variable-Driven Domain Generalization with Unobserved ConfoundersACM Transactions on Knowledge Discovery from Data10.1145/359538017:8(1-21)Online publication date: 28-Jun-2023
  • (2023)Kernel Fisher Dictionary Transfer LearningACM Transactions on Knowledge Discovery from Data10.1145/358857517:8(1-17)Online publication date: 12-May-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 16, Issue 4
August 2022
529 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3505210
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 January 2022
Accepted: 01 October 2021
Received: 01 February 2021
Published in TKDD Volume 16, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Model scheduling
  2. reinforcement learning
  3. data labeling

Qualifiers

  • Research-article
  • Refereed

Funding Sources

  • National Key R&D Program of China
  • China National Natural Science Foundation
  • Key Research Program of Frontier Sciences, CAS

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)117
  • Downloads (Last 6 weeks)12
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Data-Driven Causal Effect Estimation Based on Graphical Causal Modelling: A SurveyACM Computing Surveys10.1145/363642356:5(1-37)Online publication date: 12-Jan-2024
  • (2023)Instrumental Variable-Driven Domain Generalization with Unobserved ConfoundersACM Transactions on Knowledge Discovery from Data10.1145/359538017:8(1-21)Online publication date: 28-Jun-2023
  • (2023)Kernel Fisher Dictionary Transfer LearningACM Transactions on Knowledge Discovery from Data10.1145/358857517:8(1-17)Online publication date: 12-May-2023
  • (2023)Enhancing Recommendation with Search Data in a Causal Learning MannerACM Transactions on Information Systems10.1145/358242541:4(1-31)Online publication date: 8-Apr-2023
  • (2023)InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric InferenceIEEE Transactions on Mobile Computing10.1109/TMC.2023.327598123:5(3523-3538)Online publication date: 15-May-2023
  • (2023)Efficient Deep Ensemble Inference via Query Difficulty-dependent Task Scheduling2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00082(1005-1018)Online publication date: Apr-2023
  • (2022)Quality-aided Annotation Service Selection in MLaaS Market2022 IEEE/ACM 30th International Symposium on Quality of Service (IWQoS)10.1109/IWQoS54832.2022.9812877(1-11)Online publication date: 10-Jun-2022
  • (2022)COCA: Cost-Effective Collaborative Annotation System by Combining Experts and Amateurs2022 IEEE 38th International Conference on Data Engineering (ICDE)10.1109/ICDE53745.2022.00055(674-685)Online publication date: May-2022

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media