research-article

Learning from the Past: Fast NAS for Tasks and Datasets

Author:

Ming Cheung

ACM Transactions on Multimedia Computing, Communications and Applications, Volume 20, Issue 3

Article No.: 70, Pages 1 - 18

https://doi.org/10.1145/3618000

Published: 23 October 2023

Abstract

Many retail companies rely on in-house data science teams to build models for machine learning tasks such as user segmentation and item price prediction. These teams typically use a trial-and-error process to obtain a good model for a given dataset and machine learning task, which is time-consuming and requires expertise. However, a team may already have built models for other tasks on different datasets. This article proposes a framework that obtains a model architecture from previously solved machine learning tasks and datasets. An analysis of real datasets with over 70,000 images from 11 online retail e-commerce websites demonstrates that the performance of a model is related to the similarity among datasets, models, and machine learning tasks. The proposed framework therefore uses these similarities to obtain the model. In predicting the attributes of fashion images, the resulting model was shown to be 26.6% better in accuracy while using only 20% of the runtime, compared with Auto-Keras, an automated neural architecture search library. To the best of our knowledge, this is the first article to obtain the best model based on the similarity among machine learning tasks, models, and datasets.
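The idea of reusing previously solved tasks can be illustrated with a minimal sketch. The abstract does not specify how similarity is measured, so everything below is an assumption made for illustration: the `SolvedTask` record, the fixed-length dataset embedding, the use of cosine similarity, and the architecture names are all hypothetical and are not the article's actual method.

```python
import math
from dataclasses import dataclass


@dataclass
class SolvedTask:
    """A previously solved task (hypothetical record, for illustration only)."""
    name: str
    dataset_embedding: list[float]  # assumed fixed-length summary of the dataset
    architecture: str               # label of the architecture that worked well


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(new_embedding: list[float], history: list[SolvedTask]):
    """Rank past tasks by similarity of their dataset to the new dataset."""
    scored = [(cosine_similarity(new_embedding, t.dataset_embedding), t) for t in history]
    # Sort on the score only, so ties never try to compare SolvedTask objects.
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


def pick_architecture(new_embedding: list[float], history: list[SolvedTask]) -> str:
    """Reuse the architecture of the most similar previously solved task."""
    if not history:
        raise ValueError("no previously solved tasks to learn from")
    return rank_candidates(new_embedding, history)[0][1].architecture


# Hypothetical history of solved tasks and a new dataset embedding.
history = [
    SolvedTask("sleeve-length", [0.9, 0.1], "arch-A"),
    SolvedTask("price-bucket", [0.0, 1.0], "arch-B"),
    SolvedTask("colour", [0.5, 0.5], "arch-C"),
]
chosen = pick_architecture([1.0, 0.0], history)
```

In this sketch the chosen architecture would serve only as a starting point. Skipping a search from scratch in this way is one plausible reading of how reusing past work could reduce runtime.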

Index Terms

1. Computing methodologies
  1. Machine learning
    1. Learning settings

Information

Published In

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 20, Issue 3

March 2024

665 pages

EISSN:1551-6865

DOI:10.1145/3613614

Editor:
Abdulmotaleb El Saddik
Mohamed Bin Zayed University of Artificial Intelligence, UAE and University of Ottawa, Canada

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 October 2023

Online AM: 01 September 2023

Accepted: 23 August 2023

Revised: 29 July 2023

Received: 13 February 2023

Published in TOMM Volume 20, Issue 3
