Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3580305.3599277acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Free access

Cognitive Evolutionary Search to Select Feature Interactions for Click-Through Rate Prediction

Published: 04 August 2023 Publication History

Abstract

Click-Through Rate (CTR) prediction of intelligent marketing systems is of great importance, in which feature interaction selection plays a key role. Most approaches model interactions of features by the same pre-defined operation under expert guidance, among which improper interactions may bring unnecessary noise and complicate the training process. To that end, in this paper, we aim to adaptively evolve the model to select proper operations to interact on feature pairs under task guidance. Inspired by natural evolution, we propose a general Cognitive EvoLutionary Search (CELS) framework, where cognitive ability refers to the malleability of organisms to orientate to the environment. Specifically, we conceptualize interactions as genomes, models as organisms, and tasks as natural environments. Mirroring how genetic malleability develops environmental adaptability, we thus diagnose the fitness of models to simulate the survival rates of organisms for natural selection, thereby an evolution path can be planned and visualized, offering an intuitive interpretation of the mechanisms underlying interaction modeling and selection. Based on the CELS framework, we develop four instantiations including individual-based search and population-based search. We demonstrate how individual mutation and population crossover enable CELS to evolve into diverse models suitable for various tasks and data, providing ready-to-use models. Extensive experiments on real-world datasets demonstrate that CELS significantly outperforms state-of-the-art approaches.

Supplementary Material

MP4 File (rtfp0687-2min-promo.mp4)
Presentation video - short version Predicting Click-Through Rates for intelligent marketing systems holds immense significance in today's digitally oriented business landscape. Notably challenging is the task of selecting feature interactions, a process often entangled with complexity and the demand for specialized knowledge. In response to these challenges, a new method called CELS - short for Cognitive Evolutionary Search - has been proposed. CELS conceptualizes feature interactions as genomes, models as organisms, and tasks as natural environments. By integrating cognitive science principles, CELS reflects the process of genetic malleability developing environmental adaptability. CELS simulates survival rates for natural selection, assessing the fitness of models. In doing so, CELS plans and visualizes an evolution path, offering an intuitive interpretation of the mechanisms underpinning feature interaction modeling and selection.

References

[1]
Thomas Back. 1996. Evolutionary algorithms in theory and practice: evolution strategies, evolutionary programming, genetic algorithms. Oxford University Press.
[2]
H. Beyer. 2007. Evolution strategies. Scholarpedia, Vol. 2, 8 (2007), 1965. https://doi.org/10.4249/scholarpedia.1965 revision #193589.
[3]
Hans-Georg Beyer and Hans-Paul Schwefel. 2002. Evolution strategies--a comprehensive introduction. Natural Computing, Vol. 1 (2002), 3--52.
[4]
Scott L Boyar, Grant T Savage, and Eric S Williams. 2022. An adaptive leadership approach: The impact of reasoning and emotional intelligence (EI) abilities on leader adaptability. Employee Responsibilities and Rights Journal (2022), 1--16.
[5]
Shih-Kang Chao and Guang Cheng. 2019. A generalization of regularized dual averaging and its dynamics. arXiv preprint arXiv:1909.10072 (2019).
[6]
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, et al. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 7--10.
[7]
Manoranjan Dash and Huan Liu. 1997. Feature selection for classification. Intelligent Data Analysis, Vol. 1, 1--4 (1997), 131--156.
[8]
Yanyan Dong, Jie Hou, Ning Zhang, and Maocong Zhang. 2020. Research on how human intelligence, consciousness, and cognitive computing affect the development of artificial intelligence. Complexity, Vol. 2020 (2020), 1--10.
[9]
Dennis Garlick. 2002. Understanding the nature of the general factor of intelligence: the role of individual differences in neural plasticity as an explanatory mechanism. Psychological Review, Vol. 109, 1 (2002), 116.
[10]
Huifeng Guo, Bo Chen, Ruiming Tang, Weinan Zhang, Zhenguo Li, and Xiuqiang He. 2021a. An embedding learning framework for numerical features in ctr prediction. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD). 2910--2918.
[11]
Huifeng Guo, Ruiming Tang, et al. 2017. DeepFM: a factorization-machine based neural network for CTR prediction. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI). 1725--1731.
[12]
Wei Guo, Rong Su, Renhao Tan, Huifeng Guo, Yingxue Zhang, Zhirong Liu, Ruiming Tang, and Xiuqiang He. 2021b. Dual graph enhanced embedding neural network for ctr prediction. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD). 496--504.
[13]
Angela Gutchess. 2014. Plasticity of the aging brain: new directions in cognitive neuroscience. Science, Vol. 346, 6209 (2014), 579--582.
[14]
Isabelle Guyon and André Elisseeff. 2003. An introduction to variable and feature selection. Journal of Machine Learning Research, Vol. 3, Mar (2003), 1157--1182.
[15]
Jun He and Xin Yao. 2002. From an individual to a population: An analysis of the first hitting time of population-based evolutionary algorithms. IEEE Transactions on Evolutionary Computation, Vol. 6, 5 (2002), 495--511.
[16]
Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR). 355--364.
[17]
William Huitt and John Hummel. 2003. Piaget's theory of cognitive development. Educational Psychology Interactive, Vol. 3, 2 (2003), 1--5.
[18]
William E Hyland et al. 2022. Interest--ability profiles: An integrative approach to knowledge acquisition. Journal of Intelligence, Vol. 10, 3 (2022), 43.
[19]
Yuchin Juan, Yong Zhuang, Wei-Sheng Chin, and Chih-Jen Lin. 2016. Field-aware factorization machines for CTR prediction. In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys). 43--50.
[20]
Chia-Feng Juang, Ching-Yu Chou, and Chin-Teng Lin. 2022. Navigation of a fuzzy-controlled wheeled robot through the combination of expert knowledge and data-driven multiobjective evolutionary learning. IEEE Transactions on Cybernetics, Vol. 52, 8 (2022), 7388--7401.
[21]
Kenneth S Kendler, Eric Turkheimer, Henrik Ohlsson, Jan Sundquist, and Kristina Sundquist. 2015. Family environment and the malleability of cognitive ability: A Swedish national home-reared and adopted-away cosibling control study. Proceedings of the National Academy of Sciences (PNAS), Vol. 112, 15 (2015), 4612--4617.
[22]
Farhan Khawar, Xu Hang, Ruiming Tang, Bin Liu, Zhenguo Li, and Xiuqiang He. 2020. Autofeature: Searching for feature interactions and their architectures for click-through rate prediction. In ACM International Conference on Information and Knowledge Management (CIKM). 625--634.
[23]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[24]
Ron Kohavi and George H John. 1997. Wrappers for feature subset selection. Artificial Intelligence, Vol. 97, 1--2 (1997), 273--324.
[25]
V Lee and A Thornton. 2021. Animal cognition in an urbanised world. Frontiers in Ecology and Evolution, Vol. 9 (2021).
[26]
Pan Li et al. 2021. Dual attentive sequential learning for cross-domain click-through rate prediction. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD). 3172--3180.
[27]
Xiaoping Li, Yadi Wang, and Rubén Ruiz. 2022. A survey on sparse learning models for feature selection. IEEE transactions on Cybernetics (2022), 1642--1660.
[28]
Jianxun Lian, Xiaohuan Zhou, Fuzheng Zhang, Zhongxia Chen, Xing Xie, and Guangzhong Sun. 2018. xdeepfm: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the 24th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD). 1754--1763.
[29]
Jung-Yi Lin, Hao-Ren Ke, Been-Chian Chien, and Wei-Pang Yang. 2008. Classifier design with feature selection and feature extraction using layered genetic programming. Expert Systems with Applications, Vol. 34, 2 (2008), 1384--1393.
[30]
Bin Liu, Niannan Xue, Huifeng Guo, Ruiming Tang, Stefanos Zafeiriou, Xiuqiang He, and Zhenguo Li. 2020a. AutoGroup: Automatic feature grouping for modelling explicit high-order feature interactions in CTR prediction. In Proceedings of the 43rd international ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR). 199--208.
[31]
Bin Liu, Chenxu Zhu, Guilin Li, Weinan Zhang, et al. 2020b. Autofis: Automatic feature interaction selection in factorization models for click-through rate prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD). 2636--2645.
[32]
Huan Liu and Hiroshi Motoda. 2012. Feature selection for knowledge discovery and data mining. Vol. 454. Springer Science & Business Media.
[33]
Hanxiao Liu, Karen Simonyan, and Yiming Yang. 2019. DARTS: Differentiable architecture search. In International Conference on Learning Representations.
[34]
Huijie Liu, Han Wu, Le Zhang, Runlong Yu, Ye Liu, Chunli Liu, Minglei Li, Qi Liu, and Enhong Chen. 2022. A hierarchical interactive multi-channel graph neural network for technological knowledge flow forecasting. Knowledge and Information Systems, Vol. 64, 7 (2022), 1723--1757.
[35]
Qi Liu, Yong Ge, Zhongmou Li, Enhong Chen, and Hui Xiong. 2011. Personalized travel package recommendation. In 2011 IEEE 11th international conference on data mining. IEEE, 407--416.
[36]
Qi Liu, Runze Wu, Enhong Chen, Guandong Xu, Yu Su, Zhigang Chen, and Guoping Hu. 2018. Fuzzy cognitive diagnosis for modelling examinee performance. ACM Transactions on Intelligent Systems and Technology, Vol. 9, 4 (2018), 1--26.
[37]
Ze Lyu, Yu Dong, Chengfu Huo, and Weijun Ren. 2020. Deep match to rank model for personalized click-through rate prediction. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 34. 156--163.
[38]
Rammohan Mallipeddi and Ponnuthurai N Suganthan. 2008. Empirical study on the effect of population size on differential evolution algorithm. In 2008 IEEE Congress on Evolutionary Computation. IEEE, 3663--3670.
[39]
Durga Prasad Muni, Nikhil R Pal, and Jyotirmay Das. 2006. Genetic programming for simultaneous feature selection and classifier design. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 36, 1 (2006), 106--117.
[40]
Yanru Qu, Han Cai, Kan Ren, Weinan Zhang, et al. 2016. Product-based neural networks for user response prediction. In 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE, 1149--1154.
[41]
Yanru Qu, Bohui Fang, Weinan Zhang, et al. 2018. Product-based neural networks for user response prediction over multi-field categorical data. ACM Transactions on Information Systems, Vol. 37, 1 (2018), 1--35.
[42]
Steffen Rendle. 2010. Factorization machines. In 2010 IEEE International Conference on Data Mining (ICDM). IEEE, 995--1000.
[43]
Matthew Richardson et al. 2007. Predicting clicks: estimating the click-through rate for new ads. In International Conference on World Wide Web. 521--530.
[44]
Natascha Schaefer, Carola Rotermund, Eva-Maria Blumrich, et al. 2017. The malleable brain: plasticity of neural circuits and behavior--a review from students to students. Journal of Neurochemistry, Vol. 142, 6 (2017), 790--811.
[45]
Qixiang Shao, Runlong Yu, Hongke Zhao, Chunli Liu, Mengyi Zhang, Hongmei Song, and Qi Liu. 2021. Toward intelligent financial advisors for identifying potential clients: a multitask perspective. Big Data Mining and Analytics, Vol. 5, 1 (2021), 64--78.
[46]
Shu-Ting Shi, Wenhao Zheng, et al. 2020. Deep time-stream framework for click-through rate prediction by tracking interest evolution. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI). 5726--5733.
[47]
Qingquan Song, Dehua Cheng, Hanning Zhou, Jiyan Yang, Yuandong Tian, and Xia Hu. 2020. Towards automated neural interaction discovery for click-through rate prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD). 945--955.
[48]
Weiping Song, Chence Shi, Zhiping Xiao, Zhijian Duan, Yewen Xu, Ming Zhang, and Jian Tang. 2019. Autoint: Automatic feature interaction learning via self-attentive neural networks. In ACM International Conference on Information and Knowledge Management (CIKM). 1161--1170.
[49]
Lukasz Stasielowicz. 2020. How important is cognitive ability when adapting to changes? A meta-analysis of the performance adaptation literature. Personality and Individual Differences, Vol. 166 (2020), 110178.
[50]
Ke Tang, Peng Yang, and Xin Yao. 2016. Negatively correlated search. IEEE Journal on Selected Areas in Communications, Vol. 34, 3 (2016), 542--550.
[51]
Wanjie Tao, Yu Li, Liangyue Li, Zulong Chen, Hong Wen, Peilin Chen, Tingting Liang, and Quan Lu. 2022. SMINet: State-aware multi-aspect interests representation network for cold-start users recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 36. 8476--8484.
[52]
Akbar Telikani, Amirhessam Tahmassebi, et al. 2021. Evolutionary machine learning: A survey. Comput. Surveys, Vol. 54, 8 (2021), 1--35.
[53]
Fei Wang, Qi Liu, Enhong Chen, Zhenya Huang, et al. 2020. Neural cognitive diagnosis for intelligent education systems. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 34. 6153--6161.
[54]
Ruoxi Wang, Bin Fu, Gang Fu, and Mingliang Wang. 2017. Deep & cross network for ad click predictions. In Proceedings of the ADKDD'17. 1--7.
[55]
Zhiqiang Wang, Qingyun She, and Junlin Zhang. 2021. MaskNet: Introducing feature-wise multiplication to ctr ranking models by instance-guided mask. arXiv preprint arXiv:2102.07619 (2021).
[56]
Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu, and Tat-Seng Chua. 2017. Attentional factorization machines: learning the weight of feature interactions via attention networks. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI). 3119--3125.
[57]
Lin Xiao. 2009. Dual averaging method for regularized stochastic learning and online optimization. Advances in Neural Information Processing Systems, Vol. 22 (2009).
[58]
Yuexiang Xie, Zhen Wang, Yaliang Li, Bolin Ding, Nezihe Merve Gürel, Ce Zhang, Minlie Huang, Wei Lin, and Jingren Zhou. 2021. Fives: Feature interaction via edge search for large-scale tabular data. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD). 3795--3805.
[59]
Bing Xue, Mengjie Zhang, Will N Browne, and Xin Yao. 2015. A survey on evolutionary computation approaches to feature selection. IEEE Transactions on Evolutionary Computation, Vol. 20, 4 (2015), 606--626.
[60]
Hongyun Ye, Zhiwei Ni, and Enhong Chen. 2005. A mixed algorithm of integrated learning based evolutionary decision tree. In Proceedings of Digital Anhui Doctoral Science and Technology Forum.
[61]
Runlong Yu, Qi Liu, Yuyang Ye, Mingyue Cheng, Enhong Chen, and Jianhui Ma. 2022. Collaborative list-and-pairwise filtering from implicit feedback. IEEE Transactions on Knowledge & Data Engineering, Vol. 34, 06 (2022), 2667--2680.
[62]
Runlong Yu, Yuyang Ye, Qi Liu, Zihan Wang, Chunfeng Yang, Yucheng Hu, and Enhong Chen. 2021. Xcrossnet: Feature structure-oriented learning for click-through rate prediction. In Advances in Knowledge Discovery and Data Mining: 25th Pacific-Asia Conference (PAKDD). Springer, 436--447.
[63]
Runlong Yu, Hongke Zhao, Zhong Wang, Yuyang Ye, Peining Zhang, Qi Liu, and Enhong Chen. 2019. Negatively correlated search with asymmetry for real-parameter optimization problems. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, Vol. 56, 8 (2019), 1746--1757.
[64]
Kai Zhang, Hao Qian, Qing Cui, Qi Liu, Longfei Li, Jun Zhou, Jianhui Ma, and Enhong Chen. 2021. Multi-interactive attention network for fine-grained feature learning in ctr prediction. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining (WSDM). 984--992.
[65]
Weinan Zhang et al. 2016. Deep learning over multi-field categorical data. In European Conference on Information Retrieval. Springer, 45--57.
[66]
Hongke Zhao, Xinpeng Wu, et al. 2021. CoEA: A cooperative--competitive evolutionary algorithm for bidirectional recommendations. IEEE Transactions on Evolutionary Computation, Vol. 26, 1 (2021), 28--42.
[67]
Guorui Zhou, Na Mou, Ying Fan, et al. 2019a. Deep interest evolution network for click-through rate prediction. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vol. 33. 5941--5948.
[68]
Zhi-Hua Zhou, Yang Yu, and Chao Qian. 2019b. Evolutionary learning: Advances in theories and algorithms. Springer.

Cited By

View all
  • (2024)PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral OptimizationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671611(6148-6157)Online publication date: 25-Aug-2024
  • (2024)Evolution-Based Feature Selection for Predicting Dissolved Oxygen Concentrations in LakesParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70085-9_25(398-415)Online publication date: 7-Sep-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2023
5996 pages
ISBN:9798400701030
DOI:10.1145/3580305
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 August 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cognitive ability
  2. evolutionary learning
  3. feature selection
  4. nature inspired computing
  5. recommender systems

Qualifiers

  • Research-article

Funding Sources

Conference

KDD '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)898
  • Downloads (Last 6 weeks)38
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral OptimizationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671611(6148-6157)Online publication date: 25-Aug-2024
  • (2024)Evolution-Based Feature Selection for Predicting Dissolved Oxygen Concentrations in LakesParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70085-9_25(398-415)Online publication date: 7-Sep-2024

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media

Access Granted

The conference sponsors are committed to making content openly accessible in a timely manner.
This article is provided by ACM and the conference, through the ACM OpenTOC service.