Computer Science > Information Retrieval

arXiv:2403.12660 (cs)

[Submitted on 19 Mar 2024 (v1), last revised 19 Jun 2024 (this version, v3)]

Title:ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems

Authors:Pengyue Jia, Yejing Wang, Zhaocheng Du, Xiangyu Zhao, Yichao Wang, Bo Chen, Wanyu Wang, Huifeng Guo, Ruiming Tang

View PDF HTML (experimental)

Abstract:Deep Recommender Systems (DRS) are increasingly dependent on a large number of feature fields for more precise recommendations. Effective feature selection methods are consequently becoming critical for further enhancing the accuracy and optimizing storage efficiencies to align with the deployment demands. This research area, particularly in the context of DRS, is nascent and faces three core challenges. Firstly, variant experimental setups across research papers often yield unfair comparisons, obscuring practical insights. Secondly, the existing literature's lack of detailed analysis on selection attributes, based on large-scale datasets and a thorough comparison among selection techniques and DRS backbones, restricts the generalizability of findings and impedes deployment on DRS. Lastly, research often focuses on comparing the peak performance achievable by feature selection methods, an approach that is typically computationally infeasible for identifying the optimal hyperparameters and overlooks evaluating the robustness and stability of these methods. To bridge these gaps, this paper presents ERASE, a comprehensive bEnchmaRk for feAture SElection for DRS. ERASE comprises a thorough evaluation of eleven feature selection methods, covering both traditional and deep learning approaches, across four public datasets, private industrial datasets, and a real-world commercial platform, achieving significant enhancement. Our code is available online for ease of reproduction.

Comments:	Accepted to KDD 2024
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.12660 [cs.IR]
	(or arXiv:2403.12660v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2403.12660

Submission history

From: Pengyue Jia [view email]
[v1] Tue, 19 Mar 2024 11:49:35 UTC (210 KB)
[v2] Wed, 20 Mar 2024 05:10:22 UTC (210 KB)
[v3] Wed, 19 Jun 2024 12:48:25 UTC (224 KB)

Computer Science > Information Retrieval

Title:ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators