Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
Differentiable gated autoencoders for unsupervised feature selection
AbstractUnsupervised feature selection (UFS) aims to identify a subset of the most informative features from high-dimensional data without labels. However, most existing UFS methods cannot adequately capture the intricate nonlinear relationships present ...
- research-articleNovember 2024
Enhancement of the performance of high-dimensional fuzzy classification with feature combination optimization
Information Sciences: an International Journal (ISCI), Volume 680, Issue Chttps://doi.org/10.1016/j.ins.2024.121183AbstractIn high-dimensional classification, an important issue is how to enhance the performance of the classification processing mechanism. Various dimensionality reduction-based techniques such as feature selection and feature extraction have been ...
- research-articleAugust 2024
A new maximum mean discrepancy based two-sample test for equal distributions in separable metric spaces
AbstractThis paper presents a novel two-sample test for equal distributions in separable metric spaces, utilizing the maximum mean discrepancy (MMD). The test statistic is derived from the decomposition of the total variation of data in the reproducing ...
- ArticleAugust 2024
A High-Dimensional Data Trust Publishing Method Based on Attention Mechanism and Differential Privacy
Advanced Intelligent Computing Technology and ApplicationsPages 208–219https://doi.org/10.1007/978-981-97-5606-3_18AbstractAiming at the challenge posted by existing differential privacy high-dimensional data safe publishing methods, which often struggle to holistically consider the comprehensive performance of computational efficiency, privacy protection degree, and ...
- research-articleNovember 2024
An adaptive dual-strategy constrained optimization-based coevolutionary optimizer for high-dimensional feature selection
Computers and Electrical Engineering (CENG), Volume 118, Issue PAhttps://doi.org/10.1016/j.compeleceng.2024.109362AbstractThe feature subset obtained by traditional feature selection algorithms usually contains many irrelevant features and redundant features, which increases the size of the feature set and reduces classification accuracy. More importantly, these ...
-
- research-articleJuly 2024
Finding community structure in Bayesian networks by heuristic K-standard deviation method
Future Generation Computer Systems (FGCS), Volume 158, Issue CPages 556–568https://doi.org/10.1016/j.future.2024.03.047Highlights- The community structure is very common in large-scale BNs.
- Detect community structure in BN is vital for modeling BN by divide-and-conquer.
- The modified algorithm of A* finds the shortest paths in the BN skeleton graph.
- The ...
When constructing a Bayesian network for high-dimensional data, due to the complex relationships among distinct nodes, the difficulty in detecting the community structure will directly restrict the feasibility of the divide-and-conquer learning ...
- ArticleAugust 2024
Refining Gene Selection and Outlier Detection in Glioblastoma Based on a Consensus Approach for Regularized Survival Models
AbstractGlioblastoma, the most malignant brain cancer in adults, exhibits vast heterogeneities in prognosis, clinicopathological features, immune landscapes, and immunotherapeutic responses, which calls the need to develop personalized therapeutic ...
- research-articleJuly 2024
Embedded feature selection approach based on TSK fuzzy system with sparse rule base for high-dimensional classification problems
AbstractIn high-dimensional problems of fuzzy rule-based embedded feature selection, the challenges include loss of interpretability, curse of dimensionality, and arithmetic underflow, among others. The primary reason for these problems is the ...
- research-articleJuly 2024
Estimation of multiple networks with common structures in heterogeneous subgroups
Journal of Multivariate Analysis (JMUL), Volume 202, Issue Chttps://doi.org/10.1016/j.jmva.2024.105298AbstractNetwork estimation has been a critical component of high-dimensional data analysis and can provide an understanding of the underlying complex dependence structures. Among the existing studies, Gaussian graphical models have been highly popular. ...
- research-articleJuly 2024
Augmentation of degranulation mechanism for high-dimensional data with a multi-round optimization strategy
AbstractVarious fuzzy clustering-based granulation–degranulation techniques have been developed for constructing and optimizing information granules, which help reveal the underlying structure of experimental data in Granular Computing (GrC). Basically, ...
- research-articleJuly 2024
Elastic net-based high dimensional data selection for regression
Expert Systems with Applications: An International Journal (EXWA), Volume 244, Issue Chttps://doi.org/10.1016/j.eswa.2023.122958AbstractHigh-dimensional feature selection is of particular interest to researchers. In some domains, such as microarray data, it is quite common for a group of highly correlated explanatory variables to be of equal importance for inclusion in the ...
- research-articleJuly 2024
Search space division method for wrapper feature selection on high-dimensional data classification
AbstractFeature selection (FS) is an essential pre-processing technique for high-dimensional data. Wrapper-based FS techniques are known for their superior performance over filter FS. However, when the dimensionality of data is very high the wrapper ...
- research-articleJune 2024
A fusion of centrality and correlation for feature selection
Expert Systems with Applications: An International Journal (EXWA), Volume 241, Issue Chttps://doi.org/10.1016/j.eswa.2023.122548AbstractThe rapid development of computer and database technologies has led to the high growth of large-scale datasets. This produces an important issue for data mining applications called the curse of dimensionality, where the number of features is much ...
Highlights- A metric is proposed to measure the influence of features themselves.
- The new discriminant function is proposed to select the optimal feature subset.
- Our method can effectively reduce the dimensionality and computational ...
- research-articleMarch 2024
Variable selection using axis-aligned random projections for partial least-squares regression
AbstractIn high-dimensional data modeling, variable selection plays a crucial role in improving predictive accuracy and enhancing model interpretability through sparse representation. Unfortunately, certain variable selection methods encounter challenges ...
- research-articleMay 2024
High dimensional mislabeled learning
AbstractHigh-dimensional mislabeled learning is essential in AI theory and applications but rarely investigated. In this study, we present a novel learning technique to detect and rectify high-dimensional mislabeled data using proposed Feature Self-...
- research-articleJune 2024
Enhanced NSGA-II-based feature selection method for high-dimensional classification
Information Sciences: an International Journal (ISCI), Volume 663, Issue Chttps://doi.org/10.1016/j.ins.2024.120269AbstractFeature selection in high-dimensional data faces significant challenges owing to large and discrete decision spaces. In this study, we propose a feature selection method based on the nondominated sorting genetic algorithm-II (NSGA-II) to enhance ...
- research-articleFebruary 2024
- research-articleFebruary 2024
Variables selection using L 0 penalty
Computational Statistics & Data Analysis (CSDA), Volume 190, Issue Chttps://doi.org/10.1016/j.csda.2023.107860AbstractThe determination of a tuning parameter by the generalized information criterion (GIC) is considered an important issue in variable selection. It is shown that the GIC and the L 0 penalized objective functions are equivalent, leading to a new L 0 ...
- research-articleDecember 2023
CSViz: Class Separability Visualization for high-dimensional datasets
Applied Intelligence (KLU-APIN), Volume 54, Issue 1Pages 924–946https://doi.org/10.1007/s10489-023-05149-4AbstractData visualization is an essential task during the lifecycle of any Data Science (DS) project, particularly during the Exploratory Data Analysis (EDA) for a correct data preparation and understanding. In classification problems, data visualization ...
- review-articleDecember 2023
Feature selection techniques for machine learning: a survey of more than two decades of research
Knowledge and Information Systems (KAIS), Volume 66, Issue 3Pages 1575–1637https://doi.org/10.1007/s10115-023-02010-5AbstractLearning algorithms can be less effective on datasets with an extensive feature space due to the presence of irrelevant and redundant features. Feature selection is a technique that effectively reduces the dimensionality of the feature space by ...