Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleFebruary 2025
Rapid and optimized parallel attribute reduction based on neighborhood rough sets and MapReduce
Expert Systems with Applications: An International Journal (EXWA), Volume 260, Issue Chttps://doi.org/10.1016/j.eswa.2024.125323AbstractAttribute reduction is a crucial step in data pre-processing and feature engineering. It is the selection of a subset of relevant data attributes to reduce the computational complexity of machine learning models and improve their performance. ...
- research-articleFebruary 2025
Entropy-weighted medoid shift: An automated clustering algorithm for high-dimensional data
AbstractUnveiling the intrinsic structure within high-dimensional data presents a significant challenge, particularly when clusters manifest themselves in lower-dimensional subspaces rather than in the full feature space. This complexity is prevalent in ...
Graphical abstractDisplay Omitted
Highlights- Novel mode-seeking algorithm for clustering high-dimensional datasets in projected subspaces.
- Subspace-determining scheme enhances accuracy of cluster identification.
- Guaranteed convergence without stopping criteria in the proposed ...
- research-articleJanuary 2025
A fully interpretable stacking fuzzy classifier with stochastic configuration-based learning for high-dimensional data
Information Sciences: an International Journal (ISCI), Volume 686, Issue Chttps://doi.org/10.1016/j.ins.2024.121359AbstractThis study proposes a stacking fuzzy classifier with stochastic configuration-based learning that can achieve higher training and testing performances and sound interpretability of fuzzy rules. By using the understandable first-order Takagi–...
- research-articleDecember 2024
A graph partitioning-based hybrid feature selection method in microarray datasets: A graph partitioning-based...
Knowledge and Information Systems (KAIS), Volume 67, Issue 1Pages 633–660https://doi.org/10.1007/s10115-024-02292-3AbstractFeature selection depicts one of the foremost methodologies in dimensionality reduction, with its primary objective being the extraction of pertinent features from an extensive dataset. Its process is driven by two principal objectives: reducing ...
-
- research-articleNovember 2024
Maximal cliques-based hybrid high-dimensional feature selection with interaction screening for regression
AbstractStudies on feature selection have been extensively conducted in the literature, as it plays a significant role in both supervised and unsupervised machine learning tasks. Since the bulk of features in high-dimensional data sets might not be ...
- research-articleNovember 2024
Discriminative fuzzy K-means clustering with local structure preservation for high-dimensional data
AbstractK-means clustering, which aims to distribute data points into K different clusters, is an effective technique for data mining and machine learning. However, several factors degrade clustering performance in real-world applications. High-...
- research-articleNovember 2024
Transfer learning for high-dimensional linear regression via the elastic net
AbstractIn this paper, the high-dimensional linear regression problem is explored via the Elastic Net under the transfer learning framework. Within this framework, potentially related source datasets are leveraged to enhance estimation or prediction ...
- research-articleNovember 2024
Mean and covariance estimation for discretely observed high-dimensional functional data: Rates of convergence and division of observational regimes
Journal of Multivariate Analysis (JMUL), Volume 204, Issue Chttps://doi.org/10.1016/j.jmva.2024.105355AbstractEstimation of the mean and covariance parameters for functional data is a critical task, with local linear smoothing being a popular choice. In recent years, many scientific domains are producing multivariate functional data for which p, the ...
- research-articleNovember 2024
Enhancing protection in high-dimensional data: Distributed differential privacy with feature selection
Information Processing and Management: an International Journal (IPRM), Volume 61, Issue 6https://doi.org/10.1016/j.ipm.2024.103870AbstractThe computational cost for implementing data privacy protection tends to rise as the dimensions increase, especially on correlated datasets. For this reason, a faster data protection mechanism is needed to handle high-dimensional data while ...
Highlights- A Differential Privacy approach to handle correlated and high-dimensional data.
- Combining feature selection, hyperparameter tuning, and distributed computing.
- It reduces the computational cost in evaluating correlated ...
- research-articleOctober 2024
Differentiable gated autoencoders for unsupervised feature selection
AbstractUnsupervised feature selection (UFS) aims to identify a subset of the most informative features from high-dimensional data without labels. However, most existing UFS methods cannot adequately capture the intricate nonlinear relationships present ...
- research-articleOctober 2024
Enhancement of the performance of high-dimensional fuzzy classification with feature combination optimization
Information Sciences: an International Journal (ISCI), Volume 680, Issue Chttps://doi.org/10.1016/j.ins.2024.121183AbstractIn high-dimensional classification, an important issue is how to enhance the performance of the classification processing mechanism. Various dimensionality reduction-based techniques such as feature selection and feature extraction have been ...
- research-articleSeptember 2024
Finding community structure in Bayesian networks by heuristic K-standard deviation method
Future Generation Computer Systems (FGCS), Volume 158, Issue CPages 556–568https://doi.org/10.1016/j.future.2024.03.047Highlights- The community structure is very common in large-scale BNs.
- Detect community structure in BN is vital for modeling BN by divide-and-conquer.
- The modified algorithm of A* finds the shortest paths in the BN skeleton graph.
- The ...
When constructing a Bayesian network for high-dimensional data, due to the complex relationships among distinct nodes, the difficulty in detecting the community structure will directly restrict the feasibility of the divide-and-conquer learning ...
- research-articleAugust 2024
A new maximum mean discrepancy based two-sample test for equal distributions in separable metric spaces
AbstractThis paper presents a novel two-sample test for equal distributions in separable metric spaces, utilizing the maximum mean discrepancy (MMD). The test statistic is derived from the decomposition of the total variation of data in the reproducing ...
- ArticleAugust 2024
A High-Dimensional Data Trust Publishing Method Based on Attention Mechanism and Differential Privacy
Advanced Intelligent Computing Technology and ApplicationsPages 208–219https://doi.org/10.1007/978-981-97-5606-3_18AbstractAiming at the challenge posted by existing differential privacy high-dimensional data safe publishing methods, which often struggle to holistically consider the comprehensive performance of computational efficiency, privacy protection degree, and ...
- research-articleAugust 2024
An adaptive dual-strategy constrained optimization-based coevolutionary optimizer for high-dimensional feature selection
Computers and Electrical Engineering (CENG), Volume 118, Issue PAhttps://doi.org/10.1016/j.compeleceng.2024.109362AbstractThe feature subset obtained by traditional feature selection algorithms usually contains many irrelevant features and redundant features, which increases the size of the feature set and reduces classification accuracy. More importantly, these ...
- ArticleAugust 2024
Refining Gene Selection and Outlier Detection in Glioblastoma Based on a Consensus Approach for Regularized Survival Models
AbstractGlioblastoma, the most malignant brain cancer in adults, exhibits vast heterogeneities in prognosis, clinicopathological features, immune landscapes, and immunotherapeutic responses, which calls the need to develop personalized therapeutic ...
- research-articleJuly 2024
Embedded feature selection approach based on TSK fuzzy system with sparse rule base for high-dimensional classification problems
AbstractIn high-dimensional problems of fuzzy rule-based embedded feature selection, the challenges include loss of interpretability, curse of dimensionality, and arithmetic underflow, among others. The primary reason for these problems is the ...
- research-articleJuly 2024
Estimation of multiple networks with common structures in heterogeneous subgroups
Journal of Multivariate Analysis (JMUL), Volume 202, Issue Chttps://doi.org/10.1016/j.jmva.2024.105298AbstractNetwork estimation has been a critical component of high-dimensional data analysis and can provide an understanding of the underlying complex dependence structures. Among the existing studies, Gaussian graphical models have been highly popular. ...
- research-articleJune 2024
Augmentation of degranulation mechanism for high-dimensional data with a multi-round optimization strategy
AbstractVarious fuzzy clustering-based granulation–degranulation techniques have been developed for constructing and optimizing information granules, which help reveal the underlying structure of experimental data in Granular Computing (GrC). Basically, ...