Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleAugust 2023
Federated Optimization Under Intermittent Client Availability
- Yikai Yan,
- Chaoyue Niu,
- Yucheng Ding,
- Zhenzhe Zheng,
- Shaojie Tang,
- Qinya Li,
- Fan Wu,
- Chengfei Lyu,
- Yanghe Feng,
- Guihai Chen
INFORMS Journal on Computing (INFORMS-IJOC), Volume 36, Issue 1Pages 185–202https://doi.org/10.1287/ijoc.2022.0057Federated learning is a new distributed machine learning framework, where numerous heterogeneous clients collaboratively train a model without sharing training data. In this work, we consider a practical and ubiquitous issue when deploying federated ...
- research-articleMay 2023
Naive Feature Selection: A Nearly Tight Convex Relaxation for Sparse Naive Bayes
Mathematics of Operations Research (MOOR), Volume 49, Issue 1Pages 278–296https://doi.org/10.1287/moor.2023.1356Because of its linear complexity, naive Bayes classification remains an attractive supervised learning method, especially in very large-scale settings. We propose a sparse version of naive Bayes, which can be used for feature selection. This leads to a ...
- research-articleMay 2023
Recursive Importance Sketching for Rank Constrained Least Squares: Algorithms and High-Order Convergence
Solving Rank Constrained Least Squares via Recursive Importance Sketching
In statistics and machine learning, we sometimes run into the rank-constrained least squares problems, for which we need to find the best low-rank fit between sets of data, such as ...
In this paper, we propose a recursive importance sketching algorithm for rank-constrained least squares optimization (RISRO). The key step of RISRO is recursive importance sketching, a new sketching framework based on deterministically designed recursive ...
-
- research-articleMarch 2024
On learning rates and schrödinger operators
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 379, Pages 18153–18205Understanding the iterative behavior of stochastic optimization algorithms for minimizing nonconvex functions remains a crucial challenge in demystifying deep learning. In particular, it is not yet understood why certain simple techniques are remarkably ...
- research-articleMarch 2024
Preconditioned gradient descent for overparameterized nonconvex Burer—Monteiro factorization with global optimality certification
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 163, Pages 7781–7835We consider using gradient descent to minimize the nonconvex function f(X) = ϕ(XXT) over an n × r factor matrix X, in which ϕ is an underlying smooth convex cost function defined over n × n matrices. While only a second-order stationary point X can be ...
- research-articleMarch 2024
Restarted nonconvex accelerated gradient descent: no more polylogarithmic factor in the O(ε -7/4) complexity
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 157, Pages 7459–7495This paper studies accelerated gradient methods for nonconvex optimization with Lipschitz continuous gradient and Hessian. We propose two simple accelerated gradient methods, restarted accelerated gradient descent (AGD) and restarted heavy ball (HB) ...
- research-articleMarch 2024
Global convergence of sub-gradient method for robust matrix recovery: small initialization, noisy measurements, and over-parameterization
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 96, Pages 4434–4517In this work, we study the performance of sub-gradient method (SubGM) on a natural nonconvex and nonsmooth formulation of low-rank matrix recovery with ℓ1-loss, where the goal is to recover a low-rank matrix from a limited number of measurements, a ...
- research-articleMarch 2024
A line-search descent algorithm for strict saddle functions with complexity guarantees
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 10, Pages 307–340We describe a line-search algorithm which achieves the best-known worst-case complexity results for problems with a certain "strict saddle" property that has been observed to hold in low-rank matrix optimization problems. Our algorithm is adaptive, in ...
- research-articleJanuary 2023
Nonconvex resource allocation for inelastic enterprise applications deployment into the cloud via particle swarm optimization
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 44, Issue 3Pages 3807–3823https://doi.org/10.3233/JIFS-201239The advantages of cloud computing attract a large number of enterprises to deploy their applications into the cloud, thereby reducing their own operating costs. This paper considers deploying inelastic applications into the cloud and proposes an optimal ...
- research-articleDecember 2022
An improved stochastic gradient descent algorithm based on Rényi differential privacy
International Journal of Intelligent Systems (IJIS), Volume 37, Issue 12Pages 10694–10714https://doi.org/10.1002/int.22944AbstractDeep learning techniques based on the neural network have made significant achievements in various fields of artificial intelligence. However, model training requires large‐scale data sets, these data sets are crowd‐sourced and model parameters ...
- research-articleNovember 2022
Solving Nonsmooth and Nonconvex Compound Stochastic Programs with Applications to Risk Measure Minimization
Mathematics of Operations Research (MOOR), Volume 47, Issue 4Pages 3051–3083https://doi.org/10.1287/moor.2021.1247This paper studies a structured compound stochastic program (SP) involving multiple expectations coupled by nonconvex and nonsmooth functions. We present a successive convex programming-based sampling algorithm and establish its subsequential convergence. ...
- research-articleNovember 2022
The Exact Modulus of the Generalized Concave Kurdyka-Łojasiewicz Property
Mathematics of Operations Research (MOOR), Volume 47, Issue 4Pages 2765–2783https://doi.org/10.1287/moor.2021.1227We introduce a generalized version of the concave Kurdyka-Łojasiewicz (KL) property by employing nonsmooth desingularizing functions. We also present the exact modulus of the generalized concave KL property, which provides an answer to the open question ...
- research-articleSeptember 2022
Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Nonconvex Stochastic Optimization: Nonasymptotic Performance Bounds and Momentum-Based Acceleration
Nonconvex Stochastic Optimization
Nonconvex stochastic optimization problems arise in many machine learning problems, including deep learning. The stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradients with a momentum ...
Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradients with momentum where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates toward a global minimum. Many works ...
- research-articleSeptember 2022
A wonderful triangle in compressed sensing
Information Sciences: an International Journal (ISCI), Volume 611, Issue CPages 95–106https://doi.org/10.1016/j.ins.2022.08.055AbstractIn order to explore some concrete metric relationship between the ‖ x ‖ 1 / ‖ x ‖ ∞ and ‖ x ‖ 0, we introduce a wonderful triangle whose sides are composed of ‖ x ‖ 0 , ‖ x ‖ 1 and ‖ x ‖ ∞ for any non-zero vector x ∈ R n by delving ...
- research-articleAugust 2022
Inf-Convolution, Optimal Allocations, and Model Uncertainty for Tail Risk Measures
Mathematics of Operations Research (MOOR), Volume 47, Issue 3Pages 2494–2519https://doi.org/10.1287/moor.2021.1217Inspired by the recent developments in risk sharing problems for the value at risk (VaR), the expected shortfall (ES), and the range value at risk (RVaR), we study the optimization of risk sharing for general tail risk measures. Explicit formulas of the ...
- research-articleMay 2022
A Nonconvex Optimization Approach to IMRT Planning with Dose–Volume Constraints
INFORMS Journal on Computing (INFORMS-IJOC), Volume 34, Issue 3Pages 1366–1386https://doi.org/10.1287/ijoc.2021.1129Fluence map optimization for intensity-modulated radiation therapy planning can be formulated as a large-scale inverse problem with competing objectives and constraints associated with the tumors and organs at risk. Unfortunately, clinically relevant dose–...
- research-articleMarch 2022
Nonconvex Low-Rank Tensor Completion from Noisy Data
AbstractThis paper investigates a problem of broad practical interest, namely, the reconstruction of a large-dimensional low-rank tensor from highly incomplete and randomly corrupted observations of its entries. Although a number of papers have been ...
We study a noisy tensor completion problem of broad practical interest, namely, the reconstruction of a low-rank tensor from highly incomplete and randomly corrupted observations of its entries. Whereas a variety of prior work has been dedicated to this ...
- research-articleMay 2022
Improved Performance Guarantees for Orthogonal Group Synchronization via Generalized Power Method
SIAM Journal on Optimization (SIOPT), Volume 32, Issue 2Pages 1018–1048https://doi.org/10.1137/20M1389571Given the noisy pairwise measurements among a set of unknown group elements, how does one recover them efficiently and robustly? This problem, known as group synchronization, has drawn tremendous attention in the scientific community. In this work, we ...