Keyword: nonconvex optimization : Search

Article

A Proximal Algorithm for Network Slimming

Machine Learning, Optimization, and Data SciencePages 69–83https://doi.org/10.1007/978-3-031-53966-4_6

Abstract

As a popular channel pruning method for convolutional neural networks (CNNs), network slimming (NS) has a three-stage process: (1) it trains a CNN with $ℓ_{1}$ regularization applied to the scaling factors of the batch normalization layers; (2) it ...

research-article

Federated Optimization Under Intermittent Client Availability

INFORMS Journal on Computing (INFORMS-IJOC), Volume 36, Issue 1Pages 185–202https://doi.org/10.1287/ijoc.2022.0057

Federated learning is a new distributed machine learning framework, where numerous heterogeneous clients collaboratively train a model without sharing training data. In this work, we consider a practical and ubiquitous issue when deploying federated ...

Article

A Relaxed Proximal Gradient Descent Algorithm for Convergent Plug-and-Play with Proximal Denoiser

Scale Space and Variational Methods in Computer VisionPages 379–392https://doi.org/10.1007/978-3-031-31975-4_29

Abstract

This paper presents a new convergent Plug-and-Play (PnP) algorithm. PnP methods are efficient iterative algorithms for solving image inverse problems formulated as the minimization of the sum of a data-fidelity term and a regularization term. PnP ...

research-article

Naive Feature Selection: A Nearly Tight Convex Relaxation for Sparse Naive Bayes

Mathematics of Operations Research (MOOR), Volume 49, Issue 1Pages 278–296https://doi.org/10.1287/moor.2023.1356

Because of its linear complexity, naive Bayes classification remains an attractive supervised learning method, especially in very large-scale settings. We propose a sparse version of naive Bayes, which can be used for feature selection. This leads to a ...

research-article

Recursive Importance Sketching for Rank Constrained Least Squares: Algorithms and High-Order Convergence

Operations Research (OPRH), Volume 72, Issue 1Pages 237–256https://doi.org/10.1287/opre.2023.2445

Solving Rank Constrained Least Squares via Recursive Importance Sketching

In statistics and machine learning, we sometimes run into the rank-constrained least squares problems, for which we need to find the best low-rank fit between sets of data, such as ...

In this paper, we propose a recursive importance sketching algorithm for rank-constrained least squares optimization (RISRO). The key step of RISRO is recursive importance sketching, a new sketching framework based on deterministically designed recursive ...

research-article

Free

On learning rates and schrödinger operators

The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 379, Pages 18153–18205

Understanding the iterative behavior of stochastic optimization algorithms for minimizing nonconvex functions remains a crucial challenge in demystifying deep learning. In particular, it is not yet understood why certain simple techniques are remarkably ...

research-article

Free

Preconditioned gradient descent for overparameterized nonconvex Burer—Monteiro factorization with global optimality certification

The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 163, Pages 7781–7835

We consider using gradient descent to minimize the nonconvex function f(X) = ϕ(XX^T) over an n × r factor matrix X, in which ϕ is an underlying smooth convex cost function defined over n × n matrices. While only a second-order stationary point X can be ...

research-article

Free

Restarted nonconvex accelerated gradient descent: no more polylogarithmic factor in the O(ε ^-7/4) complexity

The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 157, Pages 7459–7495

This paper studies accelerated gradient methods for nonconvex optimization with Lipschitz continuous gradient and Hessian. We propose two simple accelerated gradient methods, restarted accelerated gradient descent (AGD) and restarted heavy ball (HB) ...

research-article

Free

Global convergence of sub-gradient method for robust matrix recovery: small initialization, noisy measurements, and over-parameterization

The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 96, Pages 4434–4517

In this work, we study the performance of sub-gradient method (SubGM) on a natural nonconvex and nonsmooth formulation of low-rank matrix recovery with ℓ₁-loss, where the goal is to recover a low-rank matrix from a limited number of measurements, a ...

research-article

Free

A line-search descent algorithm for strict saddle functions with complexity guarantees

Stephen J. Wright

The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 10, Pages 307–340

We describe a line-search algorithm which achieves the best-known worst-case complexity results for problems with a certain "strict saddle" property that has been observed to hold in low-rank matrix optimization problems. Our algorithm is adaptive, in ...

research-article

Nonconvex resource allocation for inelastic enterprise applications deployment into the cloud via particle swarm optimization

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 44, Issue 3Pages 3807–3823https://doi.org/10.3233/JIFS-201239

The advantages of cloud computing attract a large number of enterprises to deploy their applications into the cloud, thereby reducing their own operating costs. This paper considers deploying inelastic applications into the cloud and proposes an optimal ...

research-article

An improved stochastic gradient descent algorithm based on Rényi differential privacy

International Journal of Intelligent Systems (IJIS), Volume 37, Issue 12Pages 10694–10714https://doi.org/10.1002/int.22944

Abstract

Deep learning techniques based on the neural network have made significant achievements in various fields of artificial intelligence. However, model training requires large‐scale data sets, these data sets are crowd‐sourced and model parameters ...

research-article

Solving Nonsmooth and Nonconvex Compound Stochastic Programs with Applications to Risk Measure Minimization

Mathematics of Operations Research (MOOR), Volume 47, Issue 4Pages 3051–3083https://doi.org/10.1287/moor.2021.1247

This paper studies a structured compound stochastic program (SP) involving multiple expectations coupled by nonconvex and nonsmooth functions. We present a successive convex programming-based sampling algorithm and establish its subsequential convergence. ...

research-article

The Exact Modulus of the Generalized Concave Kurdyka-Łojasiewicz Property

Mathematics of Operations Research (MOOR), Volume 47, Issue 4Pages 2765–2783https://doi.org/10.1287/moor.2021.1227

We introduce a generalized version of the concave Kurdyka-Łojasiewicz (KL) property by employing nonsmooth desingularizing functions. We also present the exact modulus of the generalized concave KL property, which provides an answer to the open question ...

research-article

Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Nonconvex Stochastic Optimization: Nonasymptotic Performance Bounds and Momentum-Based Acceleration

Operations Research (OPRH), Volume 70, Issue 5Pages 2931–2947https://doi.org/10.1287/opre.2021.2162

Nonconvex Stochastic Optimization

Nonconvex stochastic optimization problems arise in many machine learning problems, including deep learning. The stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradients with a momentum ...

Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradients with momentum where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates toward a global minimum. Many works ...

research-article

A wonderful triangle in compressed sensing

Jun Wang

Information Sciences: an International Journal (ISCI), Volume 611, Issue CPages 95–106https://doi.org/10.1016/j.ins.2022.08.055

Abstract

In order to explore some concrete metric relationship between the ‖ x ‖ 1 / ‖ x ‖ ∞ and ‖ x ‖ 0, we introduce a wonderful triangle whose sides are composed of ‖ x ‖ 0 , ‖ x ‖ 1 and ‖ x ‖ ∞ for any non-zero vector x ∈ R n by delving ...

research-article

Inf-Convolution, Optimal Allocations, and Model Uncertainty for Tail Risk Measures

Mathematics of Operations Research (MOOR), Volume 47, Issue 3Pages 2494–2519https://doi.org/10.1287/moor.2021.1217

Inspired by the recent developments in risk sharing problems for the value at risk (VaR), the expected shortfall (ES), and the range value at risk (RVaR), we study the optimization of risk sharing for general tail risk measures. Explicit formulas of the ...

research-article

A Nonconvex Optimization Approach to IMRT Planning with Dose–Volume Constraints

INFORMS Journal on Computing (INFORMS-IJOC), Volume 34, Issue 3Pages 1366–1386https://doi.org/10.1287/ijoc.2021.1129

Fluence map optimization for intensity-modulated radiation therapy planning can be formulated as a large-scale inverse problem with competing objectives and constraints associated with the tumors and organs at risk. Unfortunately, clinically relevant dose–...

research-article

Open Access

Nonconvex Low-Rank Tensor Completion from Noisy Data

Operations Research (OPRH), Volume 70, Issue 2Pages 1219–1237https://doi.org/10.1287/opre.2021.2106

Abstract

This paper investigates a problem of broad practical interest, namely, the reconstruction of a large-dimensional low-rank tensor from highly incomplete and randomly corrupted observations of its entries. Although a number of papers have been ...

We study a noisy tensor completion problem of broad practical interest, namely, the reconstruction of a low-rank tensor from highly incomplete and randomly corrupted observations of its entries. Whereas a variety of prior work has been dedicated to this ...

research-article

Improved Performance Guarantees for Orthogonal Group Synchronization via Generalized Power Method

Shuyang Ling

SIAM Journal on Optimization (SIOPT), Volume 32, Issue 2Pages 1018–1048https://doi.org/10.1137/20M1389571

Given the noisy pairwise measurements among a set of unknown group elements, how does one recover them efficiently and robustly? This problem, known as group synchronization, has drawn tremendous attention in the scientific community. In this work, we ...

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences