Abstract
The task of projecting onto \(\ell _p\) norm balls is ubiquitous in statistics and machine learning, yet actionable algorithms for doing so are largely limited to the special cases of \(p \in \left\{ 0, 1, 2, \infty \right\}\). In this paper, we introduce novel, scalable methods for projecting onto the \(\ell _p\)-ball for general \(p>0\). For \(p \ge 1\), we solve the univariate Lagrangian dual via a dual Newton method. We then carefully design a bisection approach for \(p<1\), presenting theoretical and empirical evidence of zero or a small duality gap in the non-convex case. We assess our contributions thoroughly in empirical studies and apply them to large-scale regularized multi-task learning and compressed sensing. The code implementing our methods is publicly available on GitHub.
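To fix ideas, the dual approach for \(p \ge 1\) can be sketched as follows. This is a minimal illustration, not the paper's algorithm: it replaces the dual Newton method with plain bisection on the Lagrange multiplier \(\lambda\), and the function names are hypothetical.

```python
import numpy as np

def prox_abs_pow(v, lam, p, iters=60):
    # Solve x + lam * p * x**(p - 1) = v for x in [0, v] by bisection;
    # the left-hand side is increasing in x whenever p >= 1.
    lo, hi = 0.0, v
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if mid + lam * p * mid ** (p - 1) < v:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def project_lp_ball(y, r, p, tol=1e-10):
    # Project y onto {x : ||x||_p^p <= r} for p >= 1 by bisecting on the
    # multiplier lam of the ball constraint (sketch; the paper instead
    # solves the univariate dual with a Newton method).
    y = np.asarray(y, dtype=float)
    if np.sum(np.abs(y) ** p) <= r:
        return y.copy()  # already inside the ball

    def constraint(lam):
        x = np.array([prox_abs_pow(abs(v), lam, p) for v in y])
        return np.sum(x ** p)

    lo, hi = 0.0, 1.0
    while constraint(hi) > r:      # grow the bracket until feasible
        lo, hi = hi, 2.0 * hi
    while hi - lo > tol:           # bisect: constraint is decreasing in lam
        mid = 0.5 * (lo + hi)
        if constraint(mid) > r:
            lo = mid
        else:
            hi = mid
    lam = 0.5 * (lo + hi)
    x = np.array([prox_abs_pow(abs(v), lam, p) for v in y])
    return np.sign(y) * x
```

For \(p = 2\) and \(r = 1\) this reduces to the familiar radial projection onto the unit \(\ell_2\) ball, which gives a quick sanity check.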
Notes
These properties have emerged in the context of studying theoretical properties of projected gradient descent for \(\ell _p\)-norm constrained least squares (problem (1) with \(\phi (\varvec{x},\varvec{y})=\frac{1}{2}\Vert \varvec{y}- \varvec{A}\varvec{x}\Vert _2^2\)). However, no actual algorithm for \(\ell _p\)-ball projection is provided in [2].
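For concreteness, projected gradient descent for this constrained least-squares problem takes the following generic form. This is a sketch under stated assumptions: `proj` is any user-supplied projection oracle onto the constraint set (here instantiated, for illustration only, with the closed-form unit \(\ell_2\)-ball projection), and the function names are hypothetical.

```python
import numpy as np

def projected_gradient_descent(A, y, proj, steps=500):
    # PGD for problem (1) with phi(x, y) = 0.5 * ||y - A x||_2^2:
    # take a gradient step on the least-squares loss, then project.
    lr = 1.0 / np.linalg.norm(A, 2) ** 2  # step size 1/L, L = ||A||_2^2
    x = np.zeros(A.shape[1])
    for _ in range(steps):
        grad = A.T @ (A @ x - y)
        x = proj(x - lr * grad)
    return x

# Example oracle: closed-form projection onto the unit l2 ball.
proj_l2 = lambda z: z / max(1.0, np.linalg.norm(z))
```

Any \(\ell_p\)-ball projection routine can be dropped in as `proj`, which is precisely the oracle whose absence in [2] the note above points out.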
Refer to Theorem 1 in https://planetmath.org/newtonsmethodworksforconvexrealfunctions (retrieved 2022-05-08).
Available at https://github.com/Optimizater/Lp-ball-Projection.
Available publicly at https://github.com/won-j/LpBallProjection.
Available at https://github.com/JuliaPy/PyCall.jl.
References
Argyriou, A., Evgeniou, T., Pontil, M.: Convex multi-task feature learning. Mach. Learn. 73(3), 243–272 (2008)
Bahmani, S., Raj, B.: A unifying analysis of projected gradient descent for \(\ell _p\)-constrained least squares. Appl. Comput. Harmon. Anal. 34(3), 366–378 (2013)
Barbero, A., Sra, S.: Modular proximal optimization for multidimensional total-variation regularization. J. Mach. Learn. Res. 19(1), 2232–2313 (2018)
Beck, A., Teboulle, M.: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imag. Sci. 2(1), 183–202 (2009)
Bertsekas, D.: Nonlinear Programming, 2nd edn. Athena Scientific, Belmont, MA, USA (1999)
Bertsekas, D.P.: Projected Newton methods for optimization problems with simple constraints. SIAM J. Control Optim. 20(2), 221–246 (1982)
Blumensath, T., Davies, M.E.: Iterative hard thresholding for compressed sensing. Appl. Comput. Harmon. Anal. 27(3), 265–274 (2009)
Boyd, S.P., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge, UK (2004)
Candes, E.J., Tao, T.: Decoding by linear programming. IEEE Trans. Inform. Theory 51(12), 4203–4215 (2005)
Chartrand, R., Staneva, V.: Restricted isometry properties and nonconvex compressive sensing. Inverse Prob. 24(3), 035020 (2008)
Chartrand, R., Yin, W.: Nonconvex sparse regularization and splitting algorithms. In: Splitting Methods in Communication, Imaging, Science, and Engineering, pp. 237–249. Springer (2016)
Chen, L., Jiang, X., Liu, X., Kirubarajan, T., Zhou, Z.: Outlier-robust moving object and background decomposition via structured \(\ell _p\)-regularized low-rank representation. IEEE Trans. Emerg. Topics Comput. Intell. 5, 620–638 (2021)
Chen, X., Niu, L., Yuan, Y.: Optimality conditions and a smoothing trust region Newton method for non-Lipschitz optimization. SIAM J. Optim. 23(3), 1528–1552 (2013)
Condat, L.: Fast projection onto the simplex and the \(\ell _1\) ball. Math. Program. 158(1–2), 575–585 (2016)
Das Gupta, M., Kumar, S.: Non-convex p-norm projection for robust sparsity. In: Proc. IEEE Int. Conf. Computer Vision, pp. 1593–1600 (2013)
Donoho, D.L.: Compressed sensing. IEEE Trans. Inform. Theory 52(4), 1289–1306 (2006)
Duchi, J., Shalev-Shwartz, S., Singer, Y., Chandra, T.: Efficient projections onto the \(\ell _1\)-ball for learning in high dimensions. In: Proc. 25th Int. Conf. Mach. Learn., pp. 272–279. ACM (2008)
Fu, W.J.: Penalized regressions: the bridge versus the lasso. J. Comput. Graph. Stat. 7(3), 397–416 (1998)
Hu, Y., Li, C., Meng, K., Qin, J., Yang, X.: Group sparse optimization via \(\ell _{p, q}\) regularization. J. Mach. Learn. Res. 18(1), 960–1011 (2017)
Lange, K.: MM Optimization Algorithms. SIAM, Philadelphia, PA, USA (2016)
Liu, H., Palatucci, M., Zhang, J.: Blockwise coordinate descent procedures for the multi-task lasso, with applications to neural semantic basis discovery. In: Proc. 26th Int. Conf. Mach. Learn., pp. 649–656. ACM (2009)
Liu, J., Ji, S., Ye, J.: SLEP: Sparse learning with efficient projections. Tech. rep., Arizona State University (2011). https://github.com/jiayuzhou/SLEP
Liu, J., Ye, J.: Efficient \(\ell _1\)/\(\ell _q\) norm regularization. arXiv:1009.4766 (2010)
Lu, Z.: Iterative reweighted minimization methods for \(\ell _p\) regularized unconstrained nonlinear programming. Math. Program. 147(1), 277–307 (2014)
Marjanovic, G., Solo, V.: On \(\ell _q\) optimization and matrix completion. IEEE Trans. Signal Process. 60(11), 5714–5724 (2012)
Meier, L., Van De Geer, S., Bühlmann, P.: The group lasso for logistic regression. J. R. Stat. Soc. Ser. B. Stat. Methodol. 70(1), 53–71 (2008)
Oymak, S., Recht, B., Soltanolkotabi, M.: Sharp time-data tradeoffs for linear inverse problems. IEEE Trans. Inform. Theory 64(6), 4129–4158 (2017)
Quattoni, A., Carreras, X., Collins, M., Darrell, T.: An efficient projection for \(\ell _{1,\infty }\) regularization. In: Proc. 26th Int. Conf. Mach. Learn., pp. 857–864. ACM (2009)
Sattar, Y., Oymak, S.: Quickly finding the best linear model in high dimensions via projected gradient descent. IEEE Trans. Signal Process. 68, 818–829 (2020)
Sra, S.: Fast projections onto mixed-norm balls with applications. Data Min. Knowl. Discov. 25(2), 358–377 (2012)
Tibshirani, R., Wainwright, M., Hastie, T.: Statistical Learning with Sparsity: the Lasso and Generalizations. Chapman and Hall/CRC, Boca Raton (2015)
Vogt, J.E., Roth, V.: A complete analysis of the \(\ell _{1,p}\) group-lasso. In: Proc. 29th Int. Conf. Mach. Learn., pp. 1091–1098. Omnipress (2012)
Wang, M., Xu, W., Tang, A.: On the performance of sparse recovery via \(\ell _p\)-minimization \((0 \le p \le 1)\). IEEE Trans. Inform. Theory 57(11), 7255–7278 (2011)
Xu, Z., Chang, X., Xu, F., Zhang, H.: \({L}_{1/2}\) regularization: a thresholding representation theory and a fast solver. IEEE Trans. Neural Netw. Learn. Syst. 23(7), 1013–1027 (2012)
Yang, X., Wang, J., Wang, H.: Towards an efficient approach for the nonconvex \(\ell _p\) ball projection: algorithm and analysis. arXiv:2101.01350 (2021)
Yuan, M., Lin, Y.: Model selection and estimation in regression with grouped variables. J. R. Stat. Soc. Ser. B. Stat. Methodol. 68(1), 49–67 (2006)
Yukawa, M., Amari, S.-I.: \(\ell _p\)-regularized least squares \((0<p<1)\) and critical path. IEEE Trans. Inform. Theory 62(1), 488–502 (2016)
Zhang, Y., Yeung, D.Y., Xu, Q.: Probabilistic multi-task feature selection. In: Adv. Neural Inf. Process. Syst., pp. 2559–2567 (2010)
Zhou, Z., Zhang, Q., So, A.M.C.: \(\ell _{1,p}\)-norm regularization: error bounds and convergence rate analysis of first-order methods. In: Proc. 32nd Int. Conf. Mach. Learn., vol. 37, pp. 1501–1510 (2015)
Acknowledgements
We thank the associate editor and anonymous referees for providing constructive comments, especially for pointing out references [12, 13, 15, 24, 35]. JW was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (2019R1A2C1007126). KL was supported by the United States Public Health Service (USPHS) grants GM53275 and HG006139.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Won, JH., Lange, K. & Xu, J. A unified analysis of convex and non-convex \(\ell _p\)-ball projection problems. Optim Lett 17, 1133–1159 (2023). https://doi.org/10.1007/s11590-022-01919-0