Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/ISIT.2017.8007058guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

Encoded distributed optimization

Published: 25 June 2017 Publication History

Abstract

Today, many real-world machine learning and data analytics problems are of a scale that requires distributed optimization; unlike in centralized computing, these systems are vulnerable to network and node failures. Recently, coding-theoretic ideas have been applied to mitigate node failures in such distributed computing networks. Relaxing the exact recovery requirement of such techniques, we propose a novel approach for adding redundancy in large-scale convex optimization problems, making solvers more robust against sudden and persistent node failures and loss of data. This is done by linearly encoding the data variables; all other aspects the computation operate as usual. We show that under moderate amounts of redundancy, it is possible to recover a close approximation to the solution under node failures. In particular, we show that encoding with (equiangular) tight frames result in bounded objective error, and obtain an explicit error bound for a specific construction that uses Paley graphs. We also demonstrate the performance of the proposed technique for three specific machine learning problems, (two using real world datasets) namely ridge regression, binary support vector machine, and low-rank approximation.

References

[1]
J. Dean and L. A. Barroso, “The tail at scale,” Communications of the ACM, vol. 56, no. 2, pp. 74–80, 2013.
[2]
S. Li, M. A. Maddah-Ali, and A. S. Avestimehr, “Fundamental tradeoff between computation and communication in distributed computing,” in 2016 IEEE Sym. on. Information. Theory. pp. 1814–1818, 2016.
[3]
K. Lee et al., “Speeding up distributed machine learning using codes,” in 2016 IEEE Int. Sym. on Information Theory, pp. 1143–1147, 2016.
[4]
R. Tandon, Q. Lei, A. G. Dimakis, and N. Karampatziakis, “Gradient coding,” ML Systems Workshop (MLSyS), NIPS, 2016.
[5]
S. Dutta, V. Cadambe, and P. Grover, “Short-dot: Computing large linear transforms distributedly using coded short dot products,” in Advances In Neural Information Processing Systems, pp. 2092–2100, 2016.
[6]
M. W. Mahoney, “Randomized algorithms for matrices and data,” Foundations and Trends® in Machine Learning, vol. 3, no. 2, 2011.
[7]
R. B. Holmes and V. I. Paulsen, “Optimal frames for erasures,” Linear Algebra and its Applications, vol. 377, pp. 31–51, 2004.
[8]
P. G. Casazza and J. Kovačević “Equal-norm tight frames with erasures,” Advances in Comp. Math., vol. 18, no. 2-4, pp. 387–430, 2003.
[9]
T. Strohmer and R. W. Heath, “Grassmannian frames with applications to coding and communication,” Apnl. and comp. harmonic analysis, 2003.
[10]
J. Dean and S. Ghemawat, “MapReduce: simplified data processing on large clusters,” Comm. of the ACM, vol. 51, no. 1, pp. 107–113, 2008.
[11]
M. Zaharia et al., “Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing,” in Proc. 9th USENIX conf. on Networked Systems Design and Implementation, pp. 2–2, 2012.
[12]
I. Daubechies, Ten lectures on wavelets. SIAM, 1992.
[13]
L. Welch, “Lower bounds on the maximum cross correlation of signals (corresp.),” IEEE Trans. on Inf. theory, vol. 20, no. 3, 1974.
[14]
M. Pilanci and M. J. Wainwright, “Randomized sketches of convex programs with sharp guarantees,” IEEE Transactions on Information Theory, vol. 61, no. 9, pp. 5096–5115, 2015.
[15]
E. J. Candes and T. Tao, “Decoding by linear programming,” IEEE Trans. on Information Theory, vol. 51, no. 12, pp. 4203–4215, 2005.
[16]
Y. LeCun, C. Cortes, and C. J. Burges, “The MNIST database of handwritten digits,” 1998.
[17]
A. Beck and M. Teboulle, “A fast iterative shrinkage-thresholding algorithm for linear inverse problems.” SIAM. J. Imaging sciences. 2009.
[18]
J. Riedl and J. Konstan. “Movielens dataset,” 1998.
[19]
C. Karakus, Y. Sun, and S. Diggavi, “Encoded distributed optimization,” 2017, http://arxiv.org.

Cited By

View all
  • (2023)Impact of Redundancy on Resilience in Distributed Optimization and LearningProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571393(80-89)Online publication date: 4-Jan-2023
  • (2022)OrchestraProceedings of the 19th ACM International Conference on Computing Frontiers10.1145/3528416.3530246(181-184)Online publication date: 17-May-2022
  • (2021)Adaptive Coding for Matrix Multiplication at Edge Networks2021 IEEE International Symposium on Information Theory (ISIT)10.1109/ISIT45174.2021.9517801(1064-1069)Online publication date: 12-Jul-2021

Index Terms

  1. Encoded distributed optimization
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image Guide Proceedings
        2017 IEEE International Symposium on Information Theory (ISIT)
        Jun 2017
        3247 pages

        Publisher

        IEEE Press

        Publication History

        Published: 25 June 2017

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 15 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)Impact of Redundancy on Resilience in Distributed Optimization and LearningProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571393(80-89)Online publication date: 4-Jan-2023
        • (2022)OrchestraProceedings of the 19th ACM International Conference on Computing Frontiers10.1145/3528416.3530246(181-184)Online publication date: 17-May-2022
        • (2021)Adaptive Coding for Matrix Multiplication at Edge Networks2021 IEEE International Symposium on Information Theory (ISIT)10.1109/ISIT45174.2021.9517801(1064-1069)Online publication date: 12-Jul-2021

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media