research-article

Encoded distributed optimization

Authors:

Suhas DiggaviAuthors Info & Claims

2017 IEEE International Symposium on Information Theory (ISIT)

Pages 2890 - 2894

https://doi.org/10.1109/ISIT.2017.8007058

Published: 25 June 2017 Publication History

Abstract

Today, many real-world machine learning and data analytics problems are of a scale that requires distributed optimization; unlike in centralized computing, these systems are vulnerable to network and node failures. Recently, coding-theoretic ideas have been applied to mitigate node failures in such distributed computing networks. Relaxing the exact recovery requirement of such techniques, we propose a novel approach for adding redundancy in large-scale convex optimization problems, making solvers more robust against sudden and persistent node failures and loss of data. This is done by linearly encoding the data variables; all other aspects the computation operate as usual. We show that under moderate amounts of redundancy, it is possible to recover a close approximation to the solution under node failures. In particular, we show that encoding with (equiangular) tight frames result in bounded objective error, and obtain an explicit error bound for a specific construction that uses Paley graphs. We also demonstrate the performance of the proposed technique for three specific machine learning problems, (two using real world datasets) namely ridge regression, binary support vector machine, and low-rank approximation.

References

[1]

J. Dean and L. A. Barroso, “The tail at scale,” Communications of the ACM, vol. 56, no. 2, pp. 74–80, 2013.

Digital Library

[2]

S. Li, M. A. Maddah-Ali, and A. S. Avestimehr, “Fundamental tradeoff between computation and communication in distributed computing,” in 2016 IEEE Sym. on. Information. Theory. pp. 1814–1818, 2016.

[3]

K. Lee et al., “Speeding up distributed machine learning using codes,” in 2016 IEEE Int. Sym. on Information Theory, pp. 1143–1147, 2016.

[4]

R. Tandon, Q. Lei, A. G. Dimakis, and N. Karampatziakis, “Gradient coding,” ML Systems Workshop (MLSyS), NIPS, 2016.

[5]

S. Dutta, V. Cadambe, and P. Grover, “Short-dot: Computing large linear transforms distributedly using coded short dot products,” in Advances In Neural Information Processing Systems, pp. 2092–2100, 2016.

[6]

M. W. Mahoney, “Randomized algorithms for matrices and data,” Foundations and Trends® in Machine Learning, vol. 3, no. 2, 2011.

[7]

R. B. Holmes and V. I. Paulsen, “Optimal frames for erasures,” Linear Algebra and its Applications, vol. 377, pp. 31–51, 2004.

[8]

P. G. Casazza and J. Kovačević “Equal-norm tight frames with erasures,” Advances in Comp. Math., vol. 18, no. 2-4, pp. 387–430, 2003.

[9]

T. Strohmer and R. W. Heath, “Grassmannian frames with applications to coding and communication,” Apnl. and comp. harmonic analysis, 2003.

[10]

J. Dean and S. Ghemawat, “MapReduce: simplified data processing on large clusters,” Comm. of the ACM, vol. 51, no. 1, pp. 107–113, 2008.

Digital Library

[11]

M. Zaharia et al., “Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing,” in Proc. 9th USENIX conf. on Networked Systems Design and Implementation, pp. 2–2, 2012.

[12]

I. Daubechies, Ten lectures on wavelets. SIAM, 1992.

[13]

L. Welch, “Lower bounds on the maximum cross correlation of signals (corresp.),” IEEE Trans. on Inf. theory, vol. 20, no. 3, 1974.

[14]

M. Pilanci and M. J. Wainwright, “Randomized sketches of convex programs with sharp guarantees,” IEEE Transactions on Information Theory, vol. 61, no. 9, pp. 5096–5115, 2015.

Digital Library

[15]

E. J. Candes and T. Tao, “Decoding by linear programming,” IEEE Trans. on Information Theory, vol. 51, no. 12, pp. 4203–4215, 2005.

Digital Library

[16]

Y. LeCun, C. Cortes, and C. J. Burges, “The MNIST database of handwritten digits,” 1998.

[17]

A. Beck and M. Teboulle, “A fast iterative shrinkage-thresholding algorithm for linear inverse problems.” SIAM. J. Imaging sciences. 2009.

[18]

J. Riedl and J. Konstan. “Movielens dataset,” 1998.

[19]

C. Karakus, Y. Sun, and S. Diggavi, “Encoded distributed optimization,” 2017, http://arxiv.org.

Cited By

Liu SGupta NVaidya N(2023)Impact of Redundancy on Resilience in Distributed Optimization and LearningProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571393(80-89)Online publication date: 4-Jan-2023
https://dl.acm.org/doi/10.1145/3571306.3571393
Du HHuang SXiang QSterpone LBartolini AButko A(2022)OrchestraProceedings of the 19th ACM International Conference on Computing Frontiers10.1145/3528416.3530246(181-184)Online publication date: 17-May-2022
https://dl.acm.org/doi/10.1145/3528416.3530246
Vedadi ESeferoglu H(2021)Adaptive Coding for Matrix Multiplication at Edge Networks2021 IEEE International Symposium on Information Theory (ISIT)10.1109/ISIT45174.2021.9517801(1064-1069)Online publication date: 12-Jul-2021
https://dl.acm.org/doi/10.1109/ISIT45174.2021.9517801

Index Terms

Encoded distributed optimization

Index terms have been assigned to the content through auto-classification.

Recommendations

Sigma encoded inverted files
CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management

Compression of term frequency lists and very long document-id lists within an inverted file search engine are examined. Several compression schemes are compared including Elias γ and δ codes, Golomb Encoding, Variable Byte Encoding, and a class of word-...
Big data multi-query optimisation with Apache Flink

Big data analytic frameworks, such as MapReduce, Spark and Flink, have recently gained more popularity to process large data. Flink is an open-source Apache-hosted big data analytic framework for processing batch and streaming data. For historical data ...
Distributed Computing in Big Data Analytics: Concepts, Technologies and Applications

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings

2017 IEEE International Symposium on Information Theory (ISIT)

Jun 2017

3247 pages

Copyright © 2017.

Publisher

IEEE Press

Publication History

Published: 25 June 2017

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 15 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Liu SGupta NVaidya N(2023)Impact of Redundancy on Resilience in Distributed Optimization and LearningProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571393(80-89)Online publication date: 4-Jan-2023
https://dl.acm.org/doi/10.1145/3571306.3571393
Du HHuang SXiang QSterpone LBartolini AButko A(2022)OrchestraProceedings of the 19th ACM International Conference on Computing Frontiers10.1145/3528416.3530246(181-184)Online publication date: 17-May-2022
https://dl.acm.org/doi/10.1145/3528416.3530246
Vedadi ESeferoglu H(2021)Adaptive Coding for Matrix Multiplication at Edge Networks2021 IEEE International Symposium on Information Theory (ISIT)10.1109/ISIT45174.2021.9517801(1064-1069)Online publication date: 12-Jul-2021
https://dl.acm.org/doi/10.1109/ISIT45174.2021.9517801

View Options

View options

Figures

Tables

Media

View Table of Conten