Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/378580.378685acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
Article

Towards practical deteministic write-all algorithms

Published: 03 July 2001 Publication History

Abstract

The problem of performing t tasks on n asynchronous or undependable processors is a basic problem in parallel and distributed computing. We consider an abstraction of this problem called the Write-All problem— using n processors write 1's into all locations of an array of size t. The most efficient known deterministic asynchronous algorithms for this problem are due to Anderson and Woll. The first class of algorithms has work complexity of Ο(t . n ε), for nty and any ε > 0, and they are the best known for the full range of processors (n = t). To schedule the work of the processors, the algorithms use sets of q permutations on [q] (qn) that have certain combinatorial properties. Instantiating such an algorithm for a specific ε either requires substantial pre-processing (exponential in 1/ε2) to find the requisite permutations, or imposes a prohibitive constant (exponential in 1/ε3) hidden by the asymptotic analysis. The second class deals with the specific case of t = nu, u ≰ 2, and these algorithms have work complexity of Ο(t log t). They also use sets of permutations with the same combinatorial properties. However instantiating these algorithms requires exponential in n preprocessing to find the permutations. To alleviate this costly instantiation Kanellakis and Shvartsman proposed a simple way of computing the permutation schedules. They conjectured that their construction has the desired properties but they provided no analysis.
In this paper we show, for the first time, an analysis of the properties of the set of permutations proposed by Kanellakis and Shvartsman. Our result is hybrid as it includes analytical and empirical parts. The analytical result covers a subset of the possible adversarial patterns of asynchrony. The empirical results provide strong evidence that our analysis covers the worst case scenario, and we formally state it as a conjecture. We use these results to analyze an algorithm for t = nu (u ⪈ 2), tasks, that takes advantage of processor slackness and that has work Ο(t log2 t), conditioned on our conjecture. This algorithm requires only Ο(n log n) time to instantiate it. Next we study the case for the full range of processors nt. We define a family of deterministic asynchronous Write-All algorithms with work Ο(t . n ε) contingent upon our conjecture. We show that our method yields a faster construction of Ο(t . n ε) Write-All algorithms than the method developed by Anderson and Woll. Finally we show that our approach yields more efficient Write-all algorithms as compared to the algorithms induced by the constructions of Naor and Roth for the same asymptotic work complexity.

References

[1]
Anderson, R.J., Woll, H.: Algorithms for the Certified Write-All Problem. SIAM Journal on Computing 26 (5) (1997) 1277-1283
[2]
Aumann, Y., Rabin, M.O.: Clock Construction in Fully Asynchronous Parallel Systems and PRAM Simulation. Proc. of the 33rd IEEE Symp. on Foundations of Computer Science (1993) 147-156
[3]
Aumann, Y., Kedem, Z.M., Palem, K.V., Rabin, M.O.: Highly Efficient Asynchronous Execution of Large-Grained Parallel Programs. Proc. of the 34th IEEE Symp. on Foundations of Computer Science (1993) 271-280
[4]
Buss, J., Kanellakis, P.C., Ragde, P.L., Shvartsman, A.A.: Parallel Algorithms with Processor Failures and Delays. J. of Algorithms 20 (1996) 45-86
[5]
Chandrasekharan, K.: Introduction to Analytic Number Theory. Springer-Verlag (1968)
[6]
Cole, R., Zajicek, O.: The APRAM: Incorporating Asynchrony into the PRAM Model. Proc. of the 1989 ACM Symp. on Parallel Algorithms and Architectures (1989) 170-178
[7]
R. Cole and O. Zajicek: The Expected Advantage of Asynchrony. Proc. 2nd ACM Symp. on Parallel Algorithms and Architectures (1990) 85-94
[8]
Culler, D., Karp, R., Patterson, D., Sahay, A., Schauser, K.E., Santos, E., Subramonian, R., van Eicken, T.: LogP: Towards a Realistic Model of Parallel Computation. 4th ACM PPOPP (1993) 1-12
[9]
Dasgupta, P., Kedem, Z., Rabin, M.: Parallel Processing on Networks of Workstation: A Fault-Tolerant, High Performance Approach. Proc. of the International Conference on Distributed Computer Systems (1995) 467-474
[10]
Eppstein, D., Galil, Z.: Parallel techniques for combinatorial computation. Annual Computer Science Review, Vol. 3 (1988) 233-83
[11]
Fortune, S., Wyllie, J.: Parallelism in Random Access Machines Proc. the 10th ACM Symposium on Theory of Computing (1978) 114-118
[12]
Gibbons, A., Spirakis, P. (Eds.): Lectures on Parallel Computation. Cambridge International Series on Parallel Computation: 4, Cambridge University Press (1993)
[13]
Gibbons, P.: A More Practical PRAM Model. Proc. of the 1989 ACM Symposium on Parallel Algorithms and Architectures (1989) 158-168
[14]
Groote, J.F., Hesselink, W.H.: Synchronization-free parallel accessible hash-table. In preparation (1999)
[15]
Groote, J.F., Hesselink, W.H., Mauw, S., Vermeulen, R.: An algorithm for the asynchronous write-all problem based on process collision. Distributed Computing (2001)
[16]
Guy, R.: Unsolved Problems in Number Theory. Springer-Verlag (1994)
[17]
Hesselink, W.H., Groote, J.F.: Waitfree Distributed Memory Management by Create, and Read Until Deletion (CRUD). Technical report SEN-R9811, CWI, Amsterdam (1998)
[18]
Hornick, S.W., Preparata, F.P.: Deterministic P-RAM: simulation with constant redundancy. Proc. of the 1989 ACM Symp. on Parallel Algorithms and Arch. (1989) 103-109
[19]
Kanellakis, P.C., Shvartsman, A.A.: Efficient Parallel Algorithms Can Be Made Robust. Distributed Computing, vol. 5:4 (1992) 201-217 (Preliminary version: Proc. of the 8th ACM PODC (1989) 211-222)
[20]
Kanellakis, P.C., Shvartsman, A.A.: Fault-Tolerant Parallel Computation. Kluwer Academic Publishers (1997)
[21]
Karp, R.M., Ramachandran, V.: Survey of parallel algorithms for shared-memory machines. Handbook of Theoretical Computer Science, J. van Leeuwen, Ed., North-Holland (1990)
[22]
Kedem, Z.M., Palem, K.V., Rabin, M.O., Raghunathan, A.: Efficient Program Transformations for Resilient Parallel Computation via Randomization. Proc. 24th ACM Symp. on Theory of Comp (1992) 306-318
[23]
Kedem, Z.M., Palem, K.V., Spirakis, P.: Efficient Robust Parallel Computations. Proc. 22nd ACM Symp. on Theory of Computing (1990) 138-148
[24]
Kontogiannis, S., Pantziou, G., Spirakis, P., Yung, M.: Dynamic-Fault-prone BSP: A Paradigm for Robust Computation in Changing Environments. Proc. of the 10th ACM Symp. on Parallel Algorithms and Architectures (1998) 37-46
[25]
Knuth, D.E.: The Art of Computer Programming Vol. 2 (third edition). Addison-Wesley Pub Co. (1998)
[26]
Knuth, D.E.: The Art of Computer Programming Vol. 3 (third edition). Addison-Wesley Pub Co. (1998)
[27]
Martel, C., Park, A., Subramonian, R.: Work-optimal Asynchronous Algorithms for Shared Memory Parallel Computers. SIAM Journal on Computing, Vol. 21 (1992) 1070-1099
[28]
Martel, C., Subramonian, R., Park, A.: Asynchronous PRAMs are (Almost) as Good as Synchronous PRAMs. Proc. 32d IEEE Symp. on Foundations of Computer Science (1990) 590-599
[29]
Naor, J., Roth, R.M.: Constructions of Permutation Arrays for Certain Scheduling Cost Measures. Random Structures and Algorithms 6 (1) (1995) 39-50
[30]
Nishimura, N.: Asynchronous Shared Memory Parallel Computation. Proc. 3rd ACM Symp. on Parallel Algor. and Architect. (1990) 76-84
[31]
Papadimitriou C.H., Yannakakis, M.: Towards an Architecture-IndependentAnalysis of Parallel Algorithms. Proc. of the 20th Annual ACM Symp. on Theory of Computing (1988) 510-513
[32]
Shvartsman, A.A.: Achieving Optimal CRCW PRAM Fault-Tolerance. Information Processing Letters, Vol. 39:2 (1991) 59-66
[33]
Valiant, L.: General Purpose Parallel Architectures. Handbook of Theoretical Computer Science (ed. J. van Leeuwen), Vol. 1, North-Holland (1990)
[34]
Valiant, L.: A Bridging Model for Parallel Computation. Communications of the ACM, Vol. 33:8 (1990) 103-111
[35]
Yao, A., Knuth, D.: Analysis of the subtractive algorithm for greatest common divisors. Proceedings of National Academy of Sciences, USA, 72:4720-4722 (1975)

Cited By

View all
  • (2012)How to Allocate Tasks AsynchronouslyProceedings of the 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science10.1109/FOCS.2012.41(331-340)Online publication date: 20-Oct-2012
  • (2010)Emulating shared-memory Do-All algorithms in asynchronous message-passing systemsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2009.12.00270:6(699-705)Online publication date: 1-Jun-2010
  • (2008)Writing-all deterministically and optimally using a nontrivial number of asynchronous processorsACM Transactions on Algorithms10.1145/1367064.13670734:3(1-22)Online publication date: 4-Jul-2008
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SPAA '01: Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures
July 2001
340 pages
ISBN:1581134096
DOI:10.1145/378580
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 July 2001

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Write-All
  2. contention of permutations
  3. parallel algorithms
  4. scheduling
  5. work

Qualifiers

  • Article

Conference

SPAA01

Acceptance Rates

SPAA '01 Paper Acceptance Rate 34 of 93 submissions, 37%;
Overall Acceptance Rate 447 of 1,461 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2012)How to Allocate Tasks AsynchronouslyProceedings of the 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science10.1109/FOCS.2012.41(331-340)Online publication date: 20-Oct-2012
  • (2010)Emulating shared-memory Do-All algorithms in asynchronous message-passing systemsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2009.12.00270:6(699-705)Online publication date: 1-Jun-2010
  • (2008)Writing-all deterministically and optimally using a nontrivial number of asynchronous processorsACM Transactions on Algorithms10.1145/1367064.13670734:3(1-22)Online publication date: 4-Jul-2008
  • (2005)Cooperative asynchronous update of shared memoryProceedings of the thirty-seventh annual ACM symposium on Theory of computing10.1145/1060590.1060698(733-739)Online publication date: 22-May-2005
  • (2005)Explicit Combinatorial Structures for Cooperative Distributed AlgorithmsProceedings of the 25th IEEE International Conference on Distributed Computing Systems10.1109/ICDCS.2005.34(49-58)Online publication date: 6-Jun-2005
  • (2004)Writing-all deterministically and optimally using a non-trivial number of asynchronous processorsProceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures10.1145/1007912.1007964(311-320)Online publication date: 27-Jun-2004
  • (2004)Collective asynchronous reading with polylogarithmic worst-case overheadProceedings of the thirty-sixth annual ACM symposium on Theory of computing10.1145/1007352.1007406(321-330)Online publication date: 13-Jun-2004
  • (2004)A tight analysis and near-optimal instances of the algorithm of Anderson and WollTheoretical Computer Science10.1016/j.tcs.2004.10.015329:1-3(285-301)Online publication date: 13-Dec-2004
  • (2003)A work-optimal deterministic algorithm for the asynchronous certified write-all problemProceedings of the twenty-second annual symposium on Principles of distributed computing10.1145/872035.872075(255-264)Online publication date: 13-Jul-2003
  • (2003)A Method for Creating Near-Optimal Instances of a Certified Write-All AlgorithmAlgorithms - ESA 200310.1007/978-3-540-39658-1_39(422-433)Online publication date: 2003
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media