Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3673038.3673103acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article
Open access

Designing Non-uniform Locally Repairable Codes for Wide Stripes under Skewed File Accesses

Published: 12 August 2024 Publication History

Abstract

Large-scale storage systems increasingly deploy erasure coding to provide high-reliability and low-cost data storage. To achieve further storage savings while maintaining repair efficiency, enterprises and academia explore wide stripes using locally repairable codes (LRCs) and several constructions are proposed. However, we find that these wide stripe LRCs neglect the file access information within the stripe, which matters as these LRCs assign uniform local group sizes while a wide stripe comprises many files with non-uniform or skewed access frequencies. Consequently, the repair efficiency is compromised. To this end, we design a novel wide stripe LRC construction called non-uniform LRC by fully considering the skewed file accesses in a wide stripe. Our non-uniform LRC assigns small group sizes for hot files in the stripe to achieve low repair cost and large group sizes for cold files to realize low storage overhead. It performs well under general coding parameters, adapts to irregular file sizes, provides high fault tolerance, and can be flexibly deployed to various data placements in real-world clustered storage systems. We implement our non-uniform LRC in a distributed storage prototype. Evaluations show that it greatly reduces the degraded read time and improves the node repair rate compared with the state-of-the-art wide stripe LRCs.

References

[1]
[n. d.]. Ceph - Locally Repairable Erasure Code Plugin. https://docs.ceph.com/en/latest/rados/operations/erasure-code-lrc/.
[2]
[n. d.]. CubeFS - Erasure Code Subsystem. https://cubefs.readthedocs.io/en/latest/design/blobstore.html.
[3]
[n. d.]. Erasure coding used by Backblaze.https://www.backblaze.com/blog/reed-solomon/.
[4]
[n. d.]. IDC. Data age 2025. https://www.seagate.com/our-story/data-age-2025/.
[5]
[n. d.]. SWIM workload repository. https://github.com/SWIMProjectUCB/SWIM/wiki/Workloads-repository.
[6]
[n. d.]. The Wonder Shaper 1.4. https://github.com/magnific0/wondershaper.
[7]
[n. d.]. VastData.https://vastdata.com/providing-resilience-efficiently-part-ii/.
[8]
Johannes Blomer, Malik Kalfane, Richard Karp, Marek Karpinski, and David Zuckerman. 1995. An XOR-based erasure-resilient coding scheme. Technical Report TR-95-048, International Computer Science Institute (1995).
[9]
Haibo Chen, Heng Zhang, Mingkai Dong, Zhaoguo Wang, Yubin Xia, Haibing Guan, and Binyu Zang. 2017. Efficient and available in-memory KV-store with hybrid erasure coding and replication. ACM Trans. Storage 13, 3 (2017), 25.
[10]
Mosharaf Chowdhury, Srikanth Kandula, and Ion Stoica. 2013. Leveraging endpoint flexibility in data-intensive clusters. In Proc. of ACM SIGCOMM.
[11]
Cisco. 2014. Oversubscription and density best practices. https://www.cisco.com/c/en/us/solutions/collateral/data-centervirtualization/storage-networkingsolution/net_implementation_white_paper0900aecd800f592f.html.
[12]
Daniel Ford, François Labelle, Florentina Popovici, Murray Stokely, Van-Anh Truong, Luiz Barroso, Carrie Grimes, and Sean Quinlan. 2010. Availability in globally distributed storage systems. In Proc. of USENIX OSDI.
[13]
Parikshit Gopalan, Cheng Huang, Huseyin Simitci, and Sergey Yekhanin. 2012. On the locality of codeword symbols. IEEE Trans. on Information theory 58, 11 (2012), 6925–6934.
[14]
Hanxu Hou, Patrick PC Lee, Kenneth W Shum, and Yuchong Hu. 2019. Rack-aware regenerating codes for data centers. IEEE Trans. on Information Theory 65, 8 (Aug 2019), 4730–4745.
[15]
Yuchong Hu, Liangfeng Cheng, Qiaori Yao, Patrick PC Lee, Weichun Wang, and Wei Chen. 2021. Exploiting combined locality for wide-stripe erasure coding in distributed storage. In Proc. of USENIX FAST.
[16]
Yuchong Hu, Xiaolu Li, Mi Zhang, Patrick PC Lee, Xiaoyang Zhang, Pan Zhou, and Dan Feng. 2017. Optimal repair layering for erasure-coded data centers: From theory to practice. ACM Trans. on Storage 13, 4 (2017), 33.
[17]
Yupeng Hu, Yonghe Liu, Wenjia Li, Keqin Li, Kenli Li, Nong Xiao, and Zheng Qin. 2021. Unequal failure protection coding technique for distributed cloud storage systems. IEEE Trans. on Cloud Computing 9, 1 (2021), 386–400.
[18]
Cheng Huang, Huseyin Simitci, Yikang Xu, Aaron Ogus, Brad Calder, Parikshit Gopalan, Jin Li, and Sergey Yekhanin. 2012. Erasure coding in Windows Azure Storage. In Proc. of USENIX ATC.
[19]
Saurabh Kadekodi, Shashwat Silas, David Clausen, and Arif Merchant. 2023. Practical design considerations for wide Locally Recoverable Codes (LRCs). In Proc. of USENIX FAST.
[20]
Oleg Kolosov, Gala Yadgar, Matan Liram, Itzhak Tamo, and Alexander Barg. 2018. On fault tolerance, locality, and optimality in Locally Repairable Codes. In Proc. of USENIX ATC.
[21]
Jun Li and Baochun Li. 2021. Demand-aware erasure coding for distributed storage systems. IEEE Trans. on Cloud Computing 9, 2 (2021), 532–545.
[22]
Shuang Ma, Si Wu, Cheng Li, and Yinlong Xu. 2022. Repair-optimal data placement for Locally Repairable Codes with optimal minimum hamming distance. In Proc. of ACM ICPP.
[23]
Subramanian Muralidhar, Wyatt Lloyd, Sabyasachi Roy, Cory Hill, Ernest Lin, Weiwen Liu, Satadru Pan, Shiva Shankar, Viswanath Sivakumar, Linpeng Tang, 2014. f4: Facebook’s warm BLOB storage system. In Proc. of USENIX OSDI.
[24]
Michael Ovsiannikov, Silvius Rus, Damian Reeves, Paul Sutter, Sriram Rao, and Jim Kelly. 2013. The quantcast file system. Proc. VLDB Endow. 6, 11 (2013), 1092–1101.
[25]
James S Plank, Jianqiang Luo, Catherine D Schuman, Lihao Xu, Zooko Wilcox-O’Hearn, 2009. A performance evaluation and examination of open-source erasure coding libraries for storage. In Proc. of USENIX FAST.
[26]
N Prakash, Vitaly Abdrashitov, and Muriel Médard. 2018. The storage versus repair-bandwidth trade-off for clustered storage systems. IEEE Trans. on Information Theory 64, 8 (Aug 2018), 5783–5805.
[27]
Ankit Singh Rawat, Dimitris S Papailiopoulos, Alexandros G Dimakis, and Sriram Vishwanath. 2016. Locality and availability in distributed storage. IEEE Trans. on Information Theory 62, 8 (2016), 4481–4493.
[28]
Irving Reed and Gustave Solomon. 1960. Polynomial codes over certain finite fields. J. Soc. Indust. Appl. Math. 8, 2 (1960), 300–304.
[29]
Maheswaran Sathiamoorthy, Megasthenis Asteris, Dimitris Papailiopoulos, Alexandros G Dimakis, Ramkumar Vadali, Scott Chen, and Dhruba Borthakur. 2013. XORing Elephants: Novel erasure codes for big data. In Proc. of VLDB Endowment. 325–336.
[30]
Zhirong Shen, Jiwu Shu, and Patrick PC Lee. 2016. Reconsidering single failure recovery in clustered file systems. In Proc. of IEEE/IFIP DSN.
[31]
Natalia Silberstein, Ankit Singh Rawat, O Ozan Koyluoglu, and Sriram Vishwanath. 2013. Optimal Locally Repairable Codes via rank-metric codes. In Proc. of IEEE ISIT.
[32]
Itzhak Tamo and Alexander Barg. 2014. A family of optimal Locally Recoverable Codes. IEEE Trans. on Information Theory 60, 8 (2014), 4661–4676.
[33]
Itzhak Tamo, Dimitris S Papailiopoulos, and Alexandros G Dimakis. 2016. Optimal Locally Repairable Codes and connections to matroid theory. IEEE Trans. on Information Theory 62, 12 (2016), 6661–6671.
[34]
Ashish Vulimiri, Carlo Curino, P Brighten Godfrey, Thomas Jungblut, Jitu Padhye, and George Varghese. 2015. Global analytics in the face of bandwidth and regulatory constraints. In Proc. of USENIX NSDI.
[35]
Hakim Weatherspoon and John D Kubiatowicz. 2002. Erasure coding vs. replication: A quantitative comparison. In Proc. of IPTPS.
[36]
Si Wu, Qingpeng Du, Patrick PC Lee, Yongkun Li, and Yinlong Xu. 2022. Optimal data placement for stripe merging in Locally Repairable Codes. In Proc. of IEEE INFOCOM.
[37]
Si Wu, Zhirong Shen, Patrick P. C. Lee, and Yinlong Xu. 2022. Optimal repair-scaling trade-off in Locally Repairable Codes: analysis and evaluation. IEEE Trans. on Parallel and Distributed Systems 33, 1 (2022), 56–69.
[38]
Mingyuan Xia, Mohit Saxena, Mario Blaum, and David A Pease. 2015. A tale of two erasure codes in HDFS. In Proc. of USENIX FAST.
[39]
Zhikai Zhang, Shushi Gu, and Qinyu Zhang. 2022. Scalable Local Reconstruction Code design for hot data reads in cloud storage systems. Science China Information Sciences 65, 12 (2022).
[40]
Hao Zhao, Si Wu, Haifeng Liu, Zhixiang Tang, Xiaochun He, and Yinlong Xu. 2023. Toward optimal repair and load balance in Locally Repairable Codes. In Proc. of ACM ICPP.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP '24: Proceedings of the 53rd International Conference on Parallel Processing
August 2024
1279 pages
ISBN:9798400717932
DOI:10.1145/3673038
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 August 2024

Check for updates

Author Tags

  1. Data repair
  2. Locally Repairable Codes
  3. Wide stripes

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

ICPP '24

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 122
    Total Downloads
  • Downloads (Last 12 months)122
  • Downloads (Last 6 weeks)54
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media