  • Li S, Cao Q, Wan S, Xia W and Xie C. (2023). gPPM: A Generalized Matrix Operation and Parallel Algorithm to Accelerate the Encoding/Decoding Process of Erasure Codes. ACM Transactions on Architecture and Code Optimization. 20:4. (1-25). Online publication date: 31-Dec-2024.

    https://doi.org/10.1145/3625005

  • George A, Hanley J and Oral S. Disk Failure Trends in Alpine Storage System. Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis. (502-506).

    https://doi.org/10.1145/3624062.3624119

  • Han R, Gatla O, Zheng M, Cao J, Zhang D, Dai D, Chen Y and Cook J. (2022). A Study of Failure Recovery and Logging of High-Performance Parallel File Systems. ACM Transactions on Storage. 18:2. (1-44). Online publication date: 31-May-2022.

    https://doi.org/10.1145/3483447

  • Magalhaes A, Monteiro J and Brayner A. (2021). Main Memory Database Recovery. ACM Computing Surveys. 54:2. (1-36). Online publication date: 31-Mar-2022.

    https://doi.org/10.1145/3442197

  • Torquato M, Maciel P and Vieira M. (2022). Model-Based Performability and Dependability Evaluation of a System with VM Migration as Rejuvenation in the Presence of Bursty Workloads. Journal of Network and Systems Management. 30:1. Online publication date: 1-Jan-2022.

    https://doi.org/10.1007/s10922-021-09619-3

  • Shan Y, Chen K, Gong T, Zhou L, Zhou T and Wu Y. Geometric Partitioning. Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles. (457-471).

    https://doi.org/10.1145/3477132.3483558

  • Hong D, Ha K, Ko M, Chun M, Kim Y, Lee S and Kim J. (2021). Reparo: A Fast RAID Recovery Scheme for Ultra-large SSDs. ACM Transactions on Storage. 17:3. (1-24). Online publication date: 31-Aug-2021.

    https://doi.org/10.1145/3450977

  • Wu R, Chen J, Huang P, Wang L, Wang M and Wu Y. An efficient RAID scaling scheme for RS-RAID6. 2021 2nd International Conference on Artificial Intelligence and Information Systems. (1-6).

    https://doi.org/10.1145/3469213.3470260

  • Cini N and Yalcin G. (2020). A Methodology for Comparing the Reliability of GPU-Based and CPU-Based HPCs. ACM Computing Surveys. 53:1. (1-33). Online publication date: 31-Jan-2021.

    https://doi.org/10.1145/3372790

  • Bender M and Kuszmaul W. Randomized cup game algorithms against strong adversaries. Proceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms. (2059-2077).

    /doi/10.5555/3458064.3458187

  • Chakraborttii C and Litz H. Improving the accuracy, adaptability, and interpretability of SSD failure prediction models. Proceedings of the 11th ACM Symposium on Cloud Computing. (120-133).

    https://doi.org/10.1145/3419111.3421300

  • Brooker M, Chen T and Ping F. Millions of tiny databases. Proceedings of the 17th Usenix Conference on Networked Systems Design and Implementation. (463-478).

    /doi/10.5555/3388242.3388276

  • Lu S, Luo B, Patel T, Yao Y, Tiwari D and Shi W. Making disk failure predictions SMARTer!. Proceedings of the 18th USENIX Conference on File and Storage Technologies. (151-168).

    /doi/10.5555/3386691.3386706

  • Maneas S, Mahdaviani K, Emami T and Schroeder B. A study of SSD reliability in large scale enterprise storage deployments. Proceedings of the 18th USENIX Conference on File and Storage Technologies. (137-150).

    /doi/10.5555/3386691.3386705

  • Jauk D, Yang D and Schulz M. Predicting faults in high performance computing systems. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-13).

    https://doi.org/10.1145/3295500.3356185

  • Alter J, Xue J, Dimnaku A and Smirni E. SSD failures in the field. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-14).

    https://doi.org/10.1145/3295500.3356172

  • Wang J, Bao W, Zheng L, Zhu X and Yu P. (2019). An Attention-augmented Deep Architecture for Hard Drive Status Monitoring in Large-scale Storage Systems. ACM Transactions on Storage. 15:3. (1-26). Online publication date: 20-Aug-2019.

    https://doi.org/10.1145/3340290

  • Xu G, Feng D, Tan Z, Zhang X, Xu J, Shu X and Zhu Y. RFPL. Proceedings of the 48th International Conference on Parallel Processing. (1-10).

    https://doi.org/10.1145/3337821.3337887

  • Zhang J, Zhou K, Huang P, He X, Xiao Z, Cheng B, Ji Y and Wang Y. Transfer Learning based Failure Prediction for Minority Disks in Large Data Centers of Heterogeneous Disk Systems. Proceedings of the 48th International Conference on Parallel Processing. (1-10).

    https://doi.org/10.1145/3337821.3337881

  • Xie P, Yuan Z, Huang J and Qin X. N-Code. Proceedings of the 48th International Conference on Parallel Processing. (1-10).

    https://doi.org/10.1145/3337821.3337829

  • Xu E, Zheng M, Qin F, Xu Y and Wu J. Lessons and actions. Proceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference. (961-975).

    /doi/10.5555/3358807.3358890

  • Panda B, Srinivasan D, Ke H, Gupta K, Khot V and Gunawi H. IASO. Proceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference. (47-61).

    /doi/10.5555/3358807.3358812

  • Huang X, Wu C and Li J. OPS. Proceedings of the 3rd International Conference on High Performance Compilation, Computing and Communications. (139-148).

    https://doi.org/10.1145/3318265.3318274

  • Kadekodi S, Rashmi K and Ganger G. Cluster storage systems gotta have HeART. Proceedings of the 17th USENIX Conference on File and Storage Technologies. (345-358).

    /doi/10.5555/3323298.3323331

  • Di S, Guo H, Gupta R, Pershey E, Snir M and Cappello F. (2019). Exploring Properties and Correlations of Fatal Events in a Large-Scale HPC System. IEEE Transactions on Parallel and Distributed Systems. 30:2. (361-374). Online publication date: 1-Feb-2019.

    https://doi.org/10.1109/TPDS.2018.2864184

  • Gatla O, Zheng M, Hameed M, Dubeyko V, Manzanares A, Blagojevic F, Guyot C and Mateescu R. (2018). Towards Robust File System Checkers. ACM Transactions on Storage. 14:4. (1-25). Online publication date: 15-Dec-2018.

    https://doi.org/10.1145/3281031

  • Meza J, Xu T, Veeraraghavan K and Mutlu O. A Large Scale Study of Data Center Network Reliability. Proceedings of the Internet Measurement Conference 2018. (393-407).

    https://doi.org/10.1145/3278532.3278566

  • Nie B, Yang L, Jog A and Smirni E. Fault site pruning for practical reliability analysis of GPGPU applications. Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture. (749-761).

    https://doi.org/10.1109/MICRO.2018.00066

  • Shahrad M, Mosenia A, Song L, Chiang M, Wentzlaff D and Mittal P. Acoustic Denial of Service Attacks on Hard Disk Drives. Proceedings of the 2018 Workshop on Attacks and Solutions in Hardware Security. (34-39).

    https://doi.org/10.1145/3266444.3266448

  • Shan Y, Huang Y, Chen Y and Zhang Y. LegoOS. Proceedings of the 13th USENIX conference on Operating Systems Design and Implementation. (69-87).

    /doi/10.5555/3291168.3291175

  • Gunawi H, Suminto R, Sears R, Golliher C, Sundararaman S, Lin X, Emami T, Sheng W, Bidokhti N, McCaffrey C, Srinivasan D, Panda B, Baptist A, Grider G, Fields P, Harms K, Ross R, Jacobson A, Ricci R, Webb K, Alvaro P, Runesha H, Hao M and Li H. (2018). Fail-Slow at Scale. ACM Transactions on Storage. 14:3. (1-26). Online publication date: 31-Aug-2018.

    https://doi.org/10.1145/3242086

  • Pan W and Xie T. (2018). A Mirroring-Assisted Channel-RAID5 SSD for Mobile Applications. ACM Transactions on Embedded Computing Systems. 17:4. (1-27). Online publication date: 29-Aug-2018.

    https://doi.org/10.1145/3209625

  • Xiao J, Xiong Z, Wu S, Yi Y, Jin H and Hu K. Disk Failure Prediction in Data Centers via Online Learning. Proceedings of the 47th International Conference on Parallel Processing. (1-10).

    https://doi.org/10.1145/3225058.3225106

  • Cao J, Gatla O, Zheng M, Dai D, Eswarappa V, Mu Y and Chen Y. PFault. Proceedings of the 2018 International Conference on Supercomputing. (1-11).

    https://doi.org/10.1145/3205289.3205302

  • Hong H, Khan L and Xiaojuan L. SAT-based Important Data Reliability Enhancement Model for Big Data Storage. Proceedings of the 3rd International Conference on Big Data and Computing. (20-26).

    https://doi.org/10.1145/3220199.3220220

  • Song X, Peng X, Xu J, Shi G and Wu F. (2018). Unequal Error Protection for Scalable Video Storage in the Cloud. IEEE Transactions on Multimedia. 20:3. (699-710). Online publication date: 1-Mar-2018.

    https://doi.org/10.1109/TMM.2017.2751147

  • Vajha M, Ramkumar V, Puranik B, Kini G, Lobo E, Sasidharan B, Kumar P, Barg A, Ye M, Narayanamurthy S, Hussain S and Nandi S. Clay codes. Proceedings of the 16th USENIX Conference on File and Storage Technologies. (139-153).

    /doi/10.5555/3189759.3189773

  • Gatla O, Hameed M, Zheng M, Dubeyko V, Manzanares A, Blagojevic F, Guyot C and Mateescu R. Towards robust file system checkers. Proceedings of the 16th USENIX Conference on File and Storage Technologies. (105-121).

    /doi/10.5555/3189759.3189770

  • Gunawi H, Suminto R, Sears R, Golliher C, Sundararaman S, Lin X, Emami T, Sheng W, Bidokhti N, McCaffrey C, Grider G, Fields P, Harms K, Ross R, Jacobson A, Ricci R, Webb K, Alvaro P, Runesha H, Hao M and Li H. Fail-slow at scale. Proceedings of the 16th USENIX Conference on File and Storage Technologies. (1-14).

    /doi/10.5555/3189759.3189761

  • Hameed M, Haider S and Kantarci B. (2017). Performance impacts of hybrid cloud storage. Computing. 99:12. (1207-1229). Online publication date: 1-Dec-2017.

    https://doi.org/10.1007/s00607-017-0560-y

  • Hu Y, Li X, Zhang M, Lee P, Zhang X, Zhou P and Feng D. (2017). Optimal Repair Layering for Erasure-Coded Data Centers. ACM Transactions on Storage. 13:4. (1-24). Online publication date: 30-Nov-2017.

    https://doi.org/10.1145/3149349

  • Chen R, Akkus I, Viswanath B, Rimac I and Hilt V. Towards Reliable Application Deployment in the Cloud. Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies. (464-477).

    https://doi.org/10.1145/3143361.3143388

  • Gupta S, Patel T, Engelmann C and Tiwari D. Failures in large scale systems. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1145/3126908.3126937

  • Xu J, Zhang L, Memaripour A, Gangadharaiah A, Borase A, Da Silva T, Swanson S and Rudoff A. NOVA-Fortis. Proceedings of the 26th Symposium on Operating Systems Principles. (478-496).

    https://doi.org/10.1145/3132747.3132761

  • Jung H, Park Y, Song C and Kang S. (2017). PCS. Cluster Computing. 20:3. (2655-2668). Online publication date: 1-Sep-2017.

    https://doi.org/10.1007/s10586-017-0805-8

  • Ganesan A, Alagappan R, Arpaci-Dusseau A and Arpaci-Dusseau R. (2017). Redundancy Does Not Imply Fault Tolerance. ACM Transactions on Storage. 13:3. (1-33). Online publication date: 31-Aug-2017.

    https://doi.org/10.1145/3125497

  • Jang E, Johnson M, Burnell E and Heimerl K. Unplanned Obsolescence. Proceedings of the 2017 Workshop on Computing Within Limits. (93-101).

    https://doi.org/10.1145/3080556.3080566

  • Liu Q, Feng D, Hu Y, Shi Z and Fu M. (2017). High-Performance General Functional Regenerating Codes with Near-Optimal Repair Bandwidth. ACM Transactions on Storage. 13:2. (1-28). Online publication date: 10-Jun-2017.

    https://doi.org/10.1145/3051122

  • Gaber S, Ben-Harush O and Savir A. Predicting HDD failures from compound SMART attributes. Proceedings of the 10th ACM International Systems and Storage Conference. (1-1).

    https://doi.org/10.1145/3078468.3081875

  • Akash G, Lee O, Kumar S, Chandran P and Cuzzocrea A. RAPID. Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. (890-897).

    https://doi.org/10.1109/CCGRID.2017.115

  • Kishani M, Eftekhari R and Asadi H. Evaluating impact of human errors on the availability of data storage systems. Proceedings of the Conference on Design, Automation & Test in Europe. (314-317).

    /doi/10.5555/3130379.3130452

  • Shen Z, Shu J, Lee P and Fu Y. (2017). Seek-Efficient I/O Optimization in Single Failure Recovery for XOR-Coded Storage Systems. IEEE Transactions on Parallel and Distributed Systems. 28:3. (877-890). Online publication date: 1-Mar-2017.

    https://doi.org/10.1109/TPDS.2016.2591040

  • Kumar H, Patel Y, Kesavan R and Makam S. High-performance metadata integrity protection in the WAFL copy-on-write file system. Proceedings of the 15th Usenix Conference on File and Storage Technologies. (197-211).

    /doi/10.5555/3129633.3129651

  • Ganesan A, Alagappan R, Arpaci-Dusseau A and Arpaci-Dusseau R. Redundancy does not imply fault tolerance. Proceedings of the 15th Usenix Conference on File and Storage Technologies. (149-165).

    /doi/10.5555/3129633.3129648

  • Heinrich R, Merkle P, Henss J and Paech B. (2017). Integrating business process simulation and information system simulation for performance prediction. Software and Systems Modeling (SoSyM). 16:1. (257-277). Online publication date: 1-Feb-2017.

    https://doi.org/10.1007/s10270-015-0457-1

  • Zheng M, Tucek J, Qin F, Lillibridge M, Zhao B and Yang E. (2016). Reliability Analysis of SSDs Under Power Fault. ACM Transactions on Computer Systems. 34:4. (1-28). Online publication date: 16-Jan-2017.

    https://doi.org/10.1145/2992782

  • Fu Y, Shu J, Luo X, Shen Z and Hu Q. (2017). Short Code. IEEE Transactions on Computers. 66:1. (127-137). Online publication date: 1-Jan-2017.

    https://doi.org/10.1109/TC.2016.2576461

  • Cao J, Wang S, Dai D, Zheng M and Chen Y. A generic framework for testing parallel file systems. Proceedings of the 1st Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems. (49-54).

    /doi/10.5555/3019046.3019055

  • Bautista-Gomez L, Zyulkyarov F, Unsal O and McIntosh-Smith S. Unprotected computing. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-11).

    /doi/10.5555/3014904.3014978

  • Jiang Y, Chen H, Qin F, Xu C, Ma X and Lu J. Crash consistency validation made easy. Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering. (133-143).

    https://doi.org/10.1145/2950290.2950327

  • Xu C, Wang G, Liu X, Guo D and Liu T. (2016). Health Status Assessment and Failure Prediction for Hard Drives with Recurrent Neural Networks. IEEE Transactions on Computers. 65:11. (3502-3508). Online publication date: 1-Nov-2016.

    https://doi.org/10.1109/TC.2016.2538237

  • Cano I, Aiyar S and Krishnamurthy A. Characterizing Private Clouds. Proceedings of the Seventh ACM Symposium on Cloud Computing. (29-41).

    https://doi.org/10.1145/2987550.2987584

  • Gunawi H, Hao M, Suminto R, Laksono A, Satria A, Adityatama J and Eliazar K. Why Does the Cloud Stop Computing?. Proceedings of the Seventh ACM Symposium on Cloud Computing. (1-16).

    https://doi.org/10.1145/2987550.2987583

  • Shahrad M and Wentzlaff D. Availability Knob. Proceedings of the Seventh ACM Symposium on Cloud Computing. (42-56).

    https://doi.org/10.1145/2987550.2987556

  • Hall R. (2016). Tools for Predicting the Reliability of Large-Scale Storage Systems. ACM Transactions on Storage. 12:4. (1-30). Online publication date: 29-Aug-2016.

    https://doi.org/10.1145/2911987

  • Wu S, Mao B, Chen X and Jiang H. (2016). LDM. ACM Transactions on Storage. 12:4. (1-21). Online publication date: 29-Aug-2016.

    https://doi.org/10.1145/2892639

  • Botezatu M, Giurgiu I, Bogojeska J and Wiesmann D. Predicting Disk Replacement towards Reliable Data Centers. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. (39-48).

    https://doi.org/10.1145/2939672.2939699

  • Wang W, Xie T and Sharma A. (2016). SWANS. ACM Transactions on Storage. 12:3. (1-21). Online publication date: 27-Jun-2016.

    https://doi.org/10.1145/2756555

  • Narayanan I, Wang D, Jeon M, Sharma B, Caulfield L, Sivasubramaniam A, Cutler B, Liu J, Khessib B and Vaid K. SSD Failures in Datacenters. Proceedings of the 9th ACM International on Systems and Storage Conference. (1-11).

    https://doi.org/10.1145/2928275.2928278

  • Li S, Cao Q, Wan S, Qian L and Xie C. (2016). HRSPC. Journal of Network and Computer Applications. 66:C. (52-63). Online publication date: 1-May-2016.

    https://doi.org/10.1016/j.jnca.2015.12.012

  • Yao J, Jiang H, Cao Q, Tian L and Xie C. (2016). Elastic-RAID: A New Architecture for Improved Availability of Parity-Based RAIDs by Elastic Mirroring. IEEE Transactions on Parallel and Distributed Systems. 27:4. (1044-1056). Online publication date: 1-Apr-2016.

    https://doi.org/10.1109/TPDS.2015.2432808

  • Grawinkel M, Nagel L and Brinkmann A. (2016). LoneStar RAID. ACM Transactions on Storage. 12:1. (1-29). Online publication date: 26-Feb-2016.

    https://doi.org/10.1145/2840810

  • Hao M, Soundararajan G, Kenchammana-Hosekote D, Chien A and Gunawi H. The tail at store. Proceedings of the 14th Usenix Conference on File and Storage Technologies. (263-276).

    /doi/10.5555/2930583.2930603

  • Schroeder B, Lagisetty R and Merchant A. Flash reliability in production. Proceedings of the 14th Usenix Conference on File and Storage Technologies. (67-80).

    /doi/10.5555/2930583.2930589

  • Deng M, Ou Y, Xiao N, Yu S, Chen W, Chen Z and Liu F. RAID-6Plus. Proceedings of the 9th Asia-Pacific Services Computing Conference on Advances in Services Computing - Volume 9464. (210-221).

    https://doi.org/10.1007/978-3-319-26979-5_15

  • Ma A, Traylor R, Douglis F, Chamness M, Lu G, Sawyer D, Chandra S and Hsu W. (2015). RAIDShield. ACM Transactions on Storage. 11:4. (1-28). Online publication date: 21-Nov-2015.

    https://doi.org/10.1145/2820615

  • Tiwari D, Gupta S, Gallarno G, Rogers J and Maxwell D. Reliability lessons learned from GPU experience with the Titan supercomputer at Oak Ridge leadership computing facility. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1145/2807591.2807666

  • Wan L, Wang F, Oral S, Tiwari D, Vazhkudai S and Cao Q. A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1145/2807591.2807615

  • Cui L, Wo T, Li B, Li J, Shi B and Huai J. (2015). PARS. ACM SIGPLAN Notices. 50:7. (215-228). Online publication date: 25-Aug-2015.

    https://doi.org/10.1145/2817817.2731190

  • Li R, Lin J and Lee P. (2015). Enabling Concurrent Failure Recovery for Regenerating-Coding-Based Storage Systems: From Theory to Practice. IEEE Transactions on Computers. 64:7. (1898-1911). Online publication date: 1-Jul-2015.

    https://doi.org/10.1109/TC.2014.2349518

  • Meza J, Wu Q, Kumar S and Mutlu O. (2015). A Large-Scale Study of Flash Memory Failures in the Field. ACM SIGMETRICS Performance Evaluation Review. 43:1. (177-190). Online publication date: 24-Jun-2015.

    https://doi.org/10.1145/2796314.2745848

  • Meza J, Wu Q, Kumar S and Mutlu O. A Large-Scale Study of Flash Memory Failures in the Field. Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. (177-190).

    https://doi.org/10.1145/2745844.2745848

  • Rosà A, Chen L, Birke R and Binder W. (2015). Demystifying Casualties of Evictions in Big Data Priority Scheduling. ACM SIGMETRICS Performance Evaluation Review. 42:4. (12-21). Online publication date: 2-Jun-2015.

    https://doi.org/10.1145/2788402.2788406

  • Zhou H, Lou J, Zhang H, Lin H, Lin H and Qin T. An empirical study on quality issues of production big data platform. Proceedings of the 37th International Conference on Software Engineering - Volume 2. (17-26).

    /doi/10.5555/2819009.2819014

  • Rosà A, Chen L and Binder W. Predicting and mitigating jobs failures in big data clusters. Proceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing. (221-230).

    https://doi.org/10.1109/CCGrid.2015.139

  • Lee E, Oh Y and Lee D. SSD caching to overcome small write problem of disk-based RAID in enterprise environments. Proceedings of the 30th Annual ACM Symposium on Applied Computing. (2047-2053).

    https://doi.org/10.1145/2695664.2695886

  • Chen T, Wei H, Yeh T, Hsu T and Shih W. (2015). An Energy-Efficient and Reliable Storage Mechanism for Data-Intensive Academic Archive Systems. ACM Transactions on Storage. 11:2. (1-21). Online publication date: 24-Mar-2015.

    https://doi.org/10.1145/2720021

  • Cui L, Wo T, Li B, Li J, Shi B and Huai J. PARS. Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments. (215-228).

    https://doi.org/10.1145/2731186.2731190

  • Ma A, Douglis F, Lu G, Sawyer D, Chandra S and Hsu W. RAIDShield. Proceedings of the 13th USENIX Conference on File and Storage Technologies. (241-256).

    /doi/10.5555/2750482.2750501

  • Rashmi K, Nakkiran P, Wang J, Shah N and Ramchandran K. Having your cake and eating it too. Proceedings of the 13th USENIX Conference on File and Storage Technologies. (81-94).

    /doi/10.5555/2750482.2750489

  • Li M and Lee P. (2014). STAIR Codes. ACM Transactions on Storage. 10:4. (1-30). Online publication date: 31-Oct-2014.

    https://doi.org/10.1145/2658991

  • Kim Y, Daly R, Kim J, Fallin C, Lee J, Lee D, Wilkerson C, Lai K and Mutlu O. (2014). Flipping bits in memory without accessing them. ACM SIGARCH Computer Architecture News. 42:3. (361-372). Online publication date: 16-Oct-2014.

    https://doi.org/10.1145/2678373.2665726

  • Zheng M, Tucek J, Huang D, Qin F, Lillibridge M, Yang E, Zhao B and Singh S. Torturing databases for fun and profit. Proceedings of the 11th USENIX conference on Operating Systems Design and Implementation. (449-464).

    /doi/10.5555/2685048.2685083

  • Sampson A, Nelson J, Strauss K and Ceze L. (2014). Approximate Storage in Solid-State Memories. ACM Transactions on Computer Systems. 32:3. (1-23). Online publication date: 23-Sep-2014.

    https://doi.org/10.1145/2644808

  • Qin A, Hu D, Liu J, Yang W and Tan D. (2014). Fatman. Proceedings of the VLDB Endowment. 7:13. (1748-1753). Online publication date: 1-Aug-2014.

    https://doi.org/10.14778/2733004.2733078

  • Luo M, Dang N and Yokota H. A fragmented data-declustering strategy for high skew tolerance and efficient failure recovery. Proceedings of the 18th International Database Engineering & Applications Symposium. (227-236).

    https://doi.org/10.1145/2628194.2628254

  • Buhnova B, Chren S and Fabriková L. Failure data collection for reliability prediction models. Proceedings of the 10th international ACM Sigsoft conference on Quality of software architectures. (83-92).

    https://doi.org/10.1145/2602576.2602586

  • Kim Y, Daly R, Kim J, Fallin C, Lee J, Lee D, Wilkerson C, Lai K and Mutlu O. Flipping bits in memory without accessing them. Proceedings of the 41st annual international symposium on Computer architecture. (361-372).

    /doi/10.5555/2665671.2665726

  • Floratou A, Bertsch F, Patel J and Laskaris G. (2014). Towards building wind tunnels for data center design. Proceedings of the VLDB Endowment. 7:9. (781-784). Online publication date: 1-May-2014.

    https://doi.org/10.14778/2732939.2732950

  • Elerath J and Schindler J. (2014). Beyond MTTDL. ACM Transactions on Storage. 10:2. (1-21). Online publication date: 1-Mar-2014.

    https://doi.org/10.1145/2577386

  • Li M and Lee P. STAIR codes. Proceedings of the 12th USENIX conference on File and Storage Technologies. (147-162).

    /doi/10.5555/2591305.2591320

  • Plank J and Blaum M. (2014). Sector-Disk (SD) Erasure Codes for Mixed Failure Modes in RAID Systems. ACM Transactions on Storage. 10:1. (1-17). Online publication date: 1-Jan-2014.

    https://doi.org/10.1145/2560013

  • Callou G, Maciel P, Tutsch D, Ferreira J, Araújo J and Souza R. (2013). Estimating sustainability impact of high dependable data centers. Computing. 95:12. (1137-1170). Online publication date: 1-Dec-2013.

    https://doi.org/10.1007/s00607-013-0328-y

  • Kasick M, Narasimhan P and Harms K. Making problem diagnosis work for large-scale, production storage systems. Proceedings of the 27th USENIX conference on Large Installation System Administration. (27-44).

    /doi/10.5555/2717477.2717481

  • Kasick M, Narasimhan P and Harms K. Making problem diagnosis work for large-scale, production storage systems. Proceedings of the 27th international conference on Large Installation System Administration. (27-44).

    /doi/10.5555/2555492.2555496

  • Potharaju R and Jain N. When the network crumbles. Proceedings of the 4th annual Symposium on Cloud Computing. (1-17).

    https://doi.org/10.1145/2523616.2523638

  • Do T, Hao M, Leesatapornwongsa T, Patana-anake T and Gunawi H. Limplock. Proceedings of the 4th annual Symposium on Cloud Computing. (1-14).

    https://doi.org/10.1145/2523616.2523627

  • Lazarova-Molnar S. Petri nets extension to model state-varying failure rates. Proceedings of the 2013 Summer Computer Simulation Conference. (1-8).

    /doi/10.5555/2557696.2557730

  • Sankar S, Shaw M, Vaid K and Gurumurthi S. (2013). Datacenter Scale Evaluation of the Impact of Temperature on Hard Disk Drive Failures. ACM Transactions on Storage. 9:2. (1-24). Online publication date: 1-Jul-2013.

    https://doi.org/10.1145/2491472.2491475

  • Rozier E, Zhou P and Divine D. Building intelligence for software defined data centers. Proceedings of the 6th International Systems and Storage Conference. (1-10).

    https://doi.org/10.1145/2485732.2485752

  • Levy S, Dosanjh M, Bridges P and Ferreira K. Using unreliable virtual hardware to inject errors in extreme-scale systems. Proceedings of the 3rd Workshop on Fault-tolerance for HPC at extreme scale. (21-26).

    https://doi.org/10.1145/2465813.2465820

  • Huang C, Chen M and Li J. (2013). Pyramid Codes. ACM Transactions on Storage. 9:1. (1-28). Online publication date: 1-Mar-2013.

    https://doi.org/10.1145/2435204.2435207

  • Zheng M, Tucek J, Qin F and Lillibridge M. Understanding the robustness of SSDs under power fault. Proceedings of the 11th USENIX conference on File and Storage Technologies. (271-284).

    /doi/10.5555/2591272.2591300

  • Plank J, Blaum M and Hafner J. SD codes. Proceedings of the 11th USENIX conference on File and Storage Technologies. (95-104).

    /doi/10.5555/2591272.2591283

  • Juels A and Oprea A. (2013). New approaches to security and availability for cloud data. Communications of the ACM. 56:2. (64-73). Online publication date: 1-Feb-2013.

    https://doi.org/10.1145/2408776.2408793

  • Wu S, Jiang H and Mao B. IDO. Proceedings of the 26th international conference on Large Installation System Administration: strategies, tools, and techniques. (17-32).

    /doi/10.5555/2432523.2432525

  • Luo X and Shu J. (2012). Generalized X-code. ACM Transactions on Storage. 8:3. (1-16). Online publication date: 1-Sep-2012.

    https://doi.org/10.1145/2339118.2339121

  • Tran N, Chiang F and Li J. (2012). Efficient cooperative backup with decentralized trust management. ACM Transactions on Storage. 8:3. (1-25). Online publication date: 1-Sep-2012.

    https://doi.org/10.1145/2339118.2339119

  • Stewart R, Trinder P and Maier P. Supervised Workpools for Reliable Massively Parallel Computing. Proceedings of the 2012 Conference on Trends in Functional Programming - Volume 7829. (247-262).

    https://doi.org/10.1007/978-3-642-40447-4_16

  • El-Sayed N, Stefanovici I, Amvrosiadis G, Hwang A and Schroeder B. Temperature management in data centers. Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems. (163-174).

    https://doi.org/10.1145/2254756.2254778

  • El-Sayed N, Stefanovici I, Amvrosiadis G, Hwang A and Schroeder B. (2012). Temperature management in data centers. ACM SIGMETRICS Performance Evaluation Review. 40:1. (163-174). Online publication date: 7-Jun-2012.

    https://doi.org/10.1145/2318857.2254778

  • Li Y, Miller E and Long D. Understanding data survivability in archival storage systems. Proceedings of the 5th Annual International Systems and Storage Conference. (1-12).

    https://doi.org/10.1145/2367589.2367605

  • Laptev N, Zeng K and Zaniolo C. (2012). Early accurate results for advanced analytics on MapReduce. Proceedings of the VLDB Endowment. 5:10. (1028-1039). Online publication date: 1-Jun-2012.

    https://doi.org/10.14778/2336664.2336675

  • Adams I, Storer M and Miller E. (2012). Analysis of Workload Behavior in Scientific and Historical Long-Term Data Repositories. ACM Transactions on Storage. 8:2. (1-27). Online publication date: 1-May-2012.

    https://doi.org/10.1145/2180905.2180907

  • Kim C, Park K and Park K. GHOST. Proceedings of the 2012 International Workshop on Programming Models and Applications for Multicores and Manycores. (17-26).

    https://doi.org/10.1145/2141702.2141705

  • Khan O, Burns R, Plank J, Pierce W and Huang C. Rethinking erasure codes for cloud file systems. Proceedings of the 10th USENIX conference on File and Storage Technologies. (20-20).

    /doi/10.5555/2208461.2208481

  • Mao B, Jiang H, Wu S, Tian L, Feng D, Chen J and Zeng L. (2012). HPDA. ACM Transactions on Storage. 8:1. (1-20). Online publication date: 1-Feb-2012.

    https://doi.org/10.1145/2093139.2093143

  • Tantisiriroj W, Son S, Patil S, Lang S, Gibson G and Ross R. On the duality of data-intensive file system design. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1145/2063384.2063474

  • Chen Y and Sion R. To cloud or not to cloud?. Proceedings of the 2nd ACM Symposium on Cloud Computing. (1-7).

    https://doi.org/10.1145/2038916.2038945

  • Joshi P, Gunawi H and Sen K. PREFAIL. Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications. (171-188).

    https://doi.org/10.1145/2048066.2048082

  • Gill P, Jain N and Nagappan N. (2011). Understanding network failures in data centers. ACM SIGCOMM Computer Communication Review. 41:4. (350-361). Online publication date: 22-Oct-2011.

    https://doi.org/10.1145/2043164.2018477

  • Joshi P, Gunawi H and Sen K. (2011). PREFAIL. ACM SIGPLAN Notices. 46:10. (171-188). Online publication date: 18-Oct-2011.

    https://doi.org/10.1145/2076021.2048082

  • Xiang L, Xu Y, Lui J, Chang Q, Pan Y and Li R. (2011). A Hybrid Approach to Failed Disk Recovery Using RAID-6 Codes. ACM Transactions on Storage. 7:3. (1-34). Online publication date: 1-Oct-2011.

    https://doi.org/10.1145/2027066.2027071

  • Gill P, Jain N and Nagappan N. Understanding network failures in data centers. Proceedings of the ACM SIGCOMM 2011 conference. (350-361).

    https://doi.org/10.1145/2018436.2018477

  • Sztandera L. Spare parts allocation. Proceedings of the 15th WSEAS international conference on Computers. (245-249).

    /doi/10.5555/2028299.2028347

  • Iliadis I, Haas R, Hu X and Eleftheriou E. (2011). Disk Scrubbing Versus Intradisk Redundancy for RAID Storage Systems. ACM Transactions on Storage. 7:2. (1-42). Online publication date: 1-Jul-2011.

    https://doi.org/10.1145/1970348.1970350

  • Tian L, Cao Q, Jiang H, Feng D, Xie C and Xin Q. (2011). Online availability upgrades for parity-based RAIDs through supplementary parity augmentations. ACM Transactions on Storage. 6:4. (1-23). Online publication date: 1-May-2011.

    https://doi.org/10.1145/1970338.1970341

  • Nightingale E, Douceur J and Orgovan V. Cycles, cells and platters. Proceedings of the sixth conference on Computer systems. (343-356).

    https://doi.org/10.1145/1966445.1966477

  • Deng Y. (2011). What is the future of disk drives, death or rebirth? ACM Computing Surveys. 43:3. (1-27). Online publication date: 1-Apr-2011.

    https://doi.org/10.1145/1922649.1922660

  • Schroeder B, Pinheiro E and Weber W. (2011). DRAM errors in the wild. Communications of the ACM. 54:2. (100-107). Online publication date: 1-Feb-2011.

    https://doi.org/10.1145/1897816.1897844

  • Li M, Vazhkudai S, Butt A, Meng F, Ma X, Kim Y, Engelmann C and Shipman G. Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures. Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis. (1-12).

    https://doi.org/10.1109/SC.2010.28

  • Rosenthal D. (2010). Keeping bits safe. Communications of the ACM. 53:11. (47-55). Online publication date: 1-Nov-2010.

    https://doi.org/10.1145/1839676.1839692

  • Ford D, Labelle F, Popovici F, Stokely M, Truong V, Barroso L, Grimes C and Quinlan S. Availability in globally distributed storage systems. Proceedings of the 9th USENIX conference on Operating systems design and implementation. (61-74).

    /doi/10.5555/1924943.1924948

  • Du Y, Yu H, Jiang Y, Dong Y and Zheng W. A rising tide lifts all boats. Proceedings of the Sixth international conference on Hot topics in system dependability.

    /doi/10.5555/1924908.1924916

  • Rosenthal D. (2010). Keeping Bits Safe: How Hard Can It Be? Queue. 8:10. (10-22). Online publication date: 1-Oct-2010.

    https://doi.org/10.1145/1866296.1866298

  • Tan Y, Gu X and Wang H. Adaptive system anomaly prediction for large-scale hosting infrastructures. Proceedings of the 29th ACM SIGACT-SIGOPS symposium on Principles of distributed computing. (173-182).

    https://doi.org/10.1145/1835698.1835741

  • Brosch F, Koziolek H, Buhnova B and Reussner R. Parameterized reliability prediction for component-based software architectures. Proceedings of the 6th international conference on Quality of Software Architectures: research into Practice - Reality and Gaps. (36-51).

    https://doi.org/10.1007/978-3-642-13821-8_5

  • Bui H, Wright D, Helm C, Witty R, Flynn P and Thain D. Towards long term data quality in a large scale biometrics experiment. Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing. (565-572).

    https://doi.org/10.1145/1851476.1851559

  • Xiang L, Xu Y, Lui J and Chang Q. Optimal recovery of single disk failure in RDP code storage systems. Proceedings of the ACM SIGMETRICS international conference on Measurement and modeling of computer systems. (119-130).

    https://doi.org/10.1145/1811039.1811054

  • Xiang L, Xu Y, Lui J and Chang Q. (2010). Optimal recovery of single disk failure in RDP code storage systems. ACM SIGMETRICS Performance Evaluation Review. 38:1. (119-130). Online publication date: 12-Jun-2010.

    https://doi.org/10.1145/1811099.1811054

  • Vishwanath K and Nagappan N. Characterizing cloud computing hardware reliability. Proceedings of the 1st ACM symposium on Cloud computing. (193-204).

    https://doi.org/10.1145/1807128.1807161

  • Lang W, Patel J and Shankar S. Wimpy node clusters. Proceedings of the Sixth International Workshop on Data Management on New Hardware. (47-55).

    https://doi.org/10.1145/1869389.1869396

  • Fountoulakis M, Marazakis M, Flouris M and Bilas A. DARC. Proceedings of the 3rd Annual Haifa Experimental Systems Conference. (1-12).

    https://doi.org/10.1145/1815695.1815721

  • Brandt J, Chen F, De Sapio V, Gentile A, Mayo J, Pébay P, Roe D, Thompson D and Wong M. Using Cloud Constructs and Predictive Analysis to Enable Pre-Failure Process Migration in HPC Systems. Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing. (703-708).

    https://doi.org/10.1109/CCGRID.2010.31

  • Kadirvel S and Fortes J. Towards IT systems capable of managing their health. Proceedings of the 16th Monterey conference on Foundations of computer software: modeling, development, and verification of adaptive systems. (77-102).

    /doi/10.5555/2023011.2023016

  • Oprea A and Juels A. A clean-slate look at disk scrubbing. Proceedings of the 8th USENIX conference on File and storage technologies. (5-5).

    /doi/10.5555/1855511.1855516

  • Matos M, Sousa A, Pereira J, Oliveira R, Deliot E and Murray P. CLON. Proceedings of the Confederated International Conferences, CoopIS, DOA, IS, and ODBASE 2009 on On the Move to Meaningful Internet Systems: Part I. (549-566).

    https://doi.org/10.1007/978-3-642-05148-7_41

  • Clement A, Kapritsos M, Lee S, Wang Y, Alvisi L, Dahlin M and Riche T. Upright cluster services. Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles. (277-290).

    https://doi.org/10.1145/1629575.1629602

  • Heydegger V. Just one bit in a million. Proceedings of the 13th European conference on Research and advanced technology for digital libraries. (315-326).

    /doi/10.5555/1812799.1812840

  • Schroeder B, Pinheiro E and Weber W. (2009). DRAM errors in the wild. ACM SIGMETRICS Performance Evaluation Review. 37:1. (193-204). Online publication date: 15-Jun-2009.

    https://doi.org/10.1145/2492101.1555372

  • Chen F, Koufaty D and Zhang X. (2009). Understanding intrinsic characteristics and system implications of flash memory based solid state drives. ACM SIGMETRICS Performance Evaluation Review. 37:1. (181-192). Online publication date: 15-Jun-2009.

    https://doi.org/10.1145/2492101.1555371

  • Schroeder B, Pinheiro E and Weber W. DRAM errors in the wild. Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems. (193-204).

    https://doi.org/10.1145/1555349.1555372

  • Chen F, Koufaty D and Zhang X. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems. (181-192).

    https://doi.org/10.1145/1555349.1555371

  • Vishwanath K, Greenberg A and Reed D. Modular data centers. Proceedings of the 1st ACM workshop on Large-Scale system and application performance. (3-10).

    https://doi.org/10.1145/1552272.1552275

  • Liu C, Gu Y, Sun L, Yan B and Wang D. R-ADMAD. Proceedings of the 23rd international conference on Supercomputing. (370-379).

    https://doi.org/10.1145/1542275.1542327

  • Jin C, Jiang H, Feng D and Tian L. P-Code. Proceedings of the 23rd international conference on Supercomputing. (360-369).

    https://doi.org/10.1145/1542275.1542326

  • Elerath J. (2009). Hard-disk drives. Communications of the ACM. 52:6. (38-45). Online publication date: 1-Jun-2009.

    https://doi.org/10.1145/1516046.1516059

  • Barateiro J, Borbinha J, Antunes G and Freitas F. Challenges on preserving scientific data with data grids. Proceedings of the 1st ACM workshop on Data grids for eScience. (17-22).

    https://doi.org/10.1145/1531786.1531790

  • Narayanan D, Thereska E, Donnelly A, Elnikety S and Rowstron A. Migrating server storage to SSDs. Proceedings of the 4th ACM European conference on Computer systems. (145-158).

    https://doi.org/10.1145/1519065.1519081

  • Wu S, Jiang H, Feng D, Tian L and Mao B. WorkOut. Proceedings of the 7th conference on File and storage technologies. (239-252).

    /doi/10.5555/1525908.1525926

  • Weatherspoon H, Ganesh L, Marian T, Balakrishnan M and Birman K. Smoke and mirrors. Proceedings of the 7th conference on File and storage technologies. (211-224).

    /doi/10.5555/1525908.1525924

  • Anderson E, Arlitt M, Morrey C and Veitch A. (2009). DataSeries. ACM SIGOPS Operating Systems Review. 43:1. (70-75). Online publication date: 1-Jan-2009.

    https://doi.org/10.1145/1496909.1496923

  • Li M, Shu J and Zheng W. (2009). GRID codes. ACM Transactions on Storage. 4:4. (1-22). Online publication date: 1-Jan-2009.

    https://doi.org/10.1145/1480439.1480444

  • Narayanan D, Donnelly A and Rowstron A. (2008). Write off-loading. ACM Transactions on Storage. 4:3. (1-23). Online publication date: 1-Nov-2008.

    https://doi.org/10.1145/1416944.1416949

  • Bairavasundaram L, Arpaci-Dusseau A, Arpaci-Dusseau R, Goodson G and Schroeder B. (2008). An analysis of data corruption in the storage stack. ACM Transactions on Storage. 4:3. (1-28). Online publication date: 1-Nov-2008.

    https://doi.org/10.1145/1416944.1416947

  • Jiang W, Hu C, Zhou Y and Kanevsky A. (2008). Are disks the dominant contributor for storage failures? ACM Transactions on Storage. 4:3. (1-25). Online publication date: 1-Nov-2008.

    https://doi.org/10.1145/1416944.1416946

  • Iliadis I, Haas R, Hu X and Eleftheriou E. (2008). Disk scrubbing versus intra-disk redundancy for high-reliability raid storage systems. ACM SIGMETRICS Performance Evaluation Review. 36:1. (241-252). Online publication date: 12-Jun-2008.

    https://doi.org/10.1145/1384529.1375485

  • Iliadis I, Haas R, Hu X and Eleftheriou E. Disk scrubbing versus intra-disk redundancy for high-reliability raid storage systems. Proceedings of the 2008 ACM SIGMETRICS international conference on Measurement and modeling of computer systems. (241-252).

    https://doi.org/10.1145/1375457.1375485

  • Grottke M, Sun H, Fricks R and Trivedi K. Ten fallacies of availability and reliability analysis. Proceedings of the 5th international conference on Service availability. (187-206).

    /doi/10.5555/1788594.1788615

  • Joukov N and Sipek J. (2008). GreenFS. ACM SIGOPS Operating Systems Review. 42:4. (69-80). Online publication date: 25-Apr-2008.

    https://doi.org/10.1145/1357010.1352600

  • Tran D, Chiang F and Li J. Friendstore. Proceedings of the 1st Workshop on Social Network Systems. (37-42).

    https://doi.org/10.1145/1435497.1435504

  • Joukov N and Sipek J. GreenFS. Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems 2008. (69-80).

    https://doi.org/10.1145/1352592.1352600

  • Bellam K, Manzanares A and Qin X. Energy efficient and reliable storage disks. Proceedings of the 46th annual ACM Southeast Conference. (7-12).

    https://doi.org/10.1145/1593105.1593108

  • Strunk J, Thereska E, Faloutsos C and Ganger G. Using utility to provision storage systems. Proceedings of the 6th USENIX Conference on File and Storage Technologies. (1-16).

    /doi/10.5555/1364813.1364834

  • Narayanan D, Donnelly A and Rowstron A. Write off-loading. Proceedings of the 6th USENIX Conference on File and Storage Technologies. (1-15).

    /doi/10.5555/1364813.1364830

  • Bairavasundaram L, Goodson G, Schroeder B, Arpaci-Dusseau A and Arpaci-Dusseau R. An analysis of data corruption in the storage stack. Proceedings of the 6th USENIX Conference on File and Storage Technologies. (1-16).

    /doi/10.5555/1364813.1364828

  • Jiang W, Hu C, Zhou Y and Kanevsky A. Are disks the dominant contributor for storage failures? Proceedings of the 6th USENIX Conference on File and Storage Technologies. (1-15).

    /doi/10.5555/1364813.1364821

  • Storer M, Greenan K, Miller E and Voruganti K. Pergamum. Proceedings of the 6th USENIX Conference on File and Storage Technologies. (1-16).

    /doi/10.5555/1364813.1364814

  • Gunawi H, Prabhakaran V, Krishnan S, Arpaci-Dusseau A and Arpaci-Dusseau R. (2007). Improving file system reliability with I/O shepherding. ACM SIGOPS Operating Systems Review. 41:6. (293-306). Online publication date: 14-Oct-2007.

    https://doi.org/10.1145/1323293.1294290

  • Gunawi H, Prabhakaran V, Krishnan S, Arpaci-Dusseau A and Arpaci-Dusseau R. Improving file system reliability with I/O shepherding. Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles. (293-306).

    https://doi.org/10.1145/1294261.1294290

  • Schroeder B and Gibson G. (2007). Understanding disk failure rates. ACM Transactions on Storage. 3:3. (8-es). Online publication date: 1-Oct-2007.

    https://doi.org/10.1145/1288783.1288785

  • Zukowski M, Héman S, Nes N and Boncz P. Cooperative scans. Proceedings of the 33rd international conference on Very large data bases. (723-734).

    /doi/10.5555/1325851.1325934

  • Elerath J. (2007). Hard Disk Drives: The Good, the Bad and the Ugly! Queue. 5:6. (28-37). Online publication date: 1-Sep-2007.

    https://doi.org/10.1145/1317394.1317403

  • Hacker T and Meglicki Z. Using queue structures to improve job reliability. Proceedings of the 16th international symposium on High performance distributed computing. (43-54).

    https://doi.org/10.1145/1272366.1272373

  • Kotla R, Alvisi L and Dahlin M. SafeStore. Proceedings of the 2007 USENIX Annual Technical Conference. (1-14).

    /doi/10.5555/1364385.1364395

  • Bairavasundaram L, Goodson G, Pasupathy S and Schindler J. (2007). An analysis of latent sector errors in disk drives. ACM SIGMETRICS Performance Evaluation Review. 35:1. (289-300). Online publication date: 12-Jun-2007.

    https://doi.org/10.1145/1269899.1254917

  • Bairavasundaram L, Goodson G, Pasupathy S and Schindler J. An analysis of latent sector errors in disk drives. Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems. (289-300).

    https://doi.org/10.1145/1254882.1254917

  • Shah M, Baker M, Mogul J and Swaminathan R. Auditing to keep online storage services honest. Proceedings of the 11th USENIX workshop on Hot topics in operating systems. (1-6).

    /doi/10.5555/1361397.1361408