Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2538542.2538561acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Fourier-assisted machine learning of hard disk drive access time models

Published: 17 November 2013 Publication History

Abstract

Predicting access times is a crucial part of predicting hard disk drive performance. Existing approaches use white-box modeling and require intimate knowledge of the internal layout of the drive, which can take months to extract. Automatically learning this behavior is a much more desirable approach, requiring less expert knowledge, fewer assumptions, and less time. Others have created behavioral models of hard disk drive performance, but none have shown low per-request errors. A barrier to machine learning of access times has been the existence of periodic behavior with high, unknown frequencies. We show how hard disk drive access times can be predicted to within 0:83 ms using a neural net after these frequencies are found using Fourier analysis.

References

[1]
E. Anderson, S. Spence, R. Swaminathan, M. Kallahalla, and Q. Wang. Quickly finding near-optimal storage designs. ACM Transactions on Computer Systems (TOCS), 23(4):337--374, 2005.
[2]
L. Breiman. Bagging predictors. Machine learning, 24(2):123--140, 1996.
[3]
J. S. Bucy, J. Schindler, S. W. Schlosser, G. R. Ganger, and Contributors. The DiskSim Simulation Environment Version 4.0 Reference Manual. Carnegie Mellon University, Pittsburgh, PA, May 2008.
[4]
Y. Chen, W. W. Hsu, and H. C. Young. Logging RAID - an approach to fast, reliable, and low-cost disk arrays. In A. Bode, T. Ludwig, W. Karl, and R. Wismüller, editors, Euro-Par 2000 Parallel Processing, volume 1900 of Lecture Notes in Computer Science, pages 1302--1311. Springer Berlin Heidelberg, 2000.
[5]
C. Dai, G. Liu, L. Zhang, and E. Chen. Storage device performance prediction with hybrid regression models. In PDCAT'12, Beijing, China, December 2012.
[6]
D. Elizondo, G. Hoogenboom, and R. McClendon. Development of a neural network model to predict daily solar radiation. Agricultural and Forest Meteorology, 71(1):115--132, 1994.
[7]
S. E. Fahlman and C. Lebiere. The cascade-correlation learning architecture. Technical Report CMU-CS-90-100, School of Computer Science, Carnegie Mellon University, February 1990.
[8]
S. Ferrari and R. F. Stengel. Smooth function approximation using neural networks. Neural Networks, IEEE Transactions on, 16(1):24--38, 2005.
[9]
J. Garcia, L. Prada, J. Fernandez, A. Nunez, and J. Carretero. Using black-box modeling techniques for modern disk drives service time simulation. In Simulation Symposium, 2008. ANSS 2008. 41st Annual, pages 139--145, april 2008.
[10]
M. Gardner and S. Dorling. Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in london. Atmospheric Environment, 33(5):709--719, 1999.
[11]
J. M. Gauch. Ch4 - fourier transform. http://www. csce.uark.edu/~jgauch/5683/notes/ch04a.pdf.
[12]
J. Gim and Y. Won. Extract and infer quickly: Obtaining sector geometry of modern hard disk drives. Trans. Storage, 6:6:1--6:26, July 2010.
[13]
A. Guez and Z. Ahmad. Solution to the inverse kinematics problem in robotics by neural networks. In Neural Networks, 1988., IEEE International Conference on, pages 617--624. IEEE, 1988.
[14]
M. T. Hagan and M. B. Menhaj. Training feedforward networks with the marquardt algorithm. Neural Networks, IEEE Transactions on, 5(6):989--993, 1994.
[15]
H. Huang, W. Hung, and K. G. Shin. Fs2: dynamic data replication in free disk space for improving disk performance and energy consumption. ACM SIGOPS Operating Systems Review, 39(5):263--276, 2005.
[16]
S. Jagannathan and F. L. Lewis. Multilayer discrete-time neural-net controller with guaranteed performance. Neural Networks, IEEE Transactions on, 7(1):107--130, 1996.
[17]
T. Kelly, I. Cohen, M. Goldszmidt, and K. Keeton. Inducing models of black-box storage arrays. Technical Report HPL-2004-108, HP Laboratories, Palo Alto, CA, June 2004.
[18]
L. Kindermann, A. Lewandowski, and P. Protzel. A framework for solving functional equations with neural networks. In Proceedings of Neural Information Processing (ICONIP2001), volume 2, pages 1075--1078. Fudan University Press, Shanghai, 2001.
[19]
K. Kosanovich, A. Gurumoorthy, E. Sinzinger, and M. Piovoso. Improving the extrapolation capability of neural networks. In Intelligent Control, 1996., Proceedings of the 1996 IEEE International Symposium on, pages 390--395. IEEE, 1996.
[20]
J. Kwon, B. Coifman, and P. Bickel. Day-to-day travel-time trends and travel-time prediction from loop-detector data. Transportation Research Record: Journal of the Transportation Research Board, 1717(1):120--129, 2000.
[21]
A. S. Lebrecht, N. J. Dingle, and W. J. Knottenbelt. A performance model of zoned disk drives with I/O request reordering. In Proceedings of the 2009 Sixth International Conference on the Quantitative Evaluation of Systems, QEST '09, pages 97--106, Washington, DC, USA, 2009. IEEE Computer Society.
[22]
N. Liu, J. Cope, P. Carns, C. Carothers, R. Ross, G. Grider, A. Crume, and C. Maltzahn. On the role of burst buffers in leadership-class storage systems. In MSST/SNAPI 2012, Pacific Grove, CA, April 16 - 20 2012.
[23]
Y. Liu, R. Figueiredo, D. Clavijo, Y. Xu, and M. Zhao. Towards simulation of parallel file system scheduling algorithms with pfssim. In Proceedings of the 7th IEEE International Workshop on Storage Network Architectures and Parallel I/O (May 2011), 2011.
[24]
M. P. Mesnier, M. Wachs, R. R. Sambasivan, A. X. Zheng, and G. R. Ganger. Modeling the relative fitness of storage. In SIGMETRICS 2007, 2007.
[25]
G. Noone and S. D. Howard. Investigation of periodic time series using neural networks and adaptive error thresholds. In Neural Networks, 1995. Proceedings., IEEE International Conference on, volume 4, pages 1541--1545. IEEE, 1995.
[26]
B. E. Rosen. Ensemble learning using decorrelated neural networks. Connection Science, 8(3-4):373--384, 1996.
[27]
Y. Shang and B. W. Wah. Global optimization for neural network training. Computer, 29(3):45--54, 1996.
[28]
J. M. Sopena, E. Romero, and R. Alquezar. Neural networks with periodic and monotonic activation functions: a comparative study in classification problems. In Artificial Neural Networks, 1999. ICANN 99. Ninth International Conference on (Conf. Publ. No. 470), volume 1, pages 323--328. IET, 1999.
[29]
D. F. Specht and P. Shapiro. Generalization accuracy of probabilistic neural networks compared with backpropagation networks. In Neural Networks, 1991., IJCNN-91-Seattle International Joint Conference on, volume i, pages 887--892 vol.1, 1991.
[30]
M. Wang, K. Au, A. Ailamaki, A. Brockwell, C. Faloutsos, and G. R. Ganger. Storage device performance prediction with cart models. In MASCOTS 2004, 2004.
[31]
K.-W. Wong, C.-S. Leung, and S.-J. Chang. Use of periodic and monotonic activation functions in multilayer feedforward neural networks trained by extended kalman filter algorithm. In Vision, Image and Signal Processing, IEE Proceedings-, volume 149, pages 217--224. IET, 2002.
[32]
Z. Zainuddin and O. Pauline. Function approximation using artificial neural networks. WSEAS Transactions on Mathematics, 7(6):333--338, 2008.
[33]
L. Zhang, G. Liu, X. Zhang, S. Jiang, and E. Chen. Storage device performance prediction with selective bagging classification and regression tree. Network and Parallel Computing, pages 121--133, 2010.

Cited By

View all
  • (2019)Support for Provisioning and Configuration Decisions for Data Intensive WorkflowsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.249769327:9(2725-2739)Online publication date: 1-Jan-2019
  • (2016)Impact of data placement on resilience in large-scale object storage systems2016 32nd Symposium on Mass Storage Systems and Technologies (MSST)10.1109/MSST.2016.7897091(1-12)Online publication date: 2016
  • (2014)Automatic generation of behavioral hard disk drive access time models2014 30th Symposium on Mass Storage Systems and Technologies (MSST)10.1109/MSST.2014.6855553(1-11)Online publication date: Jun-2014

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
PDSW '13: Proceedings of the 8th Parallel Data Storage Workshop
November 2013
55 pages
ISBN:9781450325059
DOI:10.1145/2538542
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 November 2013

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

SC13

Acceptance Rates

PDSW '13 Paper Acceptance Rate 8 of 16 submissions, 50%;
Overall Acceptance Rate 17 of 41 submissions, 41%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Support for Provisioning and Configuration Decisions for Data Intensive WorkflowsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.249769327:9(2725-2739)Online publication date: 1-Jan-2019
  • (2016)Impact of data placement on resilience in large-scale object storage systems2016 32nd Symposium on Mass Storage Systems and Technologies (MSST)10.1109/MSST.2016.7897091(1-12)Online publication date: 2016
  • (2014)Automatic generation of behavioral hard disk drive access time models2014 30th Symposium on Mass Storage Systems and Technologies (MSST)10.1109/MSST.2014.6855553(1-11)Online publication date: Jun-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media