Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A storage advisor for hybrid-store databases

Published: 01 August 2012 Publication History

Abstract

With the SAP HANA database, SAP offers a high-performance in-memory hybrid-store database. Hybrid-store databases---that is, databases supporting row- and column-oriented data management---are getting more and more prominent. While the columnar management offers high-performance capabilities for analyzing large quantities of data, the row-oriented store can handle transactional point queries as well as inserts and updates more efficiently. To effectively take advantage of both stores at the same time the novel question whether to store the given data row- or column-oriented arises. We tackle this problem with a storage advisor tool that supports database administrators at this decision. Our proposed storage advisor recommends the optimal store based on data and query characteristics; its core is a cost model to estimate and compare query execution times for the different stores. Besides a per-table decision, our tool also considers to horizontally and vertically partition the data and manage the partitions on different stores. We evaluated the storage advisor for the use in the SAP HANA database; we show the recommendation quality as well as the benefit of having the data in the optimal store with respect to increased query performance.

References

[1]
S. Agrawal, S. Chaudhuri, L. Kollar, A. Marathe, V. Narasayya, and M. Syamala. Database Tuning Advisor for Microsoft SQL Server 2005: Demo. In Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pages 930--932, 2005.
[2]
S. Agrawal, S. Chaudhuri, and V. R. Narasayya. Automated Selection of Materialized Views and Indexes in SQL Databases. In Proceedings of the 26th International Conference on Very Large Data Bases, pages 496--505, 2000.
[3]
S. Agrawal, V. Narasayya, and B. Yang. Integrating Vertical and Horizontal Partitioning Into Automated Physical Database Design. In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, pages 359--370, 2004.
[4]
A. Ailamaki, D. J. DeWitt, M. D. Hill, and M. Skounakis. Weaving Relations for Cache Performance. In Proceedings of the 27th International Conference on Very Large Data Bases, pages 169--180, 2001.
[5]
P. A. Boncz, S. Manegold, and M. L. Kersten. Database Architecture Optimized for the new Bottleneck: Memory Access. In Proceedings of the 25th International Confernce on Very Large Data Bases, pages 54--65, 1999.
[6]
P. A. Boncz, M. Zukowski, and N. Nes. MonetDB/X100: Hyper-Pipelining Query Execution. In Proceedings of the 2nd Biennial Conference on Innovative Data Systems Research, pages 225--237, 2005.
[7]
S. Chaudhuri and V. R. Narasayya. An Efficient Cost-Driven Index Selection Tool for Microsoft SQL Server. In Proceedings of the 23rd International Conference on Very Large Data Bases, pages 146--155, 1997.
[8]
F. Faerber, S. K. Cha, J. Primsch, C. Bornhoevd, S. Sigg, and W. Lehner. SAP HANA Database - Data Management for Modern Business Applications. ACM Sigmod Record, 40(4):45--51, 2011.
[9]
M. Grund, J. Krüger, H. Plattner, A. Zeier, P. Cudre-Mauroux, and S. Madden. HYRISE: A Main Memory Hybrid Storage Engine. Proceedings of the 37th International Conference on Very Large Data Bases, pages 105--116, 2010.
[10]
H. Gupta and I. Mumick. Selection of Views to Materialize in a Data Warehouse. Proceedings of the 6th International Conference on Database Theory, pages 98--112, 1997.
[11]
Y. He, R. Lee, Y. Huai, Z. Shao, N. Jain, X. Zhang, and Z. Xu. RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems. In In Proceedings of the 27th International Conference on Data Engineering, pages 1199--1208, 2011.
[12]
C. Lemke, K.-U. Sattler, F. Faerber, and A. Zeier. Speeding Up Queries in Column Stores: A Case for Compression. In Proceedings of the 12th International Conference on Data Warehousing and Knowledge Discovery, pages 117--129, 2010.
[13]
H. Plattner. A Common Database Approach for OLTP and OLAP Using an In-Memory Column Database. In Proceedings of the 35th SIGMOD international conference on Management of data, pages 1--2, 2009.
[14]
R. Ramamurthy, D. J. DeWitt, and Q. Su. A Case for Fractured Mirrors. In Proceedings of the 28th International Conference on Very Large Databases, pages 89--101, 2002.
[15]
V. Raman, G. Swart, L. Qiao, F. Reiss, V. Dialani, D. Kossmann, I. Narang, and R. Sidle. Constant-Time Query Processing. In Proceedings of the 24th International Conference on Data Engineering, pages 60--69, 2008.
[16]
R. Winter. Scaling The Data Warehouse. InformationWeek: Research & Reports, http://www.informationweek.com/news/211100262, 2008.
[17]
D. C. Zilio, J. Rao, S. Lightstone, G. Lohman, A. Storm, C. Garcia-Arellano, and S. Fadden. DB2 Design Advisor: Integrated Automatic Physical Database Design. In Proceedings of the 30th International Conference on Very Large Data Bases, pages 1087--1097, 2004.

Cited By

View all
  • (2019)FASTProceedings of Real-Time Business Intelligence and Analytics10.1145/3350489.3350490(1-10)Online publication date: 26-Aug-2019
  • (2018)GridFormationProceedings of the First International Workshop on Exploiting Artificial Intelligence Techniques for Data Management10.1145/3211954.3211956(1-7)Online publication date: 10-Jun-2018
  • (2017)Access Path Selection in Main-Memory Optimized Data SystemsProceedings of the 2017 ACM International Conference on Management of Data10.1145/3035918.3064049(715-730)Online publication date: 9-May-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 5, Issue 12
August 2012
340 pages

Publisher

VLDB Endowment

Publication History

Published: 01 August 2012
Published in PVLDB Volume 5, Issue 12

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)FASTProceedings of Real-Time Business Intelligence and Analytics10.1145/3350489.3350490(1-10)Online publication date: 26-Aug-2019
  • (2018)GridFormationProceedings of the First International Workshop on Exploiting Artificial Intelligence Techniques for Data Management10.1145/3211954.3211956(1-7)Online publication date: 10-Jun-2018
  • (2017)Access Path Selection in Main-Memory Optimized Data SystemsProceedings of the 2017 ACM International Conference on Management of Data10.1145/3035918.3064049(715-730)Online publication date: 9-May-2017
  • (2017)Hybrid storage architecture and efficient MapReduce processing for unstructured dataParallel Computing10.1016/j.parco.2017.08.00869:C(63-77)Online publication date: 1-Nov-2017
  • (2016)Regularized Cost-Model Oblivious Database Tuning with Reinforcement LearningTransactions on Large-Scale Data- and Knowledge-Centered Systems XXVIII - Volume 994010.1007/978-3-662-53455-7_5(96-132)Online publication date: 1-Jun-2016
  • (2016)Storing and Querying DICOM Data with HYTORMOProceedings of the Second International Workshop on Data Management and Analytics for Medicine and Healthcare - Volume 1018610.1007/978-3-319-57741-8_4(43-61)Online publication date: 9-Sep-2016
  • (2014)H2OProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2610502(1103-1114)Online publication date: 18-Jun-2014
  • (2013)Medical data management in the SYSEO projectACM SIGMOD Record10.1145/2536669.253667842:3(48-53)Online publication date: 17-Oct-2013

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media