Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2513591.2513640acmotherconferencesArticle/Chapter ViewAbstractPublication PagesideasConference Proceedingsconference-collections
research-article

Read optimisations for append storage on flash

Published: 09 October 2013 Publication History

Abstract

Append-/Log-based Storage Managers (LbSM) for database systems represent a good match for the characteristics and behaviour of Flash technology. LbSM alleviate random writes reducing the impact of Flash read/write asymmetry, increasing endurance and performance. A recently proposed combination of Multi-Versioning database approaches and LbSM called SIAS [9] offers further benefits: it substantially lowers the write rate due to tuple version append granularity and therefore improves the performance. In SIAS a page contains versions of tuples of the same table. Once appended such a page is immutable. The only allowable operations are reads (lookups, scans, version visibility checks) in tuple version granularity. Optimising for them offers an essential performance increase. In the present work-in-progress paper we propose two types of read optimisations: Multi-Version Index and Ordered Log Storage.
Benefits of Ordered Log Storage: (i) Read efficiency due to the use of parallel read streams; (ii) Write efficiency since larger amounts of data are appended sequentially; (iii) fast garbage collection: read multiple sorted runs, filter dead tuples and write one single, large (combined) sorted run. (iv) possible cache-efficiency optimisations (for large scans)
Benefits of Multi-Version Indexing: (i) index only visibility checks; (ii) postponing of index reorganisations; (iii) no invalid tuple bits in the index (in-place updates); (iv) pre-filtering of invisible tuple versions; (v) facilitate easy identification of tuple versions to be garbage collected.
Benefits of the combination of both approaches: (i) Index and ordered access; (ii) Facilitate range searches in sorted runs; (iii) on the fly garbage collection (checking of one bit).

References

[1]
D. Agrawal, D. Ganesan, R. Sitaraman, Y. Diao, and S. Singh. Lazy-adaptive tree: An optimized index structure for flash devices. Proceedings of the VLDB Endowment, 2(1): 361--372, 2009.
[2]
H. Berenson, P. Bernstein, J. Gray, J. Melton, E. O'Neil, and P. O'Neil. A critique of ansi sql isolation levels. In Proc. SIGMOD'95, pages 1--10, 1995.
[3]
P. Bober and M. Carey. On mixing queries and transactions via multiversion locking. In Proc. IEEE CS Intl. Conf. No. 8 on Data Engineering, Tempe, AZ, feb 1992.
[4]
M. J. Carey and W. A. Muhanna. The performance of multiversion concurrency control algorithms. ACM Trans. on Computer Sys., 4(4): 338, Nov. 1986.
[5]
A. Chan, S. Fox, W.-T. K. Lin, A. Nori, and D. R. Ries. The implementation of an integrated concurrency control and recovery scheme. In Proc. SIGMOD'82, June 1982.
[6]
Database Test Suite DBT2. http://osdldbt.sourceforge.net.
[7]
R. Gottstein, I. Petrov, and A. Buchmann. SI-CV: Snapshot isolation with co-located versions. In Proc. TPCTC'11, pages 123--136, 2012.
[8]
R. Gottstein, I. Petrov, and A. Buchmann. Aspects of append-based database storage management on flash memories. In Proc. of DBKDA 2013, pages 116--120. IARIA, 2013.
[9]
R. Gottstein, I. Petrov, and A. P. Buchmann. Append storage in multi-version databases on flash. In BNCOD, pages 62--76, 2013.
[10]
T. Haapasalo, I. Jaluta, B. Seeger, S. Sippu, and E. Soisalon-Soininen. Transactions on the multiversion b+-tree. In Proc. EDBT '09, pages 1064--1075, 2009.
[11]
S. Hardock, I. Petrov, R. Gottstein, and A. Buchmann. Noftl: Database systems on ftl-less flash storage. In 39th International Conference on Very Large Databases (VLDB). VLDB 2013, 2013.
[12]
J. Krueger, C. Kim, M. Grund, N. Satish, D. Schwalb, J. Chhugani, H. Plattner, P. Dubey, and A. Zeier. Fast updates on read-optimized databases using multi-core cpus. Proc. VLDB Endow., 5(1): 61--72, Sept. 2011.
[13]
Y. Li, B. He, R. J. Yang, Q. Luo, and K. Yi. Tree indexing on solid state drives. Proceedings of the VLDB Endowment 3(1--2): 1195--1206, 2010.
[14]
D. Majumdar. A quick survey of multiversion concurrency algorithms.
[15]
P. O'Neil, E. Cheng, D. Gawlick, and E. O'Neil. The log-structured merge-tree, 1996.
[16]
I. Petrov, R. Gottstein, T. Ivanov, D. Bausch, and A. P. Buchmann. Page size selection for OLTP databases on SSD storage. JIDM, 2(1): 11--18, 2011.
[17]
R. Sears and R. Ramakrishnan. blsm: a general purpose log structured merge tree. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, SIGMOD '12, pages 217--228, New York, NY, USA, 2012. ACM.
[18]
Seekwatcher. https://oss.oracle.com/mason/seekwatcher.
[19]
A. Silberschatz, H. F. Korth, and S. Sudarshan. Database Systems Concepts. McGraw-Hill Higher Education, 4th edition, 2001.
[20]
R. Stoica, M. Athanassoulis, R. Johnson, and A. Ailamaki. Evaluating and repairing write performance on flash devices. In Proc. DaMoN, pages 9--14, 2009.
[21]
M. Stonebraker, L. A. Rowe, and M. Hirohama. The implementation of postgres. IEEE Trans. on Knowledge and Data Eng., 2(1): 125, Mar. 1990.
[22]
A. Twigg, A. Byde, G. Milos, T. Moreton, J. Wilkes, and T. Wilkie. Stratified b-trees and versioning dictionaries. arXiv preprint arXiv:1103.4282, 2011.
[23]
S. Wu and B. Kemme. Postgres-r(si): Combining replica control with concurrency control based on snapshot isolation. In Proc. ICDE '05, ICDE '05, pages 422--433, 2005.

Cited By

View all
  • (2014)MV-IDXProceedings of the 18th International Database Engineering & Applications Symposium10.1145/2628194.2628911(142-148)Online publication date: 7-Jul-2014

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
IDEAS '13: Proceedings of the 17th International Database Engineering & Applications Symposium
October 2013
222 pages
ISBN:9781450320252
DOI:10.1145/2513591
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • UPC: Technical University of Catalunya
  • BytePress
  • Concordia University: Concordia University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 October 2013

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

IDEAS '13
Sponsor:
  • UPC
  • Concordia University

Acceptance Rates

IDEAS '13 Paper Acceptance Rate 9 of 51 submissions, 18%;
Overall Acceptance Rate 74 of 210 submissions, 35%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2014)MV-IDXProceedings of the 18th International Database Engineering & Applications Symposium10.1145/2628194.2628911(142-148)Online publication date: 7-Jul-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media