Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2886444.2886478dlproceedingsArticle/Chapter ViewAbstractPublication PagescasconConference Proceedingsconference-collections
research-article

To default or not to default: exposing limitations to HBase cluster deployers

Published: 02 November 2015 Publication History

Abstract

With the advent of sensor networks and portable devices, data has been produced rapidly and in great amount. As a result storing and processing Big Data, in combination with the advances in cloud and virtual infrastructures, pose interesting challenges. In our previous work, we studied these challenges with various experiments around different HBase cluster configurations and their impact on the performance of the cluster. A by-product of our experiments was that, in spite of advances in tooling support to set up and configure a Big Data cluster, the various tools are not always aligned to produce the optimal or near-optimal performance for data clusters. More specifically, we show that the default configuration values of state-of-the-art cluster deployers, including Cloudera, IBM BigInsights, Apache Hortonworks and the manual HBase deployment, do not take in to account the underlying infrastructure resulting in subpar performance.

References

[1]
Apache HBase âĎć Reference Guide - hbase-env.sh. https://hbase.apache.org/book.html#hbase_env. Last accessed: 24-Jun-2015.
[2]
Hortonworks Data Platform - Recommended Memory Configurations for the MapReduce Service. http://docs.hortonworks.com/HDPDocuments/Ambari-1.6.1.0/bk_ambari_reference/content/mem-configs-mapReduce.html. Last accessed: 24-Jun-2015.
[3]
IBM InfoSphere BigInsights Version 3.0 - General HBase tuning. https://www-01.ibm.com/support/knowledgecenter/SSPT3X_3.0.0/com.ibm.swg.im.infosphere.biginsights.analyze.doc/doc/bigsql_gentune.html. Last accessed: 24-Jun-2015.
[4]
A. Rabkin and R. H. Katz. How hadoop clusters break. Software, IEEE, 30(4): 88--94, 2013.
[5]
B. T. Rao, N. Sridevi, V. K. Reddy, and L. Reddy. Performance issues of heterogeneous hadoop clusters in cloud computing. arXiv preprint arXiv:1207.0894, 2012.
[6]
R. Sandel, M. Shtern, M. Fokaefs, and M. Litoiu. Evaluating cluster configurations for big data processing: An exploratory study. In 2015 Symposium on the Maintenance and Evolution of Service-Oriented Systems and Cloud-Based Environments. IEEE, 2015. (accepted).
[7]
M. Shtern, R. Mian, M. Litoiu, S. Zareian, H. Abdelgawad, and A. Tizghadam. Towards a multi-cluster analytical engine for transportation data. In Cloud and Autonomic Computing (ICCAC), 2014 International Conference on, pages 249--257. IEEE, 2014.

Cited By

View all
  • (2018)Adaptation as a serviceProceedings of the 28th Annual International Conference on Computer Science and Software Engineering10.5555/3291291.3291321(282-288)Online publication date: 29-Oct-2018

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
CASCON '15: Proceedings of the 25th Annual International Conference on Computer Science and Software Engineering
November 2015
409 pages

Publisher

IBM Corp.

United States

Publication History

Published: 02 November 2015

Author Tags

  1. big data
  2. cloud computing
  3. design
  4. migration
  5. performance modelling and evaluation

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 24 of 90 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Adaptation as a serviceProceedings of the 28th Annual International Conference on Computer Science and Software Engineering10.5555/3291291.3291321(282-288)Online publication date: 29-Oct-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media