Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2487788.2487916acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
demonstration

Large-scale social-media analytics on stratosphere

Published: 13 May 2013 Publication History

Abstract

The importance of social-media platforms and online communities - in business as well as public context - is more and more acknowledged and appreciated by industry and researchers alike. Consequently, a wide range of analytics has been proposed to understand, steer, and exploit the mechanics and laws driving their functionality and creating the resulting benefits. However, analysts usually face significant problems in scaling existing and novel approaches to match the data volume and size of modern online communities. In this work, we propose and demonstrate the usage of the massively parallel data processing system Stratosphere, based on second order functions as an extended notion of the MapReduce paradigm, to provide a new level of scalability to such social-media analytics. Based on the popular example of role analysis, we present and illustrate how this massively parallel approach can be leveraged to scale out complex data-mining tasks, while providing a programming approach that eases the formulation of complete analytical workflows.

References

[1]
D. Battré, S. Ewen, F. Hueske, O. Kao, V. Markl, and D. Warneke. Nephele/PACTs: A Programming Model and Execution Framework for Web-Scale Analytical Processing. In Symposium on Cloud Computing, 2010.
[2]
J. Chan, C. Hayes, and E. M. Daly. Decomposing discussion forums and boards using user roles. In ICWSM, 2010.
[3]
J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, pages 137--150, 2004.
[4]
S. Ewen, S. Schelter, K. Tzoumas, D. Warneke, and V. Markl. Iterative parallel data processing with stratosphere: An inside look. SIGMOD, 2013.
[5]
E. Friedman, P. Pawlowski, and J. Cieslewicz. Sql/mapreduce: a practical approach to self-describing, polymorphic, and parallelizable user-defined functions. Proc. VLDB Endow., 2(2):1402--1413, 2009.
[6]
F. Hueske, M. Peters, A. Krettek, M. Ringwald, K. Tzoumas, V. Markl, and J.-C. Freytag. Peeking into the Optimization of Data Flow Programs with MapReduce-style UDFs. In ICDE, 2013.
[7]
M. Leich, J. Adamek, M. Schubotz, A. Heise, A. Rheinländer, and V. Markl. Applying Stratosphere for Big Data Analytics. In BTW, 2013.
[8]
M. Rowe, M. Fernandez, S. Angeletou, and H. Alani. Community analysis through semantic rules and role composition derivation. JWS, 18(1), 2012.

Cited By

View all
  • (2018)Simulation of Power Line Communication Using OPNET for Vertical Fish FarmSmart Grid Test Bed Using OPNET and Power Line Communication10.4018/978-1-5225-2776-3.ch008(139-164)Online publication date: 2018
  • (2017)MongoDB-Based Modular Ontology Building for Big Data IntegrationJournal on Data Semantics10.1007/s13740-017-0081-z7:1(1-27)Online publication date: 27-Oct-2017
  • (2017)A Keyword-Based Big Data Analysis for Individualized Health Activity Using Keyword Analysis Technique: A Methodological Approach Using National Health DataAdvances in Computer Science and Ubiquitous Computing10.1007/978-981-10-7605-3_197(1237-1243)Online publication date: 20-Dec-2017
  • Show More Cited By

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '13 Companion: Proceedings of the 22nd International Conference on World Wide Web
May 2013
1636 pages
ISBN:9781450320382
DOI:10.1145/2487788

Sponsors

  • NICBR: Nucleo de Informatcao e Coordenacao do Ponto BR
  • CGIBR: Comite Gestor da Internet no Brazil

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. behaviour analysis
  2. boards.ie
  3. community analysis
  4. online communities
  5. role analysis
  6. scalability
  7. stratosphere

Qualifiers

  • Demonstration

Conference

WWW '13
Sponsor:
  • NICBR
  • CGIBR
WWW '13: 22nd International World Wide Web Conference
May 13 - 17, 2013
Rio de Janeiro, Brazil

Acceptance Rates

WWW '13 Companion Paper Acceptance Rate 831 of 1,250 submissions, 66%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Simulation of Power Line Communication Using OPNET for Vertical Fish FarmSmart Grid Test Bed Using OPNET and Power Line Communication10.4018/978-1-5225-2776-3.ch008(139-164)Online publication date: 2018
  • (2017)MongoDB-Based Modular Ontology Building for Big Data IntegrationJournal on Data Semantics10.1007/s13740-017-0081-z7:1(1-27)Online publication date: 27-Oct-2017
  • (2017)A Keyword-Based Big Data Analysis for Individualized Health Activity Using Keyword Analysis Technique: A Methodological Approach Using National Health DataAdvances in Computer Science and Ubiquitous Computing10.1007/978-981-10-7605-3_197(1237-1243)Online publication date: 20-Dec-2017
  • (2017)“BOMEST” a Vital Approach to Extract the Propitious Information from the Big DataInformation and Communication Technology for Sustainable Development10.1007/978-981-10-3932-4_46(441-448)Online publication date: 8-Nov-2017
  • (2017)Managing Modular Ontology Evolution Under Big Data IntegrationInformation Systems10.1007/978-3-319-65930-5_2(17-28)Online publication date: 15-Aug-2017
  • (2016)Big Data IntegrationProcedia Computer Science10.1016/j.procs.2016.08.09996:C(446-455)Online publication date: 1-Oct-2016
  • (2015)Big Data Stream Analytics for Near Real-Time Sentiment AnalysisJournal of Computer and Communications10.4236/jcc.2015.3502403:05(189-195)Online publication date: 2015
  • (2015)Reference Architecture and Classification of Technologies, Products and Services for Big Data SystemsBig Data Research10.1016/j.bdr.2015.01.0012:4(166-186)Online publication date: 1-Dec-2015
  • (2014)The Stratosphere platform for big data analyticsThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-014-0357-y23:6(939-964)Online publication date: 1-Dec-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media