Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3093742.3093902acmconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
extended-abstract

A New Application Benchmark for Data Stream Processing Architectures in an Enterprise Context: Doctoral Symposium

Published: 08 June 2017 Publication History

Abstract

Against the backdrop of ever-growing data volumes and trends like the Internet of Things (IoT) or Industry 4.0, Data Stream Processing Systems (DSPSs) or data stream processing architectures in general receive a greater interest. Continuously analyzing streams of data allows immediate responses to environmental changes. A challenging task in that context is assessing and comparing data stream processing architectures in order to identify the most suitable one for certain settings.
The present paper provides an overview about performance benchmarks that can be used for analyzing data stream processing applications. By describing shortcomings of these benchmarks, the need for a new application benchmark in this area, especially for a benchmark covering enterprise architectures, is highlighted. A key role in such an enterprise context is the combination of streaming data and business data, which is barely covered in current data stream processing benchmarks. Furthermore, first ideas towards the development of a solution, i.e., a new application benchmark that is able to fill the existing gap, are depicted.

References

[1]
Daniel J. Abadi, Don Carney, Ugur Cetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Michael Stonebraker, Nesime Tatbul, and Stan Zdonik. 2003. Aurora: A New Model and Architecture for Data Stream Management. The VLDB journal 12, 2 (Aug. 2003), 120--139.
[2]
Mohamed Amine Abdessemed. 2015. Real-time Data Integration with Apache Flink & Kafka @Bouygues Telecom. http://www.slideshare.net/FlinkForward/mohamed-amine-abdessemed-realtime-data-integration-with-apache-flink-kafka. (2015). Accessed: 2017-04-06.
[3]
Arvind Arasu, Brian Babcock, Shivnath Babu, Mayur Datar, Keith Ito, Itaru Nishizawa, Justin Rosenstein, and Jennifer Widom. 2003. STREAM: The Stanford Stream Data Manager (Demonstration Description). In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data (SIGMOD '03). ACM, New York, NY, USA, 665--665.
[4]
Arvind Arasu, Mitch Cherniack, Eduardo Galvez, David Maier, Anurag S. Maskey, Esther Ryvkina, Michael Stonebraker, and Richard Tibbetts. 2004. Linear Road: A Stream Data Management Benchmark. In Proceedings of the Thirtieth International Conference on Very Large Data Bases - Volume 30 (VLDB '04). VLDB Endowment, 480--491. http://dl.acm.org/citation.cfm?id=1316689.1316732
[5]
T. Dunning and E. Friedman. 2016. Streaming Architecture: New Designs Using Apache Kafka and MapR Streams. O'Reilly Media.
[6]
Guenter Hesse and Martin Lorenz. 2015. Conceptual Survey on Data Stream Processing Systems. In Proceedings of the 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS) (ICPADS '15). IEEE Computer Society, Washington, DC, USA, 797--802.
[7]
Marco F. Huber, Martin Voigt, and Axel-Cyrille Ngonga Ngomo. 2016. Big data architecture for the semantic analysis of complex events in manufacturing. In Informatik 2016, 46. Jahrestagung der Gesellschaft für Informatik, 26-30. September 2016, Klagenfurt, Österreich. 353--360. http://subs.emis.de/LNI/Proceedings/Proceedings259/article173.html
[8]
Jay Kreps, Neha Narkhede, Jun Rao, et al. 2011. Kafka: A distributed messaging system for log processing. In SIGMOD Workshop on Networking Meets Databases.
[9]
Sanjeev Kulkarni, Nikunj Bhagat, Maosong Fu, Vikas Kedigehalli, Christopher Kellogg, Sailesh Mittal, Jignesh M. Patel, Karthik Ramasamy, and Siddarth Taneja. 2015. Twitter Heron: Stream Processing at Scale. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (SIGMOD '15). ACM, New York, NY, USA, 239--250.
[10]
Ruirui Lu, Gang Wu, Bin Xie, and Jingtong Hu. 2014. Stream Bench: Towards Benchmarking Modern Distributed Stream Computing Frameworks. In Proceedings of the 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing (UCC '14). IEEE Computer Society, Washington, DC, USA, 69--78.
[11]
D. A. Menasce. 2002. TPC-W: a benchmark for e-commerce. IEEE Internet Computing 6, 3 (May 2002), 83--87.
[12]
Anshu Shukla, Shilpa Chaturvedi, and Yogesh Simmhan. 2017. RIoTBench: A Real-time IoT Benchmark for Distributed Stream Processing Platforms. CoRR abs/1701.08530 (2017). http://arxiv.org/abs/1701.08530
[13]
Mihail Vieru and Javier López. 2016. Flink in Zalando's World of Microservices. http://www.slideshare.net/ZalandoTech/flink-in-zalandos-world-of-microservices-62376341. (2016). Accessed: 2017-04-06.
[14]
Steven Weiner and David Line. 2014. Manufacturing and the data conundrum - Too much? Too little? Or just right? https://www.eiuperspectives.economist.com/sites/default/files/Manufacturing_Data_Conundrum_Jul14.pdf. (2014). Accessed: 2017-03-01.

Cited By

View all
  • (2021)ESPBench: The Enterprise Stream Processing BenchmarkProceedings of the ACM/SPEC International Conference on Performance Engineering10.1145/3427921.3450242(201-212)Online publication date: 9-Apr-2021
  • (2020)How Fast Can We Insert? An Empirical Performance Evaluation of Apache Kafka2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS)10.1109/ICPADS51040.2020.00089(641-648)Online publication date: Dec-2020

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DEBS '17: Proceedings of the 11th ACM International Conference on Distributed and Event-based Systems
June 2017
393 pages
ISBN:9781450350655
DOI:10.1145/3093742
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 June 2017

Check for updates

Author Tags

  1. Benchmark Development
  2. Data Stream Processing
  3. Internet of Things
  4. Performance Benchmarking
  5. Stream Processing

Qualifiers

  • Extended-abstract
  • Research
  • Refereed limited

Conference

DEBS '17

Acceptance Rates

DEBS '17 Paper Acceptance Rate 22 of 60 submissions, 37%;
Overall Acceptance Rate 145 of 583 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)ESPBench: The Enterprise Stream Processing BenchmarkProceedings of the ACM/SPEC International Conference on Performance Engineering10.1145/3427921.3450242(201-212)Online publication date: 9-Apr-2021
  • (2020)How Fast Can We Insert? An Empirical Performance Evaluation of Apache Kafka2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS)10.1109/ICPADS51040.2020.00089(641-648)Online publication date: Dec-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media