Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1989323.1989433acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
keynote

Managing scientific data: lessons, challenges, and opportunities

Published: 12 June 2011 Publication History

Abstract

Today's scientific processes heavily depend on fast and accurate analysis of experimental data. Scientists are routinely overwhelmed by the effort needed to manage the volumes of data produced either by observing phenomena or by sophisticated simulations. As database systems have proven inefficient, inadequate, or insufficient to meet the needs of scientific applications, the scientific community typically uses special-purpose legacy software. When compared to a general-purpose DBMS, however, application-specific systems require more resources to maintain, and in order to achieve acceptable performance they often sacrifice data independence and hinder the reuse of knowledge. Nowadays, scientific datasets are growing at unprecedented rates, a result of increasing complexity of the simulated models and ever-improving instrument precision; consequently, scientists' queries become more sophisticated as they try to interpret the data correctly. Datasets and scientific query complexity are likely to continue to grow indefinitely, rendering legacy systems increasingly inadequate. To respond to the challenge, the data management community aspires to solve scientific data management problems by carefully examining the problems of scientific applications and by developing special- or general-purpose scientific data management techniques and systems. This talk discusses the work of teams around the world in an effort to surface the most critical requirements of such an undertaking, and the technological innovations needed to satisfy them.

Supplementary Material

JPG File (p1045-sigmod110616.jpg)
FLV File (p1045-sigmod110616.flv)

Cited By

View all
  • (2022)Workload Aware Cost-Based Partial Loading of Raw Data for Limited Storage ResourcesFuturistic Trends in Networks and Computing Technologies10.1007/978-981-19-5037-7_74(1035-1048)Online publication date: 16-Nov-2022
  • (2020)Adding domain data to code profiling tools to debug workflow parallel executionFuture Generation Computer Systems10.1016/j.future.2018.05.078110(422-439)Online publication date: Sep-2020
  • (2015)A novel approach to user-steering in scientific workflows2015 IEEE 10th Jubilee International Symposium on Applied Computational Intelligence and Informatics10.1109/SACI.2015.7208205(233-236)Online publication date: May-2015
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
June 2011
1364 pages
ISBN:9781450306614
DOI:10.1145/1989323

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 June 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. astronomy
  2. brain simulations
  3. earthquake simulations

Qualifiers

  • Keynote

Conference

SIGMOD/PODS '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Workload Aware Cost-Based Partial Loading of Raw Data for Limited Storage ResourcesFuturistic Trends in Networks and Computing Technologies10.1007/978-981-19-5037-7_74(1035-1048)Online publication date: 16-Nov-2022
  • (2020)Adding domain data to code profiling tools to debug workflow parallel executionFuture Generation Computer Systems10.1016/j.future.2018.05.078110(422-439)Online publication date: Sep-2020
  • (2015)A novel approach to user-steering in scientific workflows2015 IEEE 10th Jubilee International Symposium on Applied Computational Intelligence and Informatics10.1109/SACI.2015.7208205(233-236)Online publication date: May-2015
  • (2015)Dynamic steering of HPC scientific workflowsFuture Generation Computer Systems10.1016/j.future.2014.11.01746:C(100-113)Online publication date: 1-May-2015
  • (2015)Running Multi-relational Data Mining Processes in the Cloud: A Practical Approach for Social NetworksHigh Performance Computing10.1007/978-3-319-26928-3_1(3-18)Online publication date: 12-Dec-2015
  • (2013)User-steering of HPC workflowsProceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies10.1145/2499896.2499900(1-6)Online publication date: 23-Jun-2013
  • (2012)GiST scan acceleration using coprocessorsProceedings of the Eighth International Workshop on Data Management on New Hardware10.1145/2236584.2236593(63-69)Online publication date: 21-May-2012

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media