Abstract
Approximate query processing (AQP) technique speeds up query execution by reducing the amount of data that needs to be processed, while sacrificing the accuracy of the query result to some extent. AQP is essentially a trade-off between the accuracy of the query result and the query latency. However, the heuristic AQP optimization and error control mechanism used by the existing AQP system fails to meet the accuracy requirements of users. This paper proposes a deep learning-based error prediction model to guide AQP query optimization. By using this model, we can estimate the errors of candidate query plans and select the appropriate plans that can meet the accuracy requirement with high probability. Extensive experiments show that the AQP system proposed in this paper can outperform the state-of-the-art online sampling-based AQP approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Agarwal, S., Mozafari, B., Panda, A., Milner, H., Madden, S., Stoica, I.: BlinkDB: queries with bounded errors and bounded response times on very large data. In: Eurosys (2013)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Kandula, S., et al.: Quickr: Lazily approximating complex AdHoc queries in bigdata clusters. In: SIGMOD (2016)
Lakshmi, S., Zhou, S.: Selectivity estimation in extensible databases-a neural network approach. In: VLDB, vol. 98, pp. 24–27 (1998)
Li, F., Wu, B., Yi, K., Zhao, Z.: Wander join: Online aggregation via random walks. In: SIGMOD 2016. pp. 615–629. ACM (2016)
Marcus, R.C., Papaemmanouil, O.: Plan-structured deep neural network models for query performance prediction. Proc. VLDB Endow. 12(11), 1733–1746 (2019)
Park, Y., Mozafari, B., Sorenson, J., Wang, J.: VerdictDB: universalizing approximate query processing. In: SIGMOD (2018)
Sun, J., Li, G.: An end-to-end learning-based cost estimator. Proc. VLDB Endow. 13(3), 307–319 (2019)
Wang, W., Zhang, M., Chen, G., Jagadish, H., Ooi, B.C., Tan, K.L.: Database meets deep learning: challenges and opportunities. ACM SIGMOD Rec. 45(2), 17–22 (2016)
Zhang, Y., Zhang, H., He, Z., Jing, Y., Zhang, K., Wang, X.S.: Parrot: a progressive analysis system on large text collections. Data Sci. Eng. 6(1), 1–19 (2021)
Acknowledgement
This work is funded by the NSFC (No. 61732004 and No. 62072113), the National Key R&D Program of China (No. 2018YFB1004404) and the Zhejiang Lab (No. 2021PE0AC01).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Bi, W., Zhang, H., Jing, Y., He, Z., Zhang, K., Wang, X.S. (2022). Learning-Based Optimization for Online Approximate Query Processing. In: Bhattacharya, A., et al. Database Systems for Advanced Applications. DASFAA 2022. Lecture Notes in Computer Science, vol 13245. Springer, Cham. https://doi.org/10.1007/978-3-031-00123-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-00123-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-00122-2
Online ISBN: 978-3-031-00123-9
eBook Packages: Computer ScienceComputer Science (R0)