PostCENN: postgresql with machine learning models for cardinality estimation

L Woltmann, D Olwig, C Hartmann, D Habich… - Proceedings of the …, 2021 - dl.acm.org
Proceedings of the VLDB Endowment, 2021dl.acm.org
In this demo, we present PostCENN, an enhanced PostgreSQL database system with an
end-to-end integration of machine learning (ML) models for cardinality estimation. In
general, cardinality estimation is a topic with a long history in the database community.
While traditional models like histograms are extensively used, recent works mainly focus on
developing new approaches using ML models. However, traditional as well as ML models
have their own advantages and disadvantages. With PostCENN, we aim to combine both to …
In this demo, we present PostCENN, an enhanced PostgreSQL database system with an end-to-end integration of machine learning (ML) models for cardinality estimation. In general, cardinality estimation is a topic with a long history in the database community. While traditional models like histograms are extensively used, recent works mainly focus on developing new approaches using ML models. However, traditional as well as ML models have their own advantages and disadvantages. With PostCENN, we aim to combine both to maximize their potentials for cardinality estimation by introducing ML models as a novel means to increase the accuracy of the cardinality estimation for certain parts of the database schema. To achieve this, we integrate ML models as first class citizen in PostgreSQL with a well-defined end-to-end life cycle. This life cycle consists of creating ML models for different sub-parts of the database schema, triggering the training, using ML models within the query optimizer in a transparent way, and deleting ML models.
ACM Digital Library