Principal Program Manager, Azure Data CAT, Spark
-
Microsoft
- Denver, CO
-
00:17
(UTC -06:00) - milescole.dev
- in/mileswcole
Stars
A multi-modal Python library for benchmarking lakehouse engines and ELT scenarios, supporting both industry-standard and novel benchmarks.
Implementation of the TPC-DS benchmark using Spark SQL. TPC-DS is a decision support benchmark widely used in the industry
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
🎨 Simplistic, responsive jekyll based open source theme