Lists (8)
Sort Name ascending (A-Z)
Starred repositories
Notebooks/scripts for youtube tutorials
Turning Data into Insight: Flexible Lakehouse with MinIO, Iceberg, Airflow, DBT, Spark, Pandera & Superset.
Breakthrough Method for Agile Ai Driven Development
A simple task management system for managing AI dev agents
Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy
All the sample datasets that I use across training, demos, learning, and testing.
Accompanying repository to the free YouTube video course: PySpark Course - Solving an Interview Assignment
Databricks. Incremental data processing, task orchestration, and production job monitoring.
This is collection of projects, practices in data engineering foundation
Resources for tracking DP-700 exam prep progress.
In this module, I will be updating the topic wise SQL tutorial notes which is very useful for a fresher to start with MYSQL from basics to advanced.
Repository of SQL projects, case studies, platform solutions, and learning resources to enhance SQL skills through practical applications. Includes content from DataLemur, LeetCode, HackerRank, and…
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
DataOps for Microsoft Data Platform technologies. https://aka.ms/dataops-repo
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Python wrapper for dbt-core to extend dbt with custom Python.
Scripts and samples to support Confluent Demos, Talks, and Blogs. Not all of the examples in this repository are kept up to date. For automated tutorials and QA'd code, see https://github.com/confl…
Real-time analytics dashboard for Bath parking space occupancy data