Azure Databricks for Data Engineers - Project on Formula 1 Racing
A collection of sample Databricks Spark notebooks for use with Azure Databricks.
Notebook | Description | Lang |
---|---|---|
Mount Setup | Configuration for storage access and mount points | Python |
Data Ingestion: CSV - Databricks - DataLake | Ingests CSV data into a Databricks cluster, runs transformations on the cluster, and writes the transformed data as Parquet to the processed area of the DataLake | Python |
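
The two notebooks listed above follow a common pattern: mount the storage, then read CSV, transform, and write Parquet. The sketch below is a minimal illustration of that pattern and is not copied from the notebooks. It assumes an ADLS Gen2 account accessed through a service principal; the function names, the `/mnt/<account>/<container>` layout, and the `ingestion_date` column are hypothetical choices made for the example.

```python
from typing import Dict


def oauth_mount_configs(client_id: str, client_secret: str, tenant_id: str) -> Dict[str, str]:
    """Build the extra_configs dict for an OAuth (service principal) mount.

    Assumption: the storage is ADLS Gen2 and a service principal is used.
    """
    return {
        "fs.azure.account.auth.type": "OAuth",
        "fs.azure.account.oauth.provider.type":
            "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
        "fs.azure.account.oauth2.client.id": client_id,
        "fs.azure.account.oauth2.client.secret": client_secret,
        "fs.azure.account.oauth2.client.endpoint":
            f"https://login.microsoftonline.com/{tenant_id}/oauth2/token",
    }


def mount_container(dbutils, storage_account: str, container: str,
                    configs: Dict[str, str]) -> str:
    """Mount a container under /mnt (hypothetical layout) unless already mounted."""
    mount_point = f"/mnt/{storage_account}/{container}"
    already_mounted = any(m.mountPoint == mount_point for m in dbutils.fs.mounts())
    if not already_mounted:
        dbutils.fs.mount(
            source=f"abfss://{container}@{storage_account}.dfs.core.windows.net/",
            mount_point=mount_point,
            extra_configs=configs,
        )
    return mount_point


def ingest_csv_to_parquet(spark, raw_path: str, processed_path: str):
    """Read a CSV file, add an audit column, and write the result as Parquet."""
    # Imported here so the helpers above can be used without PySpark installed.
    from pyspark.sql.functions import current_timestamp

    df = (
        spark.read
        .option("header", True)       # first row holds the column names
        .option("inferSchema", True)  # an explicit schema is safer for real pipelines
        .csv(raw_path)
    )
    # Hypothetical transformation: stamp each row with the ingestion time.
    transformed = df.withColumn("ingestion_date", current_timestamp())
    transformed.write.mode("overwrite").parquet(processed_path)
    return transformed
```

In a Databricks notebook, `spark` and `dbutils` are already defined, so a call might look like `mount_container(dbutils, "<account>", "raw", configs)` followed by `ingest_csv_to_parquet(spark, "/mnt/<account>/raw/<file>.csv", "/mnt/<account>/processed/<name>")`. Credentials would normally come from a secret scope via `dbutils.secrets.get` rather than being written into the notebook.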
Bug reports and pull requests are welcome on GitHub at https://github.com/UcheIgbokwe/FormulaOneDataETL