Data Architect JD
Data Architect JD
Data Architect JD
Experience implementing Azure Data Factory Pipelines using latest technologies and techniques
Use the interactive Databricks notebook environment using Apache Spark SQL, Examine
external data sets, Query existing data sets using Spark SQL
Visualize query results and data using the built-in Databricks visualization features, Perform
exploratory data analysis using Spark SQL.
ETL processing and data extraction using Azure Databricks to write a basic ETL pipeline using
the Spark design pattern, Ingest data using DBFS mounts in Azure Blob Storage, Ingest data
using serial and parallel JDBC reads
ETL transformations and Loads using Azure Databricks to apply built-in functions to manipulate
data, Write UDFs with a single DataFrame column inputs, Apply UDFs with a multiple
DataFrame column inputs and that return complex types
Manage Delta lake using Data bricks to use the interactive Databricks notebook environment,
Create, append and upsert data into a data lake.