CV Sumit Saxena 022024
CV Sumit Saxena 022024
CV Sumit Saxena 022024
CAREER OBJECTIVE:
Results-oriented professional currently serving as an AI/ML Engineer and specialize in the implementation
and deployment of AI/ML projects in the Azure cloud environment with 10+ years of experience.
Committed to staying at the forefront of technological advancements and passionate about leveraging
data-driven solutions to drive business success and innovation.
SUMMARY:
❑ Education & Certifications: Computer Science Engineering along with MBA in EC (IMS DAVV, first
class with distinction), complemented by a series of professional certifications in data science, cloud
computing (DP-203) and business intelligence.
❑ Technical Expertise: Skilled in application development, data engineering, and analytics. Proficient
in Python, PySpark, Scala, MongoDB, Azure services (Data Factory, Databricks, Storage, OCR,
Cosmosdb), Snowflake, NLP, Tableau, PowerBI, SSIS, and SSAS. Expertise in MLOps, CI/CD, and
version control with Github.
❑ Data Architecture: Knowledgeable in data platforms, models, tooling, management, governance,
analytics, Data pipelines, visualization, and data quality/integrity, optimizing queries for large
datasets. Experience in architectural blueprint design, roadmap development, project impact
assessments, and business partnership. Hands-on experience with Hive, SQL Server, Microsoft BI
Stack, and data visualization tools (Tableau, PowerBI). Knowledgeable in data profiling, data quality,
and MPP database technologies like Synapse, SnowFlake etc.
❑ Project Leading & Collaboration: Active contributor to code development in AI/ML projects,
Managed data pipelines, automation, and monitoring frameworks. Implemented best practices in
systems integration, security, performance, and data management. Collaborated with internal
teams to develop solutions, POCs, and optimize data science models. Familiar with agile
development, including DevOps MlOps and DataOps concepts.
KEY PROJECTS:
❑ Call Event Tracking (ML application that identify the Steerage calls from voice transcript using
SetFit model)
Architecture design and end to end deployment in cloud using Azure Databricks Unity Catalogue
workspace. Effectively used control tables to parameterize the code and Job clusters.
❑ MDI (AI/ML based web application for Medical Document Indexing and OCR)
Developed an AI/ML solution to automate medical document indexing, reducing processing time by
40% and improving clinical review accuracy.
❑ Auto-contact Call Classification and RFM Analysis to Identify Distress Member and Call Reason
(PySpark, Azure Databricks, Snowflake)
Implemented a data analytics pipeline using PySpark and Azure Databricks, optimizing call center
operations and enhancing customer service insights.
❑ DSG Metrics (Voice-based reporting solution using Alexa and AWS)
Led the development of a voice-activated reporting system, increasing reporting efficiency and
accessibility for digital metrics monitoring.
SKILLS:
❑ Programming – Python (NumPy, Pandas, Scikit-Learn, Flask, Scrapy, nltk, matplotlib), R(dplyr,
ggplot2,tm,xlsx,wordcloud), Core Java, Scala.
❑ Cloud and Data Engineering – Azure, Azure Databricks, Azure Data Factory Pipelines, Azure Storage,
Oracle Cloud.
❑ Databases - SQL(Oracle, MySQL), SPARK -SQL(Spark), NOSQL(MongoDB), SSIS, Snowflake, DWH.
❑ Data Visualization Skill – Tableau Desktop and Server Admin, PowerBI DAX, Statistical and
Exploratory Data Analysis, SSAS Cubes.
❑ Big Data and AI/ML - Datameer, PySpark, Kafka, Hive, Linear Regression, Logistic Regression,
Supervised and Unsupervised Learning, Azure OCR, NLP.
❑ Other Skills - Github, Jenkins CI/CD, DevOps, DataOps, MS Excel, VBA Macros, JSON, Linux.
EDUCATIONAL QUALIFICATIONS:
CERTIFICATIONS:
IBM Certified Data Science Architect; Optum-Scale Data Science Program (3 months) - UHG; Microsoft
Certified Azure Data Engineer Associate (DP-203); Databricks Fundamentals Certification; Azure
Certifications (AZ-900, DP-900, AI-900); Oracle Cloud Infrastructure Fundamental Associate (1Z0-1085);
Introduction to Big Data - UC San Diego; Tableau Certification - Duke University; Data Studio Certification
- Google Analytics; ITIL V3 Certification; Datameer Certified Analyst.
HOBBIES:
I hereby declare that the information furnished above is true to the best of my knowledge.