Data Science Process
Data Science Process
Data Science Process
Data Science
• Data science is the study of data analysis by advanced
technology (Machine Learning, Artificial Intelligence,
Big data). It processes a huge amount of structured,
semi-structured, and unstructured data to extract
insight meaning, from which one pattern can be
designed that will be useful to take a decision for
grabbing the new business opportunity, the betterment
of product/service, and ultimately business growth.
Data science is the process to make sense of Big
data/huge amount of data that is used in business.
Exploratory Data Analysis
• Exploratory data analysis (EDA) is used by data
scientists to analyze and investigate data sets
and summarize their main characteristics,
often employing data visualization methods. It
helps determine how best to manipulate data
sources to get the answers you need, making
it easier for data scientists to discover
patterns, spot anomalies, test a hypothesis, or
check assumptions.
EDA Importance
• The main purpose of EDA is to help look at data before making any
assumptions. It can help identify obvious errors, as well as better
understand patterns within the data, detect outliers or anomalous
events, find interesting relations among the variables.