2. What is Data Science
Data Scientist
Business Intelligence
Data Science Tools
Cloud Computing
Use Cases
Conclusion
01
02
03
04
05
06
07
3. Data science combines math and statistics,
specialized programming, advanced analytics,
artificial intelligence (AI) and machine learning
with specific subject matter expertise to
uncover actionable insights hidden in an
organization’s data. These insights can be used
to guide decision making and strategic
planning.
Data Science
DEFINITION
https://www.ibm.com/
4. Business Problem Understanding
10
Data collection
10
Data cleaning & Preparation
10
Exploratory data analysis
10
Feature Engineering
10
Machine Learning Model
10
Model Evaluation
10
Data Visualization
10
6
5 4
3
2
1
7
8
LIFECYCLE
API
Web Scraping
Varies depending
on the type of
data
Using machine
learning and deep
learning models
Testing the ML
model
Using tools like
Tableau &
PowerBI
5. Understand the business to ask relevant questions and identify pain points.
Apply statistics, computer science, and business knowledge to data analysis.
Use various tools for data preparation and extraction, including databases,
SQL, data mining, and data integration methods.
Extract insights from big data using predictive analytics, AI, machine
learning, natural language processing, and deep learning.
Write programs to automate data processing and calculations.
Communicate findings clearly through storytelling and visualization for all
levels of stakeholders.
Explain how results can address business challenges.
Collaborate with team members such as data analysts, business analysts, IT
architects, data engineers, and application developers.
Data science is considered a discipline, while data
scientists are the practitioners within that field
DATA SCIENCE VS. DATA SCIENTIST
2
6. 3
It’s easy to confuse Data Science with Business Intelligence (BI), because
they both incorporate analysis of data. But they differ in focus:
BUSINESS INTELLIGENCE
8. Data science incorporate the use of large datasets --> We need tools that
can scale with the size of the data (for either big companies, or small
startups)
Cloud Storage Solutions (e.g Data lakes) gives access to storage
infrastructure => Capability to ingest & process large volumes of Data easily
Cloud Storage Solutions : provide flexibility to users, and the ability to store
large amounts of data as needed. (2)
Note: Cloud platforms have different pricing methods (per-use /
subscriptions) to meet with the users needs (enterprise, startup) (2)
Simply put, cloud computing is the delivery of computing services—including servers,
storage, databases, networking, software, analytics, and intelligence—over the internet
(“the cloud”) to offer faster innovation, flexible resources, and economies of scale.(1)
CLOUD COMPUTING
(1) Source : Microsoft Azur
(2) ibm.com
5
10. Conclusion
Data science involves using mathematics, specialized programming, and
advanced analytics to extract actionable insights from data, guiding decision-
making and strategic planning. The field is rapidly growing across industries,
requiring collaboration between various roles and tools, including cloud
computing, to process and analyze large datasets efficiently.
DATA SCIENCE GENERAL INTRODUCTION
7