Ankit Rathi
Data Science
(Introductory Session)
Your motivation?
Expectations?
Let me start with a story…
My Journey
Perot (2.4 years) | Mastek (3) | RBS (5.5) | Genpact (1.5) | SITA (0.5)
SQL, PL/SQL | ETL, DWH, BI | R/Python, Hadoop/Spark, Azure/AWS
Building the context
2012: The turning point…
Skill-set Evolution
Skill        | Perot (2005-07) | Mastek (2007-10) | RBS1 (2010-13) | RBS2 (2013-16) | Genpact/SITA (2016-present)
SQL/PLSQL    | 3 | 4 | 3.5 | 3 | 3
ETL/DWH/BI   | 0 | 3 | 4 | 3.5 | 3
R/Python     | 0 | 0 | 3 | 3.5 | 4
Hadoop/Spark | 0 | 0 | 3 | 4 | 3.5
AWS/Azure    | 0 | 0 | 2 | 2.5 | 3
Current Role
Data Architecture / Data Science: 70% / 30%
Setting the expectations
• Expertise vs. Data Science
• Breadth & depth of the field
• Requires knowledge & skills (practice, practice, practice)
• Scope: Overview & approach only
Data Science
Why Data Science?
What is Data Science?
Data science is an interdisciplinary field of scientific methods, processes, algorithms and systems to extract knowledge or insights from data in various forms, either structured or unstructured, similar to data mining.
~ Wikipedia
How to do Data Science?
Skill-set of a Data Scientist
• Data Literacy
• Linear Algebra, Statistics & Probability
• Algorithms
• Programming (SQL/R/Python, KNIME/SPSS)
• Domain Knowledge
• Story-telling
Challenges
• Stakeholder Buy-in
• Data Issues (Access, Quality)
• Problem Statement
• Maturity (Process, Analytics)
• Wrong Expectations
Learning Approach
• Kaggle (learn, apply, compete)
• Analytics Vidhya (learn, stay updated, compete)
• KDnuggets (learn, stay updated)
• Machine Learning Mastery (learn, apply)
• Data Science Central (learn, stay updated)
• Quora, Medium & Stack Overflow (ask, learn, stay updated)
“If you want to learn to swim, jump into the water.”
~ Bruce Lee
“The secret of getting ahead is getting started.”
~ Mark Twain
Demo
Connect with me
https://ankitrathi.wixsite.com/home/links
Questions?
