SlideShare a Scribd company logo
Opportunities in Data Science
Mr. Swapnil Telrandhe
Sr. Data Scientist
FirstCry(BrainBees Pvt. Ltd.)
Outline
 Data, Big Data and Challenges
 Data Science
 Introduction
 Why Data Science
 Data Scientists
 What do they do?
 Major/Concentration in Data
Opportunities in Data Science.ppt
Opportunities in Data Science.ppt
Data All Around
 Lots of data is being collected
and warehoused
 Web data, e-commerce
 Financial transactions, bank/credit transactions
 Online trading and purchasing
 Social Network
How Much Data Do We have?
How Much Data Do We have?
 Google processes 20 PB a day (2008)
 Facebook has 60 TB of daily logs
 eBay has 6.5 PB of user data + 50 TB/day
(5/2009)
 1000 genomes project: 200 TB
 Cost of 1 TB of disk: $35
 Time to read 1 TB disk: 3 hrs (100 MB/s)
Big Data
 Big Data is any data that is expensive to manage and
hard to extract value from
 Volume
 The size of the data
 Velocity
 The latency of data processing relative to the growing
demand for interactivity
 Variety and Complexity
 the diversity of sources, formats, quality, structures.
Understand the Data
Types of Data We Have
 Relational Data (Tables/Transaction/Legacy
Data)
 Text Data (Web)
 Semi-structured Data (XML)
 Graph Data
 Social Network, Semantic Web (RDF), …
 Streaming Data
What To Do With These Data?
 Aggregation and Statistics
 Data warehousing and OLAP
 Indexing, Searching, and Querying
 Keyword based search
 Pattern matching (XML/RDF)
 Knowledge discovery
 Data Mining
 Statistical Modeling
DATA SCIENCE V/S BIG DATA
 They are not the “same thing”
 Big data = crude oil
Big data is about extracting “crude
oil”, transporting it in “mega
tankers”,shipping it through
“pipelines”, and storing it in
“massive silos”
Data Science v/s AI
What is Data Science?
 An area that manages, manipulates, extracts,
and interprets knowledge from tremendous
amount of data
 Data science (DS) is a multidisciplinary field of
study with goal to address the challenges in big
data
 Data science principles apply to all data – big
and small
What is Data Science?
 Theories and techniques from many fields and
disciplines are used to investigate and analyze a large
amount of data to help decision makers in many
industries such as science, engineering, economics,
politics, finance, and education
 Computer Science
 Pattern recognition, visualization, data warehousing, High
performance computing, Databases, AI
 Mathematics
 Mathematical Modeling
 Statistics
Opportunities in Data Science.ppt
Opportunities in Data Science.ppt
Real Life Examples
 Companies learn your secrets, shopping
patterns, and preferences
 For example, can we know if a woman is pregnant,
even if she doesn’t want us to know?
 Data Science and election (2008, 2012)
 1 million people installed the Obama Facebook app
that gave access to info on “friends”
What do Data Scientists do?
 National Security
 Cyber Security
 Business Analytics
 Engineering
 Healthcare
 And more ….
Concentration in Data Science
 Mathematics and Applied Mathematics
 Applied Statistics/Data Analysis
 Solid Programming Skills (R, Python, SQL)
 Data Mining
 Data Base Storage and Management
 Machine Learning and discovery
Thank you...

More Related Content

PPTX
Introduction to Data Science 5-13.pptx
PPTX
Introduction to Data Science 5-13.pptx
PPTX
Introduction to Data Science
PPTX
Introduction to Data Science
PPTX
Introduction to Data Science\
PPTX
Introduction to Data Science 5-13.pptx
PPTX
NumPy_ SciPy_ _ DatiiiikaFrames (2).pptx
PPTX
Introduction to Data Science
Introduction to Data Science 5-13.pptx
Introduction to Data Science 5-13.pptx
Introduction to Data Science
Introduction to Data Science
Introduction to Data Science\
Introduction to Data Science 5-13.pptx
NumPy_ SciPy_ _ DatiiiikaFrames (2).pptx
Introduction to Data Science

Similar to Opportunities in Data Science.ppt (20)

PPTX
hjol.pptx
PPTX
Introduction to Data Science 5-13.pptx
PDF
Introduction to Data Science 5-13 (1).pdf
PPTX
Introduction to Data Science 5-13.pptx
PPTX
mkol.pptx
PPTX
Introduction to Data Science 1114.pptx
PPTX
Introduction to Data Science 1118.pptx
PPTX
Introduction to Data Science 1113.pptx
PPTX
Introduction to Data Science 1115.pptx
PPTX
Introduction to Data Science 1117.pptx
PPTX
Introduction to Data Science 1116.pptx
PPTX
Introduction to Data Science 112.pptx
PPTX
Introduction to Data Science 1119.pptx
PDF
Big Data, Big Deal: For Future Big Data Scientists
PPTX
What is Big Data , 5'v of BIG DATA and Challenges
PPTX
What is big data and 5'v of big data....
PDF
1Lecture_01_Introduction to Big Data.pdf
PDF
4Lecture_01_Introduction to Big Data.pdf
PDF
NPTEL BIG DATA FULL PPT BOOK WITH ASSIGNMENT SOLUTION RAJIV MISHRA IIT PATNA...
PDF
Big Data Ethics
hjol.pptx
Introduction to Data Science 5-13.pptx
Introduction to Data Science 5-13 (1).pdf
Introduction to Data Science 5-13.pptx
mkol.pptx
Introduction to Data Science 1114.pptx
Introduction to Data Science 1118.pptx
Introduction to Data Science 1113.pptx
Introduction to Data Science 1115.pptx
Introduction to Data Science 1117.pptx
Introduction to Data Science 1116.pptx
Introduction to Data Science 112.pptx
Introduction to Data Science 1119.pptx
Big Data, Big Deal: For Future Big Data Scientists
What is Big Data , 5'v of BIG DATA and Challenges
What is big data and 5'v of big data....
1Lecture_01_Introduction to Big Data.pdf
4Lecture_01_Introduction to Big Data.pdf
NPTEL BIG DATA FULL PPT BOOK WITH ASSIGNMENT SOLUTION RAJIV MISHRA IIT PATNA...
Big Data Ethics
Ad

Recently uploaded (20)

PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPT
Quality review (1)_presentation of this 21
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Launch Your Data Science Career in Kochi – 2025
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Computer network topology notes for revision
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Mega Projects Data Mega Projects Data
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Quality review (1)_presentation of this 21
STUDY DESIGN details- Lt Col Maksud (21).pptx
Clinical guidelines as a resource for EBP(1).pdf
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Launch Your Data Science Career in Kochi – 2025
oil_refinery_comprehensive_20250804084928 (1).pptx
Computer network topology notes for revision
.pdf is not working space design for the following data for the following dat...
climate analysis of Dhaka ,Banglades.pptx
Business Acumen Training GuidePresentation.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
IB Computer Science - Internal Assessment.pptx
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Mega Projects Data Mega Projects Data
Ad

Opportunities in Data Science.ppt

  • 1. Opportunities in Data Science Mr. Swapnil Telrandhe Sr. Data Scientist FirstCry(BrainBees Pvt. Ltd.)
  • 2. Outline  Data, Big Data and Challenges  Data Science  Introduction  Why Data Science  Data Scientists  What do they do?  Major/Concentration in Data
  • 5. Data All Around  Lots of data is being collected and warehoused  Web data, e-commerce  Financial transactions, bank/credit transactions  Online trading and purchasing  Social Network
  • 6. How Much Data Do We have?
  • 7. How Much Data Do We have?  Google processes 20 PB a day (2008)  Facebook has 60 TB of daily logs  eBay has 6.5 PB of user data + 50 TB/day (5/2009)  1000 genomes project: 200 TB  Cost of 1 TB of disk: $35  Time to read 1 TB disk: 3 hrs (100 MB/s)
  • 8. Big Data  Big Data is any data that is expensive to manage and hard to extract value from  Volume  The size of the data  Velocity  The latency of data processing relative to the growing demand for interactivity  Variety and Complexity  the diversity of sources, formats, quality, structures.
  • 10. Types of Data We Have  Relational Data (Tables/Transaction/Legacy Data)  Text Data (Web)  Semi-structured Data (XML)  Graph Data  Social Network, Semantic Web (RDF), …  Streaming Data
  • 11. What To Do With These Data?  Aggregation and Statistics  Data warehousing and OLAP  Indexing, Searching, and Querying  Keyword based search  Pattern matching (XML/RDF)  Knowledge discovery  Data Mining  Statistical Modeling
  • 12. DATA SCIENCE V/S BIG DATA  They are not the “same thing”  Big data = crude oil Big data is about extracting “crude oil”, transporting it in “mega tankers”,shipping it through “pipelines”, and storing it in “massive silos”
  • 14. What is Data Science?  An area that manages, manipulates, extracts, and interprets knowledge from tremendous amount of data  Data science (DS) is a multidisciplinary field of study with goal to address the challenges in big data  Data science principles apply to all data – big and small
  • 15. What is Data Science?  Theories and techniques from many fields and disciplines are used to investigate and analyze a large amount of data to help decision makers in many industries such as science, engineering, economics, politics, finance, and education  Computer Science  Pattern recognition, visualization, data warehousing, High performance computing, Databases, AI  Mathematics  Mathematical Modeling  Statistics
  • 18. Real Life Examples  Companies learn your secrets, shopping patterns, and preferences  For example, can we know if a woman is pregnant, even if she doesn’t want us to know?  Data Science and election (2008, 2012)  1 million people installed the Obama Facebook app that gave access to info on “friends”
  • 19. What do Data Scientists do?  National Security  Cyber Security  Business Analytics  Engineering  Healthcare  And more ….
  • 20. Concentration in Data Science  Mathematics and Applied Mathematics  Applied Statistics/Data Analysis  Solid Programming Skills (R, Python, SQL)  Data Mining  Data Base Storage and Management  Machine Learning and discovery