What is data science and why it is important now?
Author – Bohitesh Misra (bohitesh.misra@gmail.com), September 2017
Data Science!
In layman's terms, data scientists collect data from various sources, clean
and organize it, and shape it into a form that can be analyzed. The data is
then split into training and testing sets: a model or algorithm built with
statistical methods is fitted on the training set and assessed on the testing
set, and it can then be applied to any area or sector we find suitable. Data
mining helps end users extract useful business information from large
databases.
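The training/testing separation described above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the function name `train_test_split`, the 80/20 ratio, and the sample records are assumptions made for the example, not part of the original article.

```python
import random


def train_test_split(rows, test_fraction=0.2, seed=42):
    """Shuffle a copy of rows and split it into (train, test) lists."""
    if not 0 < test_fraction < 1:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)
    # A fixed seed keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical records: (hours_studied, exam_score)
records = [(h, 50 + 5 * h) for h in range(10)]
train, test = train_test_split(records, test_fraction=0.2)
print(len(train), "training rows;", len(test), "test rows")
```

The model is fitted only on `train`; `test` is held back so that the assessment reflects data the model has not seen.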
Asking the right questions
Asking the right questions is extremely important, so good communication
skills are essential for data scientists. With the advent of the internet and
modern technology, we now have instant access to data and the tools to test
our interpretations, so decisions can be made quickly.
Data scientist
Data scientists use their data and analytical skills to find and interpret
rich data sources; manage large volumes of data; merge data sources; ensure
the consistency of datasets; create visualizations that aid in understanding
data; build mathematical models using the data; and present and communicate
insights and findings to business decision makers.
"Data scientist" has become a popular buzzword, with Harvard Business
Review dubbing it "The Sexiest Job of the 21st Century" and McKinsey &
Company projecting a global excess demand of 1.5 million new data
scientists.
Statistical models
How does data mining work? Much as a human being does: it uses historical
information to learn for the future. Mathematical foundations such as linear
algebra, probability, statistics, and calculus, together with techniques such
as regression, clustering, and predictive analysis, are indispensable in data
science. Python and R are the preferred programming languages; both have
packages and libraries built specifically for data science, which let us
learn the language and start applying it right away. I began with R and use
its basic libraries for text and data mining.
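To make "using historical information to learn for the future" concrete, here is a minimal sketch of simple linear regression fitted by ordinary least squares. The article's author works in R; Python is used here only for illustration, and the function `fit_line` and the ad-spend figures are hypothetical.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical history: monthly ad spend (in thousands) vs. units sold
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
units_sold = [12.0, 19.0, 29.0, 37.0, 45.0]

slope, intercept = fit_line(ad_spend, units_sold)

# Use the fitted line to estimate sales for a spend level not yet observed.
forecast = slope * 6.0 + intercept
print(f"slope={slope:.2f}, intercept={intercept:.2f}, forecast={forecast:.2f}")
```

In practice a library routine would normally replace the hand-written formula; the point here is only that the model's parameters come entirely from past observations.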
Data Cleaning
It is often said that 80% of a data scientist's work is data cleaning. Data
is sometimes available in convenient formats such as CSV and XLS, but very
little data is ready to be used directly in a program. APIs, web scraping,
and SQL come to the rescue of data scientists. Spark and MapReduce are used
to clean and analyze large, distributed datasets.
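A small sketch of what cleaning can involve, assuming a hypothetical CSV extract with stray whitespace, inconsistent capitalisation, missing or non-numeric values, and duplicate rows. The column names, the sample data, and the choice to drop (rather than repair) bad rows are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical raw extract with typical problems.
RAW = """name,city,age
 Alice ,delhi,34
Bob,Mumbai,
alice,Delhi,34
Carol,  PUNE ,twenty
Dan,Chennai,41
"""


def clean_rows(text):
    """Normalise text fields, drop rows without a numeric age, remove duplicates."""
    seen = set()
    cleaned = []
    for row in csv.DictReader(io.StringIO(text)):
        name = row["name"].strip().title()
        city = row["city"].strip().title()
        age_text = (row["age"] or "").strip()
        if not age_text.isdigit():
            continue  # missing or non-numeric age: drop the row
        key = (name, city)
        if key in seen:
            continue  # same person and city already recorded
        seen.add(key)
        cleaned.append({"name": name, "city": city, "age": int(age_text)})
    return cleaned


cleaned = clean_rows(RAW)
for record in cleaned:
    print(record)
```

Whether to drop, repair, or flag a bad row is a judgment that depends on the analysis; dropping is simply the shortest option to show.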
It’s everywhere!
Data-driven solutions are being used everywhere, from e-commerce websites
and social networking sites to financial visualization and interpretation.
Companies have increasingly adopted data-driven practices over the last few
years. In fact, it would be difficult to find a sector in which data science
cannot be used to make better decisions, and companies are gradually
realizing this and adopting it.
Want to learn it?
I came across data science, decided it was the right fit for me, and recently
completed the Executive Management Programme in the subject at the Indian
Institute of Technology Delhi. Learning data science is easy and convenient,
thanks to the large number of MOOCs and eBooks available for free online.
I urge you to think about how it may apply to you. If you run a business, you
can gather data in the form of customer reviews and opinions to make better
data-driven decisions. Even as an individual, you can use the data from movie
review sites to choose your next movie.
Data science for Startups
Startups critically need a data strategy for the collection, storage, and use
of large volumes of data, so that the data serves the startup's core selling
point and can also open up additional monetisation avenues in the future.
A common case is a recommendation engine, which can benefit from all kinds of
information about its users: age, gender, purchases, offers, and discounts.
Designing the platform in a way that improves information collection from its
users results in a large database that can be used to manage discount deals
better, improve advertising, or even enhance the user experience on the
platform.
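As a rough illustration of how purchase data can feed a recommendation engine, the sketch below suggests items bought by users whose purchases overlap with a given user's. It is a deliberately simple co-purchase heuristic, not a production recommender; the user names, items, and the `recommend` function are hypothetical.

```python
from collections import Counter

# Hypothetical purchase histories collected by the platform.
purchases = {
    "asha": {"laptop", "mouse", "keyboard"},
    "ravi": {"laptop", "mouse", "monitor"},
    "meera": {"mouse", "keyboard", "headset"},
    "john": {"phone", "charger"},
}


def recommend(user, purchases, top_n=2):
    """Suggest items bought by users whose baskets overlap with this user's."""
    own = purchases[user]
    scores = Counter()
    for other, items in purchases.items():
        if other == user:
            continue
        overlap = len(own & items)
        if overlap == 0:
            continue  # no shared purchases, so no evidence of similar taste
        for item in items - own:
            scores[item] += overlap  # weight by how similar the other user is
    # Highest score first; ties broken alphabetically for a stable order.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, _ in ranked[:top_n]]


print(recommend("asha", purchases))
```

The more purchase and profile data the platform collects, the more overlaps such a scheme has to work with, which is the link to the data strategy discussed above.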
A clear data strategy can provide startups with additional revenue
opportunities and a competitive advantage.
