SlideShare a Scribd company logo
How to Start Thinking Like a
Data Scientist.
Debashish Jana
What Do Data Scientist Do?
As chief datahandlers and strategists, they are tasked with
transforming volumes of data into actionable insights,
enabling the organization to strengthen customer
relationships, improve service delivery and drive new
opportunities.
What tasks consume most of
Their time?
Data preparation tends to be one of my
most time-intensive activities. It’s
extremely critical, but an onerous task.
It also leaves less time to actually
analyze the data and deliver new
insights to decision makers.
Complex data types yield large volumes of information to be
analyzed. But it’s not the amount of data per se that consumes
the most time; it’s getting it in the right format, augmenting it and
figuring out what information might be missing. It’s an ongoing
process that they have to perform again and again.
What are some of the challenges
they face on a day-to-day basis?
One of the biggest challenges as a data scientist is applying the domain
expertise to solve a problem. They have a plethora of algorithms and
techniques to get value from data, but they need solutions to help
them apply those to applications—to connect the dots from the
statistics to the business opportunity.
One of the biggest obstacles to analytical productivity is refining
and formatting the data required for high-quality analytics. The lack
of a universal or standardized programming language specifically
geared to the data science domain doesn’t make it any easier. Even
with the best tools today, there is really not a good way around
cleaning up the data manually. It’s a continuous cycle of collecting
and cleaning data and trying to figure out if it will yield significantly
relevant insights
What skills are the most valuable for
data scientists?
Data scientists need expertise within
multiple disciplines. They need to be
good at databases. They need some
knowledge of software engineering.
They need to know some machine
learning. And they need to know
some statistics.
Data scientists are
inquisitive. They are
continuously exploring,
asking questions, doing
what-if analyses
questioning existing
assumptions and
processes.
How to start thinking like a data scientist
Data scientists need a good blend of domain knowledge and a blend of
business expertise. They need to be extremely inquisitive and relentless
at figuring out how to solve a particular problem. That means digging
into different approaches and alternatives.
A skilled data scientist explores and examines data from multiple
disparate sources.They simply do not collect and report on data, but
also look at it from many angles, determine what it means and then
recommend ways to apply the findings
Data scientists often become the liaison between IT and C-level
executives. Therefore, they need to be able to speak both languages
and understand the hierarchy of data; they can’t just be the data
expert.
What has been your experience in
working across
teams to drive an end result?
Data science is a sharing-oriented discipline. In their line experience,
they have found that if “I have a question”, there’s always someone out
there with an answer—whether it’s inside or outside the organization.
One thing that becomes increasingly important with big data is keeping
track of how analytics applications are performing. The data scientists
have provided a lot of value by building up metrics that enable them to
measure what’s going on in an application when we push a code
change.
Advice from data scientist to companies that
are trying to get value from their data?
Building out a data scientist role or data science team will foster
collaboration among the organization and provide ‘champions of data’
who can derive maximum business value from the organization’s data.
Embracing the big data challenge
Why and how are these insights relevant to a
manager in India?
According to a recent India Jobs study, data science is the fastest-
growing field in India. As every company turns into a technology
company, organizations in industries such as manufacturing, healthcare
and retail are all looking to strengthen their data teams. India is
emerging as one of the world’s largest data science capitals, with
companies like Mercedes-Benz, Walmart, PayPal and AIG setting up
data science centers in the country. Research from the Everest Group
shows that India holds between 35 percent and 50 percent of the
global analytics services market.
The number of analytics jobs nearly doubled from April 2016 to April
2017, and there are almost 50,000 analytics and data jobs currently
open in the country. Hiring companies are looking for people skilled in
tools such as R, Python and Hadoop.
How to start thinking like a data scientist

More Related Content

PPTX
Data analytics
PDF
Welcome to Data Science
PDF
How to start your journey as a data scientist
PPTX
Analytics 2
PPTX
Data analytics
PPTX
Data Scientist: The Sexiest Job in the 21st Century
PPT
Data analytics & its Trends
DOCX
What is Data Science?
Data analytics
Welcome to Data Science
How to start your journey as a data scientist
Analytics 2
Data analytics
Data Scientist: The Sexiest Job in the 21st Century
Data analytics & its Trends
What is Data Science?

What's hot (20)

PDF
Data scientists are all liars
PPTX
Introduction to Data Analytics
PPTX
The Truth About Analytics
PPTX
Data Science And Analytics Information Part 3
PPTX
Data Analytics
PDF
Big Data Analytics
PPTX
Data analytics
PPTX
Data analytics vs. Data analysis
PPTX
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
PPTX
Analytics for actuaries cia
PDF
Data science team, a practice to setup
PPTX
Talent Analytics
PDF
Understanding big data and data analytics big data
PPTX
Data Analytics: An On-Ramp to a Better Understanding of Your Business
PDF
SAS/MIT/Sloan Data Analytics
PDF
Data science team (new version)
PPTX
Data is worthless if you don;t communicate
PDF
What is data science ?
PPTX
Importance of data analytics for business
Data scientists are all liars
Introduction to Data Analytics
The Truth About Analytics
Data Science And Analytics Information Part 3
Data Analytics
Big Data Analytics
Data analytics
Data analytics vs. Data analysis
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Analytics for actuaries cia
Data science team, a practice to setup
Talent Analytics
Understanding big data and data analytics big data
Data Analytics: An On-Ramp to a Better Understanding of Your Business
SAS/MIT/Sloan Data Analytics
Data science team (new version)
Data is worthless if you don;t communicate
What is data science ?
Importance of data analytics for business
Ad

Similar to How to start thinking like a data scientist (20)

PPTX
Week1day2 (1)
PPTX
Big Data Courses In Mumbai
PDF
data scientists and their role
PDF
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
PPTX
What is data_science_by_khawar_shehzad
PDF
What's the profile of a data scientist?
PDF
Data Scientist - Good Rebels -
PDF
Achieving Business Success with Data.pdf
PDF
Who is a data scientist
PPTX
PPTX
How Data Science is Changing the World
PDF
What does it_takes_to_be_a_good_data_scientist_2019_aim_simplilearn
PDF
Data Science Growth Accelerator
PDF
Data Science Whitepaper
PDF
best data science training in bangalore
PPTX
Week1 day2slide
PDF
What Managers Need to Know about Data Science
PPTX
Data Scientist
PPTX
Ds article ppt
ODP
Data Scientists : The Hottest job of the 21st century
Week1day2 (1)
Big Data Courses In Mumbai
data scientists and their role
Challenges Of A Junior Data Scientist_ Best Tips To Help You Along The Way.pdf
What is data_science_by_khawar_shehzad
What's the profile of a data scientist?
Data Scientist - Good Rebels -
Achieving Business Success with Data.pdf
Who is a data scientist
How Data Science is Changing the World
What does it_takes_to_be_a_good_data_scientist_2019_aim_simplilearn
Data Science Growth Accelerator
Data Science Whitepaper
best data science training in bangalore
Week1 day2slide
What Managers Need to Know about Data Science
Data Scientist
Ds article ppt
Data Scientists : The Hottest job of the 21st century
Ad

More from Debashish Jana (10)

PPTX
Lies damned lies and statistics
PPTX
Data to make hit tv show
PPTX
Guide to data analytics
PPTX
Bad statistics
PPTX
Predictive analytics
PPTX
Data communication
PPTX
The beauty of data visualization
PPTX
Make data more human
PPTX
Big data
PPTX
Data analysis
Lies damned lies and statistics
Data to make hit tv show
Guide to data analytics
Bad statistics
Predictive analytics
Data communication
The beauty of data visualization
Make data more human
Big data
Data analysis

Recently uploaded (20)

PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Lecture1 pattern recognition............
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPT
Quality review (1)_presentation of this 21
PPTX
Computer network topology notes for revision
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Database Infoormation System (DBIS).pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Introduction to machine learning and Linear Models
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Qualitative Qantitative and Mixed Methods.pptx
Lecture1 pattern recognition............
Introduction-to-Cloud-ComputingFinal.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
IBA_Chapter_11_Slides_Final_Accessible.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Quality review (1)_presentation of this 21
Computer network topology notes for revision
STUDY DESIGN details- Lt Col Maksud (21).pptx
Supervised vs unsupervised machine learning algorithms
Clinical guidelines as a resource for EBP(1).pdf
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
.pdf is not working space design for the following data for the following dat...
Database Infoormation System (DBIS).pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Introduction to machine learning and Linear Models
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx

How to start thinking like a data scientist

  • 1. How to Start Thinking Like a Data Scientist. Debashish Jana
  • 2. What Do Data Scientist Do? As chief datahandlers and strategists, they are tasked with transforming volumes of data into actionable insights, enabling the organization to strengthen customer relationships, improve service delivery and drive new opportunities.
  • 3. What tasks consume most of Their time? Data preparation tends to be one of my most time-intensive activities. It’s extremely critical, but an onerous task. It also leaves less time to actually analyze the data and deliver new insights to decision makers.
  • 4. Complex data types yield large volumes of information to be analyzed. But it’s not the amount of data per se that consumes the most time; it’s getting it in the right format, augmenting it and figuring out what information might be missing. It’s an ongoing process that they have to perform again and again.
  • 5. What are some of the challenges they face on a day-to-day basis? One of the biggest challenges as a data scientist is applying the domain expertise to solve a problem. They have a plethora of algorithms and techniques to get value from data, but they need solutions to help them apply those to applications—to connect the dots from the statistics to the business opportunity.
  • 6. One of the biggest obstacles to analytical productivity is refining and formatting the data required for high-quality analytics. The lack of a universal or standardized programming language specifically geared to the data science domain doesn’t make it any easier. Even with the best tools today, there is really not a good way around cleaning up the data manually. It’s a continuous cycle of collecting and cleaning data and trying to figure out if it will yield significantly relevant insights
  • 7. What skills are the most valuable for data scientists? Data scientists need expertise within multiple disciplines. They need to be good at databases. They need some knowledge of software engineering. They need to know some machine learning. And they need to know some statistics.
  • 8. Data scientists are inquisitive. They are continuously exploring, asking questions, doing what-if analyses questioning existing assumptions and processes.
  • 10. Data scientists need a good blend of domain knowledge and a blend of business expertise. They need to be extremely inquisitive and relentless at figuring out how to solve a particular problem. That means digging into different approaches and alternatives.
  • 11. A skilled data scientist explores and examines data from multiple disparate sources.They simply do not collect and report on data, but also look at it from many angles, determine what it means and then recommend ways to apply the findings
  • 12. Data scientists often become the liaison between IT and C-level executives. Therefore, they need to be able to speak both languages and understand the hierarchy of data; they can’t just be the data expert.
  • 13. What has been your experience in working across teams to drive an end result? Data science is a sharing-oriented discipline. In their line experience, they have found that if “I have a question”, there’s always someone out there with an answer—whether it’s inside or outside the organization.
  • 14. One thing that becomes increasingly important with big data is keeping track of how analytics applications are performing. The data scientists have provided a lot of value by building up metrics that enable them to measure what’s going on in an application when we push a code change.
  • 15. Advice from data scientist to companies that are trying to get value from their data? Building out a data scientist role or data science team will foster collaboration among the organization and provide ‘champions of data’ who can derive maximum business value from the organization’s data.
  • 16. Embracing the big data challenge
  • 17. Why and how are these insights relevant to a manager in India? According to a recent India Jobs study, data science is the fastest- growing field in India. As every company turns into a technology company, organizations in industries such as manufacturing, healthcare and retail are all looking to strengthen their data teams. India is emerging as one of the world’s largest data science capitals, with companies like Mercedes-Benz, Walmart, PayPal and AIG setting up data science centers in the country. Research from the Everest Group shows that India holds between 35 percent and 50 percent of the global analytics services market.
  • 18. The number of analytics jobs nearly doubled from April 2016 to April 2017, and there are almost 50,000 analytics and data jobs currently open in the country. Hiring companies are looking for people skilled in tools such as R, Python and Hadoop.