Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
INTRODUCTION TO DATA SCIENCE
DATA SCIENCE TOOLS
DATA SCIENCE TOOLS FOR DATA STORAGE
DATA SCIENCE TOOLS FOR EDA
DATA SCIENCE TOOLS FOR DATA MODELLING
DATA SCIENCE TOOLS FOR DATA VISUALIZATION
www.edureka.co
INTRODUCTION TO DATA SCIENCE
Data Science is the process of extracting knowledge and insights from data by
using scientific methods.
Data Science involves collecting, analysing and modelling data to solve real-world problems. It is
used for fraud detection, disease detection, recommendation engines and so on.
DATA SCIENCE TOOLS
Data Science tools come with pre-defined functions, algorithms, and a user-friendly GUI.
Hence, they can be used to build complex Machine Learning models without writing code in a
programming language.
Data Science stages: Data Collection, Exploratory Data Analysis, Data Modelling, Data Visualization
DATA SCIENCE TOOLS FOR DATA STORAGE
Scales and manages massive amounts of data
Uses the Hadoop Distributed File System (HDFS) for data storage
Integrates with other modules such as Hadoop MapReduce and Hadoop YARN
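
For readers unfamiliar with the MapReduce model named on this slide, the following is a minimal single-process sketch in plain Python. It only imitates the map, shuffle, and reduce phases on a word count; it does not use Hadoop, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative assumptions rather than any Hadoop API.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as a framework would do between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the counts collected for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Run the three phases in sequence on a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return reduce_phase(shuffle(pairs))


if __name__ == "__main__":
    # Hypothetical input lines, chosen only for illustration.
    print(word_count(["big data tools", "data science tools"]))
```

In a real cluster the map and reduce steps would run in parallel on different nodes over data blocks stored in HDFS; here everything runs in one process to keep the idea visible.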
Data processing via Apache Hadoop and Spark clusters
The default storage system is Windows Azure Blob
Provides Microsoft R Server
DATA SCIENCE TOOLS FOR EDA
Data integration tool based on the Extract, Transform, Load (ETL) architecture
Uses ETL to manage data
Supports distributed processing, grid computing, and adaptive load balancing
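
The Extract, Transform, Load pattern behind this kind of tool can be sketched in a few lines of standard-library Python. This is an illustration of the pattern only, not of any particular product: the sample data, the `payments` table, and the function names are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; a real pipeline would read from files or source systems.
RAW_CSV = """name,amount
alice, 10.5
bob,
carol,7
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim whitespace, drop rows with no amount, cast to float."""
    cleaned = []
    for row in rows:
        amount = (row["amount"] or "").strip()
        if amount:
            cleaned.append((row["name"].strip().title(), float(amount)))
    return cleaned


def load(records, conn):
    """Load: write the cleaned records into a SQLite table."""
    conn.execute("CREATE TABLE IF NOT EXISTS payments (name TEXT, amount REAL)")
    conn.executemany("INSERT INTO payments VALUES (?, ?)", records)
    conn.commit()


def run_etl(text, conn):
    """Run extract, transform, and load, then return the stored rows."""
    load(transform(extract(text)), conn)
    return conn.execute(
        "SELECT name, amount FROM payments ORDER BY name"
    ).fetchall()


if __name__ == "__main__":
    print(run_etl(RAW_CSV, sqlite3.connect(":memory:")))
```

Keeping the three stages as separate functions mirrors how ETL tools let each stage be configured, scheduled, and scaled independently.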
Handles data processing, building Machine Learning models, and more
Supports integration with the Hadoop framework
Generates predictive models through automated modelling
DATA SCIENCE TOOLS FOR DATA MODELLING
Makes Machine Learning easy to apply
Supports generalized linear models (GLM), boosting, and Deep Learning
Integrates with Apache Hadoop
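
As a rough idea of what a GLM is, the sketch below fits logistic regression (a GLM with a logit link) by plain gradient descent on a single feature. It is a teaching example under stated assumptions, not how any modelling tool implements it: the data, learning rate, epoch count, and function names are all made up for illustration.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real number to a value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, epochs=2000):
    """Fit weight w and bias b by batch gradient descent on the log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            error = sigmoid(w * x + b) - y  # predicted probability minus label
            grad_w += error * x
            grad_b += error
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


def predict(w, b, x):
    """Return class 1 when the predicted probability is at least 0.5."""
    return 1 if sigmoid(w * x + b) >= 0.5 else 0


if __name__ == "__main__":
    # Hypothetical data: hours studied versus pass (1) or fail (0).
    hours = [1, 2, 3, 4, 5, 6]
    passed = [0, 0, 0, 1, 1, 1]
    w, b = fit_logistic(hours, passed)
    print(predict(w, b, 1), predict(w, b, 6))
```

GUI-based tools wrap this kind of fitting loop, along with feature handling and tuning, behind menus and wizards.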
Supports parallel programming for data analysis, data modelling, and similar tasks
Trains and tests Machine Learning models quickly
Makes model evaluation much easier
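
To show what model evaluation involves, here is a small sketch that computes accuracy, precision, and recall for a binary classifier from its predictions. The labels and the function names (`confusion_counts`, `evaluate`) are hypothetical; a modelling tool would typically report these metrics automatically.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive and actual == positive:
            tp += 1
        elif predicted == positive:
            fp += 1
        elif actual == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def evaluate(y_true, y_pred, positive=1):
    """Return accuracy, precision, and recall, using 0.0 when undefined."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical ground truth and predictions, for illustration only.
    actual = [1, 0, 1, 1, 0, 0]
    predicted = [1, 0, 0, 1, 1, 0]
    print(evaluate(actual, predicted))
```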
DATA SCIENCE TOOLS FOR VISUALIZATION
Visualizes massive data sets to find correlations and patterns
Creates customized reports and dashboards
Integrates with Apache Hadoop
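
The correlations a visualization tool surfaces are usually plain statistics underneath. As a hedged illustration, this sketch computes the Pearson correlation coefficient between columns in pure Python; the column names and values are invented, and `pearson` and `correlation_matrix` are hypothetical helper names.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant column")
    return cov / sqrt(var_x * var_y)


def correlation_matrix(columns):
    """Correlate every pair of named columns; keys are (name, name) tuples."""
    names = list(columns)
    return {(a, b): pearson(columns[a], columns[b]) for a in names for b in names}


if __name__ == "__main__":
    # Hypothetical data set, for illustration only.
    data = {
        "ad_spend": [1, 2, 3, 4, 5],
        "sales": [2, 4, 6, 8, 10],
        "returns": [5, 4, 3, 2, 1],
    }
    for pair, r in correlation_matrix(data).items():
        print(pair, round(r, 3))
```

A dashboard would typically render a matrix like this as a heat map, so that strong positive and negative relationships stand out at a glance.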
Clear and concise visualizations
Supports in-memory data processing
Automatically generates data associations