SlideShare a Scribd company logo
2
Most read
4
Most read
22
Most read
Introduction to Basic Data
Analytics Tools
Sa-ad Mahmud
What is Data Analytics?
Data analytics is the science of analyzing raw data in order to make
conclusions about that information.
Data Analytics Pipeline
Collect Refine Store Analyze Presentation
Data Acquisition
How To Collect Data!
★ REST API
★ From end users
★ Web scrape
★ Email and cloud storage
★ Client’s server
Requests
A library for making HTTP
requests in Python.
Tool for Data Acquisition:
Key Features:
• Keep-alive & Connection Pooling
• Sessions with Cookie Persistence
BeautifulSoup
A library for parsing HTML
and XML documents.
Tool for Data Acquisition:
Key Features:
• Multiple parser support (e.g., lxml,
html5lib, and others)
• Creates parse tree which is easy to
navigate
Flask, Flask-RESTPlus
and Swagger UI
Flask is a micro web framework
written in Python.
Flask-RESTPlus is an extension for
Flask that adds support for
quickly building REST APIs. It
automatically documents the APIs
which is visible in Swagger UI.
Tools for Data Acquisition:
Data Pre-Processing and
Storage
How To Clean Data!
★ Remove duplicate
★ Validate
★ Handle missing data
★ Fix errors
★ Filter outliers
How To Store Data!
★ RDBMS
★ ORM
Pandas
A library for data
manipulation and analysis.
Tool for Data Manipulation:
Key Features:
• Loading data into in-memory data
objects from different file formats.
• Data alignment and integrated
handling of missing data.
SQLAlchemy
SQLAlchemy is a popular
SQL toolkit and Object
Relational Mapper.
Tool for Database Operations:
Key Features:
• Function-based query construction.
• Multiple database support (e.g.,
SQLite, Postgresql, MySQL, Oracle,
MS-SQL, Firebird, Sybase and
others).
Data Analysis
How To Analyze Data!
★ Five number summary
(maximum, minimum, median,
1st quartile, 3rd quartile)
★ Average
★ Standard Deviation
★ Ratio
★ Interval
★ Trends
★ Aggregate and group by
★ Regression
★ Clustering
R and RStudio
R is a popular
programming language
for data analysis. RStudio
is an IDE for R.
Tools for Data Analysis:
Original Classes Clusters by k-means
Data Presentation
How To Visualize Data!
★ Charts
○ Line
○ Bar
○ Pie
○ Scatter
★ Graphs
★ Maps
○ Bubble
○ Polygon
★ Dashboards
Plotly
An interactive graphing
library.
Tools for Data Visualization:
Matplotlib
A plotting library for
Python.
Apache Superset
A Data Visualization and Data
Exploration Platform.
Tool for Data Visualization:
Key Features:
• It supports all the data sources that support SQL
Alchemy and supports querying using SQL.
• Superset allows sharing dashboards.
• It comes with security features like Authentication,
User Management and Roles.
Other Notable Tools
★ Excel
★ Tableau Public
★ Grafana
★ Microsoft Power BI
★ And many more . . .
Challenges
1. Poor quality data
2. Data privacy and security
3. Weak infrastructure
4. Data from multiple sources
5. Scaling data analysis
Links
Flask: https://flask.palletsprojects.com/en/2.0.x/
Flask-RESTPlus: https://flask-restplus.readthedocs.io/en/stable/
SQLAlchemy: https://www.sqlalchemy.org/
R Programming Language: https://www.r-project.org/
k-means Clustering: https://en.wikipedia.org/wiki/K-means_clustering
Plotly: https://plotly.com/
Superset Docs: https://superset.apache.org/docs/intro
Presentation GitHub Link: https://github.com/saadrumon/basic-data-analytics-tools-presentation.git
Thank You
Any Questions?
“Information is the oil of the 21st century, and analytics is the combustion engine.”
- Peter Sondergaard

More Related Content

PPTX
MODULE 1_Introduction to Data analytics and life cycle..pptx
PPTX
Introduction to data science
PPTX
Data analytics vs. Data analysis
PPTX
kinds of analytics
PPTX
Exploratory data analysis
PPTX
Analytical tools
PPTX
Big data and data science overview
PPTX
Data visualization with R
MODULE 1_Introduction to Data analytics and life cycle..pptx
Introduction to data science
Data analytics vs. Data analysis
kinds of analytics
Exploratory data analysis
Analytical tools
Big data and data science overview
Data visualization with R

What's hot (20)

PDF
Exploratory data analysis data visualization
PPTX
Data quality and data profiling
PPTX
Data Visualization.pptx
PPTX
Exploratory data analysis with Python
PPTX
Exploratory data analysis
PPTX
Knowledge Discovery and Data Mining
PPTX
Data Visualization & Analytics.pptx
PDF
Introduction to data analytics
PPTX
Data Analysis & Visualization using MS. Excel
PDF
Big Data
PDF
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
PDF
Big Data Visualization
PPTX
Tableau slideshare
PDF
Statistics For Data Science | Statistics Using R Programming Language | Hypot...
PPT
Data preprocessing
PDF
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
PDF
Introduction to Data Science and Analytics
PPTX
Data analysis with R
PDF
Tools and techniques for data science
PPTX
Data Cleaning Techniques
Exploratory data analysis data visualization
Data quality and data profiling
Data Visualization.pptx
Exploratory data analysis with Python
Exploratory data analysis
Knowledge Discovery and Data Mining
Data Visualization & Analytics.pptx
Introduction to data analytics
Data Analysis & Visualization using MS. Excel
Big Data
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Big Data Visualization
Tableau slideshare
Statistics For Data Science | Statistics Using R Programming Language | Hypot...
Data preprocessing
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Introduction to Data Science and Analytics
Data analysis with R
Tools and techniques for data science
Data Cleaning Techniques
Ad

Similar to Introduction to basic data analytics tools (20)

PPTX
Advanced Data Analytics techniques .pptx
PDF
Data Science & AI Road Map by Python & Computer science tutor in Malaysia
PPTX
DATA ANALYSIS AND VISUALISATION using python 2
PPTX
Overview data analyis and visualisation tools 2020
PPTX
Top 10 Data analytics tools to look for in 2021
PDF
Python's slippy path and Tao of thick Pandas: give my data, Rrrrr...
PPTX
unit 1 big data.pptx
PPTX
Data Analytic s (Unit -1).pRESENTATION .PPT
PDF
Open source analytics
PPTX
Introduction to Data Analytics
PPTX
Data Analytics presentation for college.
PPTX
Introduction to data analytics is important
PPTX
Short term internship project report on power Bi
PPTX
Certified Python Business Analyst
PDF
Data science tools - A.Marchev and K.Haralampiev
PPTX
Data Analysis And Visualization using Python
PDF
Introduction To Data Science With Python
PDF
DAVLectuer3 Exploratory data analysis .pdf
PPT
Data Munging in concepts of data mining in DS
PPTX
Data analytics Course for Beginners (1).pptx
Advanced Data Analytics techniques .pptx
Data Science & AI Road Map by Python & Computer science tutor in Malaysia
DATA ANALYSIS AND VISUALISATION using python 2
Overview data analyis and visualisation tools 2020
Top 10 Data analytics tools to look for in 2021
Python's slippy path and Tao of thick Pandas: give my data, Rrrrr...
unit 1 big data.pptx
Data Analytic s (Unit -1).pRESENTATION .PPT
Open source analytics
Introduction to Data Analytics
Data Analytics presentation for college.
Introduction to data analytics is important
Short term internship project report on power Bi
Certified Python Business Analyst
Data science tools - A.Marchev and K.Haralampiev
Data Analysis And Visualization using Python
Introduction To Data Science With Python
DAVLectuer3 Exploratory data analysis .pdf
Data Munging in concepts of data mining in DS
Data analytics Course for Beginners (1).pptx
Ad

More from Nascenia IT (20)

PPTX
Exploring DeepSeek A Hands-On Dive & How to Adapt the AI Surge.pptx
PPTX
AI Tools for Productivity: Exploring Prompt Engineering and Key Features
PPTX
Communication workshop in nascenia
PPTX
The Art of Statistical Deception
PDF
করোনায় কী করি!
PPTX
GDPR compliance expectations from the development team
PPTX
Writing Clean Code
PPTX
History & Introduction of Neural Network and use of it in Computer Vision
PPTX
Ruby on Rails: Coding Guideline
PPTX
iphone 11 new features
PPTX
Software quality assurance and cyber security
PPTX
Job Market Scenario For Freshers
PPTX
Modern Frontend Technologies (BEM, Retina)
PPTX
CSS for Developers
PPTX
Big commerce app development
PPTX
Integrating QuickBooks Desktop with Rails Application
PPTX
Shopify
PPTX
TypeScript: Basic Features and Compilation Guide
PPTX
Clean code
PPTX
Ruby conf 2016 - Secrets of Testing Rails 5 Apps
Exploring DeepSeek A Hands-On Dive & How to Adapt the AI Surge.pptx
AI Tools for Productivity: Exploring Prompt Engineering and Key Features
Communication workshop in nascenia
The Art of Statistical Deception
করোনায় কী করি!
GDPR compliance expectations from the development team
Writing Clean Code
History & Introduction of Neural Network and use of it in Computer Vision
Ruby on Rails: Coding Guideline
iphone 11 new features
Software quality assurance and cyber security
Job Market Scenario For Freshers
Modern Frontend Technologies (BEM, Retina)
CSS for Developers
Big commerce app development
Integrating QuickBooks Desktop with Rails Application
Shopify
TypeScript: Basic Features and Compilation Guide
Clean code
Ruby conf 2016 - Secrets of Testing Rails 5 Apps

Recently uploaded (20)

PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Cloud computing and distributed systems.
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Encapsulation theory and applications.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Network Security Unit 5.pdf for BCA BBA.
MYSQL Presentation for SQL database connectivity
Understanding_Digital_Forensics_Presentation.pptx
Cloud computing and distributed systems.
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
NewMind AI Weekly Chronicles - August'25 Week I
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
20250228 LYD VKU AI Blended-Learning.pptx
The AUB Centre for AI in Media Proposal.docx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Chapter 3 Spatial Domain Image Processing.pdf
Encapsulation theory and applications.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Building Integrated photovoltaic BIPV_UPV.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
NewMind AI Monthly Chronicles - July 2025
Review of recent advances in non-invasive hemoglobin estimation
Network Security Unit 5.pdf for BCA BBA.

Introduction to basic data analytics tools

  • 1. Introduction to Basic Data Analytics Tools Sa-ad Mahmud
  • 2. What is Data Analytics? Data analytics is the science of analyzing raw data in order to make conclusions about that information. Data Analytics Pipeline Collect Refine Store Analyze Presentation
  • 4. How To Collect Data! ★ REST API ★ From end users ★ Web scrape ★ Email and cloud storage ★ Client’s server
  • 5. Requests A library for making HTTP requests in Python. Tool for Data Acquisition: Key Features: • Keep-alive & Connection Pooling • Sessions with Cookie Persistence
  • 6. BeautifulSoup A library for parsing HTML and XML documents. Tool for Data Acquisition: Key Features: • Multiple parser support (e.g., lxml, html5lib, and others) • Creates parse tree which is easy to navigate
  • 7. Flask, Flask-RESTPlus and Swagger UI Flask is a micro web framework written in Python. Flask-RESTPlus is an extension for Flask that adds support for quickly building REST APIs. It automatically documents the APIs which is visible in Swagger UI. Tools for Data Acquisition:
  • 9. How To Clean Data! ★ Remove duplicate ★ Validate ★ Handle missing data ★ Fix errors ★ Filter outliers How To Store Data! ★ RDBMS ★ ORM
  • 10. Pandas A library for data manipulation and analysis. Tool for Data Manipulation: Key Features: • Loading data into in-memory data objects from different file formats. • Data alignment and integrated handling of missing data.
  • 11. SQLAlchemy SQLAlchemy is a popular SQL toolkit and Object Relational Mapper. Tool for Database Operations: Key Features: • Function-based query construction. • Multiple database support (e.g., SQLite, Postgresql, MySQL, Oracle, MS-SQL, Firebird, Sybase and others).
  • 13. How To Analyze Data! ★ Five number summary (maximum, minimum, median, 1st quartile, 3rd quartile) ★ Average ★ Standard Deviation ★ Ratio ★ Interval ★ Trends ★ Aggregate and group by ★ Regression ★ Clustering
  • 14. R and RStudio R is a popular programming language for data analysis. RStudio is an IDE for R. Tools for Data Analysis: Original Classes Clusters by k-means
  • 16. How To Visualize Data! ★ Charts ○ Line ○ Bar ○ Pie ○ Scatter ★ Graphs ★ Maps ○ Bubble ○ Polygon ★ Dashboards
  • 17. Plotly An interactive graphing library. Tools for Data Visualization: Matplotlib A plotting library for Python.
  • 18. Apache Superset A Data Visualization and Data Exploration Platform. Tool for Data Visualization: Key Features: • It supports all the data sources that support SQL Alchemy and supports querying using SQL. • Superset allows sharing dashboards. • It comes with security features like Authentication, User Management and Roles.
  • 19. Other Notable Tools ★ Excel ★ Tableau Public ★ Grafana ★ Microsoft Power BI ★ And many more . . .
  • 20. Challenges 1. Poor quality data 2. Data privacy and security 3. Weak infrastructure 4. Data from multiple sources 5. Scaling data analysis
  • 21. Links Flask: https://flask.palletsprojects.com/en/2.0.x/ Flask-RESTPlus: https://flask-restplus.readthedocs.io/en/stable/ SQLAlchemy: https://www.sqlalchemy.org/ R Programming Language: https://www.r-project.org/ k-means Clustering: https://en.wikipedia.org/wiki/K-means_clustering Plotly: https://plotly.com/ Superset Docs: https://superset.apache.org/docs/intro Presentation GitHub Link: https://github.com/saadrumon/basic-data-analytics-tools-presentation.git
  • 22. Thank You Any Questions? “Information is the oil of the 21st century, and analytics is the combustion engine.” - Peter Sondergaard