BIG DATA
ANALYTICS
DR. OWAIS BHAT
DS-Visualization-Unit-4 COMPUTER SCIENCE.pdf
DATA-SCIENCE APPLICATIONS
Data science is a rapidly growing field used in a wide variety of
applications, including:
• Fraud detection: Data science can be used to identify and prevent fraudulent
transactions. For example, banks use data science to identify suspicious
activity in credit card transactions.
• Customer segmentation: Data science can be used to segment customers into
groups based on their characteristics. This information can be used to target
customers with specific marketing messages.
• Product recommendations: Data science can be used to recommend products
to customers based on their past purchases or browsing history. This
information can be used to increase sales and improve customer satisfaction.
• Risk assessment: Data science can be used to assess the risk of an event
happening. For example, insurance companies use data science to assess the
risk of a customer filing a claim.
• Personalized medicine: Data science can be used to personalize medical
treatment for patients. For example, doctors can use data science to identify
the best treatment for a patient based on their individual characteristics.
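As a rough illustration of the fraud-detection bullet above, the sketch below flags card transactions whose amount is far from a customer's typical spending. It is a minimal example under stated assumptions, not how any particular bank works: the function name `flag_suspicious`, the sample amounts, and the cutoff of 3.5 are all hypothetical, and the technique used is a simple modified z-score based on the median absolute deviation.

```python
from statistics import median


def flag_suspicious(amounts, threshold=3.5):
    """Return the indices of amounts whose modified z-score exceeds threshold.

    The modified z-score uses the median and the median absolute deviation
    (MAD), so a single extreme value does not distort the baseline.
    """
    center = median(amounts)
    mad = median(abs(a - center) for a in amounts)
    if mad == 0:
        # All amounts are (nearly) identical: nothing stands out.
        return []
    return [
        i
        for i, a in enumerate(amounts)
        if 0.6745 * abs(a - center) / mad > threshold
    ]


# Hypothetical transaction history for one customer (illustrative values only).
history = [42.0, 38.5, 51.2, 47.9, 40.3, 44.1, 39.8, 46.0, 43.7, 980.0]
suspicious = flag_suspicious(history)
print(suspicious)  # indices of transactions worth a closer look
```

Production systems typically combine many more signals (merchant, location, timing) and trained models; the point here is only the idea of scoring each transaction against expected behaviour.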
RECENT TRENDS IN DATA COLLECTION & ANALYSIS
• The rise of real-time data collection and analysis.
• The increasing use of artificial intelligence (AI) and machine learning (ML) for
data analysis.
• The growing popularity of cloud-based data collection and analysis tools.
• The increasing focus on data privacy and security.
Some of the most popular technologies for data visualization in data science:
• Matplotlib: Matplotlib is a Python library for creating static, animated, and
interactive visualizations. It is a popular choice for data scientists because it is
easy to use and versatile.
• Seaborn: Seaborn is a Python library that builds on Matplotlib to provide a
high-level interface for creating attractive and informative visualizations. It is a
good choice for data scientists who want to create visualizations that are both
visually appealing and easy to understand.
• Plotly: Plotly is a Python library for creating interactive visualizations that
can be embedded in web pages or documents. It is a good choice for data
scientists who want to create visualizations that can be shared and explored
online.
• Tableau: Tableau is a commercial data visualization tool known for its ease of
use and interactive capabilities. It is a good choice for businesses and
organizations that need to create data-driven visualizations for non-technical
audiences.
• Qlik Sense: Qlik Sense is another commercial data visualization tool, known
for its speed and scalability. It is a good choice for businesses that need to
process large amounts of data quickly and create interactive visualizations.
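To make the Matplotlib entry in the list above concrete, here is a minimal sketch of a static line chart. The month labels, sales figures, and output file name are invented for illustration, and the example assumes Matplotlib is installed. The `Agg` backend is selected only so the script also runs on machines without a display.

```python
import os
import tempfile

import matplotlib

matplotlib.use("Agg")  # non-interactive backend; no window is opened
import matplotlib.pyplot as plt

# Hypothetical data, for illustration only.
months = ["Jan", "Feb", "Mar", "Apr", "May", "Jun"]
sales = [120, 135, 128, 150, 162, 158]

fig, ax = plt.subplots(figsize=(6, 3.5))
ax.plot(months, sales, marker="o")  # one line with a marker per month
ax.set_title("Monthly sales (illustrative data)")
ax.set_xlabel("Month")
ax.set_ylabel("Units sold")
ax.grid(True, alpha=0.3)
fig.tight_layout()

# Write the chart to a PNG file in the system's temporary directory.
out_path = os.path.join(tempfile.gettempdir(), "monthly_sales.png")
fig.savefig(out_path, dpi=150)
print("Chart written to", out_path)
```

Seaborn and Plotly follow a similar pattern (prepare data, call a plotting function, then show or export), with Seaborn adding statistical chart types and Plotly producing interactive output.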
DATA-SCIENCE – R LANGUAGE
R is a powerful programming language for data science, and it can be used to
develop a variety of applications. Common tools and approaches for building
data science applications in R include:
Shiny:
• Shiny is an R package that facilitates the creation of interactive web
applications directly from R scripts.
• It allows data scientists to build dynamic dashboards, visualizations, and data-
driven web interfaces without extensive web development knowledge.
• Shiny apps can be hosted online or deployed on local servers.
R Markdown:
• R Markdown is a versatile tool for creating reproducible reports, documents,
and presentations that integrate R code, visualizations, and narrative text.
• It enables data scientists to weave code, output, and text into a single
document, making it easy to share insights and analysis.
Plumber:
• Plumber is an R package for building APIs (Application Programming
Interfaces) using R code.
• Data scientists can create RESTful APIs to expose R models, functions, or data
processing pipelines for integration with other applications.
RStudio Connect:
• RStudio Connect (since renamed Posit Connect) is a platform that allows you
to publish and share Shiny apps, R Markdown documents, and Plumber APIs
securely within your organization.
• It simplifies the deployment and management of R-based applications.
R Packages:
• R allows you to develop custom R packages that encapsulate functions, data,
and documentation for specific data science tasks.
• Packages can be shared and reused across projects, enhancing code
modularity and reusability.
R with SQL Databases:
• R can be integrated with SQL databases through the DBI interface package
together with a database-specific driver such as RMySQL, enabling data
retrieval, analysis, and visualization directly from databases.
• This is useful for building data-driven applications that fetch and process data
from relational databases.
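The connect, query, and analyze pattern described above is not specific to R. The sketch below shows the same idea in Python rather than R, using the standard-library `sqlite3` module in place of DBI and a driver package. The `orders` table, its columns, and the sample rows are hypothetical, and an in-memory database stands in for a real server.

```python
import sqlite3
from statistics import mean

# In-memory database so the example is self-contained (assumption: a real
# application would connect to an existing database server instead).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [
        ("alice", 30.0),
        ("bob", 12.5),
        ("alice", 45.0),
        ("bob", 20.0),
        ("carol", 60.0),
    ],
)
conn.commit()

# Let the database do the aggregation, then pull the small result set
# into the analysis environment.
rows = conn.execute(
    "SELECT customer, COUNT(*), SUM(amount) "
    "FROM orders GROUP BY customer ORDER BY customer"
).fetchall()

# Further analysis on the retrieved rows: average spend per customer.
average_total = mean(total for _, _, total in rows)
conn.close()

for customer, n_orders, total in rows:
    print(customer, n_orders, total)
print("Average total per customer:", round(average_total, 2))
```

In R, the corresponding steps would be opening a connection, sending a query, and receiving a data frame; the division of labour between database and analysis code is the same.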
THANK YOU
owais@iust.ac.in