www.edureka.in/data-science
Data Science
Sentiment Analysis in Retail
Domain
www.edureka.co/r-for-analyticsSlide 2 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Objectives
What is data mining
Stages of data mining??
 What is R
What is data science??
What is need of data scientist??
 Roles and Responsibilities of a Data Scientist.
 Sentiment analysis on zomato reviews
At the end of this session, you will be able to
www.edureka.in/data-scienceSlide 3
How about this?
Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
www.edureka.in/data-scienceSlide 4
Data Science Applications??
According to Wikipedia: Data science is the study of the generalizable extraction of knowledge
from data, yet the key word is science.
These scenarios involve:
 Storing, organizing and integrating huge amount of unstructured data
 Processing and analyzing the data
 Extracting knowledge, insights and predict future from the data
Storage of big data is done in Hadoop. For more details on Hadoop please refer Big data and
Hadoop blog http://www.edureka.in/blog/category/big-data-and-hadoop/
Processing, Analyzing, extracting knowledge and insights are done through Machine Learning
Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Slide 5Slide 5 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Knowledge discovery and data mining ( KDD)
Stages of Analytics / Data Mining
Slide 6Slide 6 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
What is R
R is Programming Language
R is Environment for Statistical Analysis
R is Data Analysis Software
Slide 7Slide 7 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Data Visualization in R
This plot represents the
locations of all the traffic
signals in the city.
It is recognizable as
Toronto without any other
geographic data being
plotted - the structure of
the city comes out in the
data alone.
Slide 8Slide 8 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
What is data science??
“More data usually beats better algorithms,” Such as: Recommending movies or music
based on past preferences
No matter how extremely unpleasant your algorithm is, they can often be beaten simply by
having more data (and a less sophisticated algorithm).
Slide 9Slide 9 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Components data science??
Slide 10 www.edureka.in/data-science
Data Science: Demand Supply Gap
Big Data Analyst
Big Data Architect
Big Data Engineer
Big Data Research Analyst
Big Data Visualizer
Data Scientist
50
43
44
31
23
18
50
57
56
69
77
82
Filled job vs unfilled jobs in big data
Filled Unfilled
Vacancy/Filled(%)
Gartner Says Big Data Creates Big Jobs: 4.4 Million IT
Jobs Globally to Support Big Data By
2015http://www.gartner.com/newsroom/id/2207915
Slide 11 www.edureka.in/data-science
Data Science: Job Trends
Slide 12 www.edureka.in/data-science
Slide 13 www.edureka.in/data-science
Hadoop and R together
Slide 14 www.edureka.in/data-science
Machine Learning
We have so many algorithms for data mining which can be used to build systems that can read past data and can
generate a system that can accommodate any future data and derive useful insight from it
Such set of algorithms comes under machine learning
Machine learning focuses on the development of computer programs that can teach themselves to grow and change
when exposed to new data
Train data
ML
model
Algorithms
Slide 15 www.edureka.in/data-science
Types of Learning
Supervised Learning Unsupervised Learning
1. Uses a known dataset to make
predictions.
2. The training dataset includes
input data and response values.
3. From it, the supervised learning
algorithm builds a model to make
predictions of the response
values for a new dataset.
1. Draw inferences from datasets
consisting of input data without
labeled responses.
2. Used for exploratory data analysis
to find hidden patterns or grouping
in data
3. The most common unsupervised
learning method is cluster analysis.
Machine Learning
Slide 16 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
• Common Machine Learning Algorithms
Types of Learning
Supervised Learning
Unsupervised Learning
Algorithms
 Naïve Bayes
 Support Vector Machines
 Random Forests
 Decision Trees
Algorithms
 K-means
 Fuzzy Clustering
 Hierarchical Clustering
Gaussian mixture models
Self-organizing maps
Slide 17 www.edureka.in/data-science
Use Case : Zomato Ratings Review
Slide 18 www.edureka.in/data-science
 Module 1
» Introduction to Data Science
 Module 2
» Basic Data Manipulation using R
 Module 3
» Machine Learning Techniques using R Part -1
- Clustering
- TF-IDF and Cosine Similarity
- Association Rule Mining
 Module 4
» Machine Learning Techniques using R Part -2
- Supervised and Unsupervised Learning
- Decision Tree Classifier
Course Topics
 Module 5
» Machine Learning Techniques using R Part -3
- Random Forest Classifier
- Naïve Bayer’s Classifier
 Module 6
» Introduction to Hadoop Architecture
 Module 7
» Integrating R with Hadoop
 Module 8
» Mahout Introduction and Algorithm
Implementation
 Module 9
» Additional Mahout Algorithms and Parallel
Processing in R
 Module 10
» Project
Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Slide 19
Questions?
Enroll for the Complete Course at : www.edureka.in/data_science
Twitter @edurekaIN, Facebook /edurekaIN, use #askEdureka for Questions
www.edureka.in/data_science
Please Don’t forget to fill in the survey report
Class Recording and Presentation will be available in 24 hours at:
http://www.edureka.in/blog/application-of-clustering-in-data-science-using-real-life-examples/

More Related Content

PDF
Data Science : Make Smarter Business Decisions
PDF
Logistic Regression In Data Science
PDF
Business Analytics with R
PDF
Webinar : Introduction to R Programming and Machine Learning
PDF
Business Analytics Decision Tree in R
PDF
Linear Regression With R
PPTX
Application of Clustering in Data Science using Real-life Examples
PDF
Webinar: Data Visualization-How to Make Sense of Data
Data Science : Make Smarter Business Decisions
Logistic Regression In Data Science
Business Analytics with R
Webinar : Introduction to R Programming and Machine Learning
Business Analytics Decision Tree in R
Linear Regression With R
Application of Clustering in Data Science using Real-life Examples
Webinar: Data Visualization-How to Make Sense of Data

What's hot (20)

PDF
Python webinar 4th june
PDF
Association Mining
PPTX
Business Analytics with R
PDF
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
PDF
Data science presentation
PPTX
Webinar: Mastering Python - An Excellent tool for Web Scraping and Data Anal...
PDF
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
PDF
Data science presentation 2nd CI day
PDF
Python for Data Science
PPTX
Data Science: Not Just For Big Data
PPTX
Introduction to Apache Mahout
PPTX
Data Science using Python
PPTX
Introduction to Big Data/Machine Learning
PDF
Introduction to Big Data Analytics and Data Science
PDF
Introduction To Data Science With Python
PDF
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
PDF
Data science with Perl & Raku
PDF
Introduction to Python for Data Science
PDF
Big Data Analytics and Data Science
PDF
Introduction To Data Science
Python webinar 4th june
Association Mining
Business Analytics with R
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Data science presentation
Webinar: Mastering Python - An Excellent tool for Web Scraping and Data Anal...
Data Science Training | Data Science Tutorial for Beginners | Data Science wi...
Data science presentation 2nd CI day
Python for Data Science
Data Science: Not Just For Big Data
Introduction to Apache Mahout
Data Science using Python
Introduction to Big Data/Machine Learning
Introduction to Big Data Analytics and Data Science
Introduction To Data Science With Python
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data science with Perl & Raku
Introduction to Python for Data Science
Big Data Analytics and Data Science
Introduction To Data Science
Ad

Similar to Sentiment Analysis In Retail Domain (20)

PPT
Data Science tutorial for beginner level to advanced level | Data Science pro...
PPTX
Data Science- Basics.pptx
PDF
Deep learning applications and challenges in big data analytics
PDF
Intro big data.pdf
DOCX
1. Web Mining – Web mining is an application of data mining for di.docx
PDF
The Research Blueprint: Excelling in Data science, Data Analysis and AI
PPTX
2016 03-16 digital energy luncheon
PPTX
Ch1IntroductiontoDataScience.pptx
PDF
Luciano uvi hackfest.28.10.2020
PDF
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
PPTX
Information entanglement
PDF
How to Prepare for a Career in Data Science
PDF
Data Science Unit1 AMET.pdf
PPTX
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
PPTX
Data Science Demystified
PDF
Introduction on Data Science
PDF
How to crack down big data?
PPTX
Pemanfaatan Big Data Dalam Riset 2023.pptx
PDF
Introduction to Data Science.pdf
PDF
Big Data Intoduction & Hadoop ArchitectureModule1.pdf
Data Science tutorial for beginner level to advanced level | Data Science pro...
Data Science- Basics.pptx
Deep learning applications and challenges in big data analytics
Intro big data.pdf
1. Web Mining – Web mining is an application of data mining for di.docx
The Research Blueprint: Excelling in Data science, Data Analysis and AI
2016 03-16 digital energy luncheon
Ch1IntroductiontoDataScience.pptx
Luciano uvi hackfest.28.10.2020
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
Information entanglement
How to Prepare for a Career in Data Science
Data Science Unit1 AMET.pdf
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
Data Science Demystified
Introduction on Data Science
How to crack down big data?
Pemanfaatan Big Data Dalam Riset 2023.pptx
Introduction to Data Science.pdf
Big Data Intoduction & Hadoop ArchitectureModule1.pdf
Ad

More from Edureka! (20)

PDF
What to learn during the 21 days Lockdown | Edureka
PDF
Top 10 Dying Programming Languages in 2020 | Edureka
PDF
Top 5 Trending Business Intelligence Tools | Edureka
PDF
Tableau Tutorial for Data Science | Edureka
PDF
Python Programming Tutorial | Edureka
PDF
Top 5 PMP Certifications | Edureka
PDF
Top Maven Interview Questions in 2020 | Edureka
PDF
Linux Mint Tutorial | Edureka
PDF
How to Deploy Java Web App in AWS| Edureka
PDF
Importance of Digital Marketing | Edureka
PDF
RPA in 2020 | Edureka
PDF
Email Notifications in Jenkins | Edureka
PDF
EA Algorithm in Machine Learning | Edureka
PDF
Cognitive AI Tutorial | Edureka
PDF
AWS Cloud Practitioner Tutorial | Edureka
PDF
Blue Prism Top Interview Questions | Edureka
PDF
Big Data on AWS Tutorial | Edureka
PDF
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
PDF
Kubernetes Installation on Ubuntu | Edureka
PDF
Introduction to DevOps | Edureka
What to learn during the 21 days Lockdown | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Tableau Tutorial for Data Science | Edureka
Python Programming Tutorial | Edureka
Top 5 PMP Certifications | Edureka
Top Maven Interview Questions in 2020 | Edureka
Linux Mint Tutorial | Edureka
How to Deploy Java Web App in AWS| Edureka
Importance of Digital Marketing | Edureka
RPA in 2020 | Edureka
Email Notifications in Jenkins | Edureka
EA Algorithm in Machine Learning | Edureka
Cognitive AI Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Blue Prism Top Interview Questions | Edureka
Big Data on AWS Tutorial | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Kubernetes Installation on Ubuntu | Edureka
Introduction to DevOps | Edureka

Recently uploaded (20)

PDF
The influence of sentiment analysis in enhancing early warning system model f...
PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PDF
Flame analysis and combustion estimation using large language and vision assi...
PDF
A proposed approach for plagiarism detection in Myanmar Unicode text
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PDF
Architecture types and enterprise applications.pdf
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PDF
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PPT
Module 1.ppt Iot fundamentals and Architecture
PPT
What is a Computer? Input Devices /output devices
PDF
Credit Without Borders: AI and Financial Inclusion in Bangladesh
PDF
STKI Israel Market Study 2025 version august
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
2018-HIPAA-Renewal-Training for executives
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
The influence of sentiment analysis in enhancing early warning system model f...
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Flame analysis and combustion estimation using large language and vision assi...
A proposed approach for plagiarism detection in Myanmar Unicode text
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
sbt 2.0: go big (Scala Days 2025 edition)
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
Architecture types and enterprise applications.pdf
Improvisation in detection of pomegranate leaf disease using transfer learni...
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Module 1.ppt Iot fundamentals and Architecture
What is a Computer? Input Devices /output devices
Credit Without Borders: AI and Financial Inclusion in Bangladesh
STKI Israel Market Study 2025 version august
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
2018-HIPAA-Renewal-Training for executives
Custom Battery Pack Design Considerations for Performance and Safety

Sentiment Analysis In Retail Domain

  • 2. www.edureka.co/r-for-analyticsSlide 2 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Objectives What is data mining Stages of data mining??  What is R What is data science?? What is need of data scientist??  Roles and Responsibilities of a Data Scientist.  Sentiment analysis on zomato reviews At the end of this session, you will be able to
  • 3. www.edureka.in/data-scienceSlide 3 How about this? Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
  • 4. www.edureka.in/data-scienceSlide 4 Data Science Applications?? According to Wikipedia: Data science is the study of the generalizable extraction of knowledge from data, yet the key word is science. These scenarios involve:  Storing, organizing and integrating huge amount of unstructured data  Processing and analyzing the data  Extracting knowledge, insights and predict future from the data Storage of big data is done in Hadoop. For more details on Hadoop please refer Big data and Hadoop blog http://www.edureka.in/blog/category/big-data-and-hadoop/ Processing, Analyzing, extracting knowledge and insights are done through Machine Learning Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
  • 5. Slide 5Slide 5 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Knowledge discovery and data mining ( KDD) Stages of Analytics / Data Mining
  • 6. Slide 6Slide 6 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions What is R R is Programming Language R is Environment for Statistical Analysis R is Data Analysis Software
  • 7. Slide 7Slide 7 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Data Visualization in R This plot represents the locations of all the traffic signals in the city. It is recognizable as Toronto without any other geographic data being plotted - the structure of the city comes out in the data alone.
  • 8. Slide 8Slide 8 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions What is data science?? “More data usually beats better algorithms,” Such as: Recommending movies or music based on past preferences No matter how extremely unpleasant your algorithm is, they can often be beaten simply by having more data (and a less sophisticated algorithm).
  • 9. Slide 9Slide 9 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Components data science??
  • 10. Slide 10 www.edureka.in/data-science Data Science: Demand Supply Gap Big Data Analyst Big Data Architect Big Data Engineer Big Data Research Analyst Big Data Visualizer Data Scientist 50 43 44 31 23 18 50 57 56 69 77 82 Filled job vs unfilled jobs in big data Filled Unfilled Vacancy/Filled(%) Gartner Says Big Data Creates Big Jobs: 4.4 Million IT Jobs Globally to Support Big Data By 2015http://www.gartner.com/newsroom/id/2207915
  • 14. Slide 14 www.edureka.in/data-science Machine Learning We have so many algorithms for data mining which can be used to build systems that can read past data and can generate a system that can accommodate any future data and derive useful insight from it Such set of algorithms comes under machine learning Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data Train data ML model Algorithms
  • 15. Slide 15 www.edureka.in/data-science Types of Learning Supervised Learning Unsupervised Learning 1. Uses a known dataset to make predictions. 2. The training dataset includes input data and response values. 3. From it, the supervised learning algorithm builds a model to make predictions of the response values for a new dataset. 1. Draw inferences from datasets consisting of input data without labeled responses. 2. Used for exploratory data analysis to find hidden patterns or grouping in data 3. The most common unsupervised learning method is cluster analysis. Machine Learning
  • 16. Slide 16 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions • Common Machine Learning Algorithms Types of Learning Supervised Learning Unsupervised Learning Algorithms  Naïve Bayes  Support Vector Machines  Random Forests  Decision Trees Algorithms  K-means  Fuzzy Clustering  Hierarchical Clustering Gaussian mixture models Self-organizing maps
  • 17. Slide 17 www.edureka.in/data-science Use Case : Zomato Ratings Review
  • 18. Slide 18 www.edureka.in/data-science  Module 1 » Introduction to Data Science  Module 2 » Basic Data Manipulation using R  Module 3 » Machine Learning Techniques using R Part -1 - Clustering - TF-IDF and Cosine Similarity - Association Rule Mining  Module 4 » Machine Learning Techniques using R Part -2 - Supervised and Unsupervised Learning - Decision Tree Classifier Course Topics  Module 5 » Machine Learning Techniques using R Part -3 - Random Forest Classifier - Naïve Bayer’s Classifier  Module 6 » Introduction to Hadoop Architecture  Module 7 » Integrating R with Hadoop  Module 8 » Mahout Introduction and Algorithm Implementation  Module 9 » Additional Mahout Algorithms and Parallel Processing in R  Module 10 » Project Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
  • 19. Slide 19 Questions? Enroll for the Complete Course at : www.edureka.in/data_science Twitter @edurekaIN, Facebook /edurekaIN, use #askEdureka for Questions www.edureka.in/data_science Please Don’t forget to fill in the survey report Class Recording and Presentation will be available in 24 hours at: http://www.edureka.in/blog/application-of-clustering-in-data-science-using-real-life-examples/