SlideShare a Scribd company logo
Introduction to
Machine Learning
Introduction to
Machine Learning
Anurag Srivastava
Software Consultant
Knoldus Software LLP
Anurag Srivastava
Software Consultant
Knoldus Software LLP
Topics CoveredTopics Covered
● What is machine learning
● Different kinds of machine learning
● Key elements of machine learning
● Types of machine learning
● Techniques for machine learning
What is Machine Learning ?What is Machine Learning ?
What is Machine Learning ?What is Machine Learning ?
Machine learning is a type of artificial intelligence (AI) that provides computers with the
ability to learn without being explicitly programmed. Machine learning focuses on the
development of computer programs that can teach themselves to grow and change when
exposed to new data.
What is Machine Learning ?What is Machine Learning ?
Machine learning is a type of artificial intelligence (AI) that provides computers with the
ability to learn without being explicitly programmed. Machine learning focuses on the
development of computer programs that can teach themselves to grow and change when
exposed to new data.
Where we can used learning ?
1.Result vary every time.
2.Solution needs to be adapted to particular cases.
3.Human does not exist.
Different kinds of machine learningDifferent kinds of machine learning
● Data Mining :
Data Mining is the combination Artificial Intelligence and statistical analysis tools
that are bringing together to discover hidden information in our data. There are
many hidden information in data and these are :
● Association
● Sequence : Sequence for tie events to together.
● Classification : Classification for recognizing patterns.
● Forecasting : Forecasting is used for predicting on the based on their past pattern.
● Anomalies : Anomalies, outliers, frauds, many different types of things we can do.
● Grouping : Grouping of data
● Predictive Analysis :
Predictive models and analysis are typically used to forecast future probabilities.
Applied to business, predictive models are used to analyze current data and
historical facts in order to better understand. It uses a number of techniques,
including data mining, statistical modeling and machine learning to help analysts
make future business forecasts.
Different kinds of machine learningDifferent kinds of machine learning
● Advance Analytic :
It is the autonomous or semi-autonomous process on data using sophisticated
techniques and tools. Its beyond of traditional Business Intelligence. It helps to
find more deeper information of data, to make prediction and generate
recommendations.
● Data Science :
Data science is an interdisciplinary field about processes and systems to extract
knowledge or insights from data in various forms, either structured or
unstructured,which is a continuation of some of the data analysis fields such as
statistics, data mining, and predictive analytic, similar to Knowledge Discovery in
Databases.
Key elements of machine learningKey elements of machine learning
● Explore Data
● Find Patterns
● Performs Prediction
Key elements of machine learningKey elements of machine learning
● Explore Data :
1. Labeled Data : Labeled data is a data with some meaningful
“tag, label or class”. We know about the data and which type of
operation performed on that data.
2. Unlabeled Data : Unlabeled data is a simple raw data. We do
not know about the data and there is no explanation for that data.
Key elements of machine learningKey elements of machine learning
➔ Explore Data :
➔ Data Preparation Process : This is very important part for the machine
learning because when you feed them right data than it solve problem
with accuracy. This is 3 step process :
➔ Select Data : In this process we select the subset data from the
available data that you will be working.
➔ Preprocess Data : In this process we try to get selected data into the
form that we can work. This is also 3 step process :
1. Formatting : It can be that data is not in a required format. We
Format the data into relational database or in text file.
2. Cleaning : In this process we remove or fix missing data. It may be
that data is incomplete or it may be contains sensitive data and these
data need to be removed.
Key elements of machine learningKey elements of machine learning
Data Preparation Process Continue 

3. Sampling : We use sampling for exploring and prototyping solution
before perform the whole dataset because if we take whole dataset that
time it took longer time to run algorithm and computational and memory
requirement.
➔ Transform Data : This is the final step for data preparation. We use :
1. Scaling : Data may contain attribute with various quantities like
dollars, kilogram. So data attributes have same scale such as between 0
and 1 for smallest and largest value.
2. Decomposition : In the data there may be complex concept which
may be more meaningful when we split it.
3. Aggregation : There may be features that can be more meaningful
when we aggregate them.
Key elements of machine learningKey elements of machine learning
● Explore Data
We divide data into 3 part :
Training Data,
Testing Data,
Validating Data.
Validating Data : Validation data doesn't always come into play. It's very
useful when you have a model on your network when you have to do all
the tuning and optimization of the parameters and layers and things like
that.
Key elements of machine learningKey elements of machine learning
● Explore Data
● Find Patterns
● Performs Prediction
Key elements of machine learningKey elements of machine learning
● Explore Data
● Find Patterns
● Performs Prediction
Types of machine learningTypes of machine learning
● Supervised Learning :
Supervised learning is to build a model which can make prediction based on the the
previous result. It provide labeled data. So we provide our inputs are provided along
with their corresponding class variable, and our goal is to predict the evaluate.
● Unsupervised Learning :
Unsupervised learning is data points have no labels associated with them. We don't
have any prior knowledge of any information related to the data. We don't have
provided class value or output value for each one of our vectors or instances. we are
using this in applications of which training data comprises examples of the input
without any corresponding target variable and the goal is to find the naturally co-
occurring patterns such as groupings or clustering or segmentation.
Types of machine learningTypes of machine learning
● Reinforcement learning:
A computer program interacts with a dynamic environment in which it must perform a
certain goal, without a teacher explicitly telling it whether it has come close to its
goal.
● Semi-supervised learning :
It uses unlabeled data for training, typically a small amount of labeled data
with a large amount of unlabeled data.
Technique for machine learningTechnique for machine learning
Classification Algorithms - Naive Bayes Method
Naive Bayes's rule is used for finding the probability of events. If we have events E and
total number of instance H, So, we can calculate the probability of the events.
Naive Bayes rule is : Pr[H|E]= (đ‘·đ‘· [đ‘· |đ‘·] đ‘·đ‘·[đ‘·]) / đ‘·đ‘·[đ‘·]
Where,
Evidence E = instance Event.
H = class value for instance.
Pr [H|E] = Probability of event after evidence has been seen.
Problem For Naive Bayes's Method
Problem For Naive Bayes's Method
Find the probability condition with the data set :
● Pr[Outlook = Sunny | yes] = 2/9
● Pr[Temp= Cool | yes] = 3/9
● Pr[Humidity= High | yes] = 3/9
● Pr[Windy = True | yes] = 3/9
● Pr[yes] = 9/14
● Pr[Outlook = Sunny | no] = 3/5
● Pr[Temp= Cool | no] = 1/5
● Pr[Humidity= High | no] = 4/5
● Pr[Windy = True | no] = 3/5
● Pr[no] = 5/14
Find the probability condition with the data set :
● Pr[Outlook = Sunny | yes] = 2/9
● Pr[Temp= Cool | yes] = 3/9
● Pr[Humidity= High | yes] = 3/9
● Pr[Windy = True | yes] = 3/9
● Pr[yes] = 9/14
● Pr[Outlook = Sunny | no] = 3/5
● Pr[Temp= Cool | no] = 1/5
● Pr[Humidity= High | no] = 4/5
● Pr[Windy = True | no] = 3/5
● Pr[no] = 5/14
Problem For Naive Bayes's Method
Problem For Naive Bayes's Method
P(Yes | Sunny) = (2/9 * 3/9 * 3/9 * 3/9 * 9/14) = .0053
P(No | Sunny) = (3/5 * 1/5 *4/5 * 3/5 * 5/14) = .0206
Now we convert probabilities by normalization :
P[YES] = (.0053) / (.0053 + .0206) = .205
P[NO] = (.0206) / (.0053 + .0206) = .795
So we can see that the probability for not playing tennis in the ~80%.
This is the basic for the Machine Learning and Naive Bayes Method for doing prediction.
ReferencesReferences
● Coursera
● Data Prepration
Any Question?Any Question?
Introduction to Machine learning

More Related Content

PPTX
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
PDF
Introduction to machine learning
PDF
Classification Based Machine Learning Algorithms
PPTX
What is Machine Learning
PDF
Machine Learning for Dummies
PPT
3. mining frequent patterns
PPTX
Deep Learning Explained
PPTX
Artificial Intelligence, Machine Learning and Deep Learning
Machine Learning Tutorial | Machine Learning Basics | Machine Learning Algori...
Introduction to machine learning
Classification Based Machine Learning Algorithms
What is Machine Learning
Machine Learning for Dummies
3. mining frequent patterns
Deep Learning Explained
Artificial Intelligence, Machine Learning and Deep Learning

What's hot (20)

PDF
Dimensionality Reduction
PDF
SHORTEST PATH FINDING VISUALIZER
PDF
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
PDF
Machine Learning
PDF
Linear models for classification
PPTX
Unit 1 - ML - Introduction to Machine Learning.pptx
PPTX
Clusters techniques
PDF
Dimensionality Reduction
PPTX
Python Seaborn Data Visualization
PPTX
Supervised and unsupervised learning
PDF
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
PPT
2.4 rule based classification
PPTX
Approximation algorithms
PDF
Bayesian Networks - A Brief Introduction
PPTX
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
PPT
Machine Learning
PPTX
Machine Learning - Challenges, Learnings & Opportunities
PDF
Cross-validation Tutorial: What, how and which?
PPTX
Machine learning ppt
PDF
Lecture3 - Machine Learning
Dimensionality Reduction
SHORTEST PATH FINDING VISUALIZER
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Machine Learning
Linear models for classification
Unit 1 - ML - Introduction to Machine Learning.pptx
Clusters techniques
Dimensionality Reduction
Python Seaborn Data Visualization
Supervised and unsupervised learning
What is Machine Learning | Introduction to Machine Learning | Machine Learnin...
2.4 rule based classification
Approximation algorithms
Bayesian Networks - A Brief Introduction
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning
Machine Learning - Challenges, Learnings & Opportunities
Cross-validation Tutorial: What, how and which?
Machine learning ppt
Lecture3 - Machine Learning
Ad

Viewers also liked (20)

ODP
Introduction to Scala JS
ODP
Akka streams
ODP
Drilling the Async Library
ODP
Getting Started With AureliaJs
ODP
String interpolation
ODP
Realm Mobile Database - An Introduction
ODP
Mailchimp and Mandrill - The ‘Hominidae’ kingdom
PDF
Kanban
ODP
Shapeless- Generic programming for Scala
ODP
Introduction to Java 8
ODP
Introduction to Scala Macros
ODP
An Introduction to Quill
ODP
Mandrill Templates
ODP
Introduction to ScalaZ
ODP
Introduction to Knockout Js
ODP
Functors, Applicatives and Monads In Scala
ODP
ANTLR4 and its testing
ODP
Effective way to code in Scala
ODP
HTML5, CSS, JavaScript Style guide and coding conventions
ODP
Functional programming in Javascript
Introduction to Scala JS
Akka streams
Drilling the Async Library
Getting Started With AureliaJs
String interpolation
Realm Mobile Database - An Introduction
Mailchimp and Mandrill - The ‘Hominidae’ kingdom
Kanban
Shapeless- Generic programming for Scala
Introduction to Java 8
Introduction to Scala Macros
An Introduction to Quill
Mandrill Templates
Introduction to ScalaZ
Introduction to Knockout Js
Functors, Applicatives and Monads In Scala
ANTLR4 and its testing
Effective way to code in Scala
HTML5, CSS, JavaScript Style guide and coding conventions
Functional programming in Javascript
Ad

Similar to Introduction to Machine learning (20)

PPTX
Statistical Machine Learning Lecture notes
PPTX
Machine Learning
PDF
Machine learning and_buzzwords
PPTX
Machine learning
PDF
Machine Learning
PPTX
Industrial training machine learning
PPTX
Advanced Machine Learning- Introduction to Machine Learning
PDF
Introduction to Machine Learning
PPTX
Lectuhhhhhhhhhhhhhhhhhhhhhhbbbhhhre 1.pptx
PPTX
Intro to machine learning
PDF
Machine Learning_Unit 2_Full.ppt.pdf
PPT
Machine learning-in-details-with-out-python-code
PDF
Machine Learning an Research Overview
PPTX
Machine Learning course in Chandigarh Join
PDF
Machine learning
PDF
Chapter 5 - Machine of it Learning (1).pdf
PDF
Chapter 5 - Machine which of Learning.pdf
PPT
Demystifying AI AND ml and its applications
PDF
MACHINE LEARNING(R17A0534).pdf
Statistical Machine Learning Lecture notes
Machine Learning
Machine learning and_buzzwords
Machine learning
Machine Learning
Industrial training machine learning
Advanced Machine Learning- Introduction to Machine Learning
Introduction to Machine Learning
Lectuhhhhhhhhhhhhhhhhhhhhhhbbbhhhre 1.pptx
Intro to machine learning
Machine Learning_Unit 2_Full.ppt.pdf
Machine learning-in-details-with-out-python-code
Machine Learning an Research Overview
Machine Learning course in Chandigarh Join
Machine learning
Chapter 5 - Machine of it Learning (1).pdf
Chapter 5 - Machine which of Learning.pdf
Demystifying AI AND ml and its applications
MACHINE LEARNING(R17A0534).pdf

More from Knoldus Inc. (20)

PPTX
Angular Hydration Presentation (FrontEnd)
PPTX
Optimizing Test Execution: Heuristic Algorithm for Self-Healing
PPTX
Self-Healing Test Automation Framework - Healenium
PPTX
Kanban Metrics Presentation (Project Management)
PPTX
Java 17 features and implementation.pptx
PPTX
Chaos Mesh Introducing Chaos in Kubernetes
PPTX
GraalVM - A Step Ahead of JVM Presentation
PPTX
Nomad by HashiCorp Presentation (DevOps)
PPTX
Nomad by HashiCorp Presentation (DevOps)
PPTX
DAPR - Distributed Application Runtime Presentation
PPTX
Introduction to Azure Virtual WAN Presentation
PPTX
Introduction to Argo Rollouts Presentation
PPTX
Intro to Azure Container App Presentation
PPTX
Insights Unveiled Test Reporting and Observability Excellence
PPTX
Introduction to Splunk Presentation (DevOps)
PPTX
Code Camp - Data Profiling and Quality Analysis Framework
PPTX
AWS: Messaging Services in AWS Presentation
PPTX
Amazon Cognito: A Primer on Authentication and Authorization
PPTX
ZIO Http A Functional Approach to Scalable and Type-Safe Web Development
PPTX
Managing State & HTTP Requests In Ionic.
Angular Hydration Presentation (FrontEnd)
Optimizing Test Execution: Heuristic Algorithm for Self-Healing
Self-Healing Test Automation Framework - Healenium
Kanban Metrics Presentation (Project Management)
Java 17 features and implementation.pptx
Chaos Mesh Introducing Chaos in Kubernetes
GraalVM - A Step Ahead of JVM Presentation
Nomad by HashiCorp Presentation (DevOps)
Nomad by HashiCorp Presentation (DevOps)
DAPR - Distributed Application Runtime Presentation
Introduction to Azure Virtual WAN Presentation
Introduction to Argo Rollouts Presentation
Intro to Azure Container App Presentation
Insights Unveiled Test Reporting and Observability Excellence
Introduction to Splunk Presentation (DevOps)
Code Camp - Data Profiling and Quality Analysis Framework
AWS: Messaging Services in AWS Presentation
Amazon Cognito: A Primer on Authentication and Authorization
ZIO Http A Functional Approach to Scalable and Type-Safe Web Development
Managing State & HTTP Requests In Ionic.

Recently uploaded (20)

PDF
Understanding Forklifts - TECH EHS Solution
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
medical staffing services at VALiNTRY
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
 
PPTX
L1 - Introduction to python Backend.pptx
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
System and Network Administraation Chapter 3
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
System and Network Administration Chapter 2
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PPTX
ISO 45001 Occupational Health and Safety Management System
Understanding Forklifts - TECH EHS Solution
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
VVF-Customer-Presentation2025-Ver1.9.pptx
CHAPTER 2 - PM Management and IT Context
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
medical staffing services at VALiNTRY
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
 
L1 - Introduction to python Backend.pptx
How to Choose the Right IT Partner for Your Business in Malaysia
System and Network Administraation Chapter 3
Operating system designcfffgfgggggggvggggggggg
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Adobe Illustrator 28.6 Crack My Vision of Vector Design
System and Network Administration Chapter 2
Which alternative to Crystal Reports is best for small or large businesses.pdf
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
ISO 45001 Occupational Health and Safety Management System

Introduction to Machine learning

  • 1. Introduction to Machine Learning Introduction to Machine Learning Anurag Srivastava Software Consultant Knoldus Software LLP Anurag Srivastava Software Consultant Knoldus Software LLP
  • 2. Topics CoveredTopics Covered ● What is machine learning ● Different kinds of machine learning ● Key elements of machine learning ● Types of machine learning ● Techniques for machine learning
  • 3. What is Machine Learning ?What is Machine Learning ?
  • 4. What is Machine Learning ?What is Machine Learning ? Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data.
  • 5. What is Machine Learning ?What is Machine Learning ? Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data. Where we can used learning ? 1.Result vary every time. 2.Solution needs to be adapted to particular cases. 3.Human does not exist.
  • 6. Different kinds of machine learningDifferent kinds of machine learning ● Data Mining : Data Mining is the combination Artificial Intelligence and statistical analysis tools that are bringing together to discover hidden information in our data. There are many hidden information in data and these are : ● Association ● Sequence : Sequence for tie events to together. ● Classification : Classification for recognizing patterns. ● Forecasting : Forecasting is used for predicting on the based on their past pattern. ● Anomalies : Anomalies, outliers, frauds, many different types of things we can do. ● Grouping : Grouping of data ● Predictive Analysis : Predictive models and analysis are typically used to forecast future probabilities. Applied to business, predictive models are used to analyze current data and historical facts in order to better understand. It uses a number of techniques, including data mining, statistical modeling and machine learning to help analysts make future business forecasts.
  • 7. Different kinds of machine learningDifferent kinds of machine learning ● Advance Analytic : It is the autonomous or semi-autonomous process on data using sophisticated techniques and tools. Its beyond of traditional Business Intelligence. It helps to find more deeper information of data, to make prediction and generate recommendations. ● Data Science : Data science is an interdisciplinary field about processes and systems to extract knowledge or insights from data in various forms, either structured or unstructured,which is a continuation of some of the data analysis fields such as statistics, data mining, and predictive analytic, similar to Knowledge Discovery in Databases.
  • 8. Key elements of machine learningKey elements of machine learning ● Explore Data ● Find Patterns ● Performs Prediction
  • 9. Key elements of machine learningKey elements of machine learning ● Explore Data : 1. Labeled Data : Labeled data is a data with some meaningful “tag, label or class”. We know about the data and which type of operation performed on that data. 2. Unlabeled Data : Unlabeled data is a simple raw data. We do not know about the data and there is no explanation for that data.
  • 10. Key elements of machine learningKey elements of machine learning ➔ Explore Data : ➔ Data Preparation Process : This is very important part for the machine learning because when you feed them right data than it solve problem with accuracy. This is 3 step process : ➔ Select Data : In this process we select the subset data from the available data that you will be working. ➔ Preprocess Data : In this process we try to get selected data into the form that we can work. This is also 3 step process : 1. Formatting : It can be that data is not in a required format. We Format the data into relational database or in text file. 2. Cleaning : In this process we remove or fix missing data. It may be that data is incomplete or it may be contains sensitive data and these data need to be removed.
  • 11. Key elements of machine learningKey elements of machine learning Data Preparation Process Continue 
 3. Sampling : We use sampling for exploring and prototyping solution before perform the whole dataset because if we take whole dataset that time it took longer time to run algorithm and computational and memory requirement. ➔ Transform Data : This is the final step for data preparation. We use : 1. Scaling : Data may contain attribute with various quantities like dollars, kilogram. So data attributes have same scale such as between 0 and 1 for smallest and largest value. 2. Decomposition : In the data there may be complex concept which may be more meaningful when we split it. 3. Aggregation : There may be features that can be more meaningful when we aggregate them.
  • 12. Key elements of machine learningKey elements of machine learning ● Explore Data We divide data into 3 part : Training Data, Testing Data, Validating Data. Validating Data : Validation data doesn't always come into play. It's very useful when you have a model on your network when you have to do all the tuning and optimization of the parameters and layers and things like that.
  • 13. Key elements of machine learningKey elements of machine learning ● Explore Data ● Find Patterns ● Performs Prediction
  • 14. Key elements of machine learningKey elements of machine learning ● Explore Data ● Find Patterns ● Performs Prediction
  • 15. Types of machine learningTypes of machine learning ● Supervised Learning : Supervised learning is to build a model which can make prediction based on the the previous result. It provide labeled data. So we provide our inputs are provided along with their corresponding class variable, and our goal is to predict the evaluate. ● Unsupervised Learning : Unsupervised learning is data points have no labels associated with them. We don't have any prior knowledge of any information related to the data. We don't have provided class value or output value for each one of our vectors or instances. we are using this in applications of which training data comprises examples of the input without any corresponding target variable and the goal is to find the naturally co- occurring patterns such as groupings or clustering or segmentation.
  • 16. Types of machine learningTypes of machine learning ● Reinforcement learning: A computer program interacts with a dynamic environment in which it must perform a certain goal, without a teacher explicitly telling it whether it has come close to its goal. ● Semi-supervised learning : It uses unlabeled data for training, typically a small amount of labeled data with a large amount of unlabeled data.
  • 17. Technique for machine learningTechnique for machine learning Classification Algorithms - Naive Bayes Method Naive Bayes's rule is used for finding the probability of events. If we have events E and total number of instance H, So, we can calculate the probability of the events. Naive Bayes rule is : Pr[H|E]= (đ‘·đ‘· [đ‘· |đ‘·] đ‘·đ‘·[đ‘·]) / đ‘·đ‘·[đ‘·] Where, Evidence E = instance Event. H = class value for instance. Pr [H|E] = Probability of event after evidence has been seen.
  • 18. Problem For Naive Bayes's Method
  • 19. Problem For Naive Bayes's Method
  • 20. Find the probability condition with the data set : ● Pr[Outlook = Sunny | yes] = 2/9 ● Pr[Temp= Cool | yes] = 3/9 ● Pr[Humidity= High | yes] = 3/9 ● Pr[Windy = True | yes] = 3/9 ● Pr[yes] = 9/14 ● Pr[Outlook = Sunny | no] = 3/5 ● Pr[Temp= Cool | no] = 1/5 ● Pr[Humidity= High | no] = 4/5 ● Pr[Windy = True | no] = 3/5 ● Pr[no] = 5/14 Find the probability condition with the data set : ● Pr[Outlook = Sunny | yes] = 2/9 ● Pr[Temp= Cool | yes] = 3/9 ● Pr[Humidity= High | yes] = 3/9 ● Pr[Windy = True | yes] = 3/9 ● Pr[yes] = 9/14 ● Pr[Outlook = Sunny | no] = 3/5 ● Pr[Temp= Cool | no] = 1/5 ● Pr[Humidity= High | no] = 4/5 ● Pr[Windy = True | no] = 3/5 ● Pr[no] = 5/14 Problem For Naive Bayes's Method
  • 21. Problem For Naive Bayes's Method P(Yes | Sunny) = (2/9 * 3/9 * 3/9 * 3/9 * 9/14) = .0053 P(No | Sunny) = (3/5 * 1/5 *4/5 * 3/5 * 5/14) = .0206 Now we convert probabilities by normalization : P[YES] = (.0053) / (.0053 + .0206) = .205 P[NO] = (.0206) / (.0053 + .0206) = .795 So we can see that the probability for not playing tennis in the ~80%. This is the basic for the Machine Learning and Naive Bayes Method for doing prediction.

Editor's Notes

  • #5: science of creating algorithms and program which learn on their own. Once designed, they do not need a human to become better. Some of the common applications of machine learning include following: Web Search, spam filters, recommender systems, ad placement, credit scoring, fraud detection, stock trading, computer vision and drug design. An easy way to understand is this - it is humanly impossible to create models for every possible search or spam, so you make the machine intelligent enough to learn by itself. When you automate the later part of data mining - it is known as machine learning. E. Fredkin University Professor.
  • #6: 1.Solution vary every time (routing on a computer network) Humans are unable to explain their expertise (speech recognition) Human does not exist (navigating on Mars) needs to be adapted to particular cases (user biometrics) applications of machine learning include following: Web Search, spam filters, recommender systems, ad placement, credit scoring, fraud detection, stock trading,
  • #7: In data mining we combine AI and Statical Analysis(study of collection, organization, analysis and presentation of data),We find hidden information from the data like, Association, sequence, classification, forcasting anomalies and grouping. In association, data mining function that discovers the probability of the co-occurrence of items in a collection, In sequence, finding statistically relevant patterns between data examples where the values are delivered in a sequence. Predictive: This is a loosely used term. People running reporting also say that they are analysing data and so do predictive modelers. I would just take this as any attempt to make sense of data can be called as data analysis.
  • #8: Advance Analytic is does not use Business Intelligence. In BI we used earlier information like what happened, when happened but with the help of advance analytic we asked question what will happen, what will be the outcome. So basically with the help of Advance Analytic we work for the future changes. Data science is the future. It is combination of mathematics, statistics, programming, the context of the problem being solved, with the ways of capturing data that may not be being captured right now plus the ability to look at things
  • #9: There are two types of data 1. Labeled data(structured data, Images with name, sound with data) 2. Unlabeled data(unstructured data). 1. explore data : we explore whole data and clean it and remove all unnecessary data from the data. 3. Perform prediction : after that we apply the algorithm for prediction. We divide data into 3 part Training Data, Testing Data, validating Data. Re-substitution error when training and testing data are same. Validation data doesn't always come into play. It's very useful when you have a model on your network when you have to do all the tuning and optimization of the parameters and layers and things like that.
  • #10: There are two types of data 1. Labeled data(structured data, Images with name, sound with data) 2. Unlabeled data(unstructured data). 1. explore data : we explore whole data and clean it and remove all unnecessary data from the data. 2. Find pattern : we find the patterns between the data and than so we can apply algorithm on that. 3. Perform prediction : after that we apply the algorithm for prediction. We divide data into 3 part Training Data, Testing Data, validating Data. Re-substitution error when training and testing data are same. Validation data doesn't always come into play. It's very useful when you have a model on your network when you have to do all the tuning and optimization of the parameters and layers and things like that.
  • #11: There are two types of data 1. Labeled data(structured data, Images with name, sound with data) 2. Unlabeled data(unstructured data). 1. explore data : we explore whole data and clean it and remove all unnecessary data from the data. 2. Find pattern : we find the patterns between the data and than so we can apply algorithm on that. 3. Perform prediction : after that we apply the algorithm for prediction. We divide data into 3 part Training Data, Testing Data, validating Data. Re-substitution error when training and testing data are same. Validation data doesn't always come into play. It's very useful when you have a model on your network when you have to do all the tuning and optimization of the parameters and layers and things like that.
  • #12: Decomposition : time and date Aggregation : login count which allow user to how many time user can login.
  • #13: Re-substitution error : When training and testing data is same we find re-substitution error.
  • #14: 2. Find pattern : we find the patterns between the data and than so we can apply algorithm on that.
  • #15: 3. Perform prediction : after that we apply the algorithm for prediction.
  • #16: Supervised learning as we learn in the college. Unsupervised learning, on the other hand, allows us to approach problems with little or no idea what our results should look like. We can derive structure from data where we don't necessarily know the effect of the variables. We can derive this structure by clustering the data based on relationships among the variables in the data. there is no feedback based on the prediction results.
  • #17: Reinforcement Learning is the area of Machine Learning concerned with the actions that software agents ought to take in a particular environment in order to maximize rewards. You can apply Reinforcement Learning to robot control, chess, backgammon, checkers, and other activities that a software agent can learn. Reinforcement Learning uses behaviorist psychology in order to achieve reward maximization. Semi-supervised learning involves function estimation on labeled and unlabeled data. This approach is motivated by the fact that labeled data is often costly to generate, whereas unlabeled data is generally not. The challenge here mostly involves the technical question of how to treat data mixed in this fashion.
  • #18: Bayes's Rule says if you have a hypothesis H, and an evidence E, that bares on that hypothesis, then we can use this notation that the probability of hypothesis versus the evidence. And we can calculate the probability of the posterior probability and the conditional event of hypothesis, and so the probability of H/E is going to turn out to be result.
  • #22: Web search: ranking page based on what you are most likely to click on. Computational biology: rational design drugs in the computer based on past experiments. Finance: decide who to send what credit card offers to. Evaluation of risk on credit offers. How to decide where to invest money. E-commerce: Predicting customer churn. Whether or not a transaction is fraudulent. Space exploration: space probes and radio astronomy. Robotics: how to handle uncertainty in new environments. Autonomous. Self-driving car. Information extraction: Ask questions over databases across the web. Social networks: Data on relationships and preferences. Machine learning to extract value from data. Debugging: Use in computer science problems like debugging. Labor intensive process. Could suggest where the bug could be.