SlideShare a Scribd company logo
MIS End Term Exam for AI and A2
on Thursday ,6th February 2020
Concepts
Please read the handwritten scanned
notes also
End Term Syllabus
• Decision Trees – Graphical Representation – Root
Node , Internal Nodes , Leaf
• Level ,Height and Depth of Node
• Decision Tree – As Classifier
• Classifier Case Study /Numerical using 3 Classifier
Algorithms
- ID3
- CART
-Naïve Bayes
End Term Syllabus Cont
• Supervised , Unsupervised and Reinforced
Learning
• Unsupervised Learning – Clustering – use of K
Means in numerical example
• Bias & Variance , Overfitting and Underfitting
Trees with reference to Training and Test Data
• Ensemble Learning Technique using Bagging
• Random Forest
How does id3 algorithm work?
The ID3 algorithm begins with the original set as the root node. On
each iteration of the algorithm, it iterates through every unused
attribute of the set and calculates the entropy or the information gain
of that attribute. ... The set is then split or partitioned by the selected
attribute to produce subsets of the data.
What is id3 in decision tree?
Very simply, ID3 builds a decision tree from a fixed set of examples. ...
The leaf nodes of the decision tree contain the class name whereas a
non-leaf node is a decision node. The decision node is an attribute test
with each branch (to another decision tree) being a possible value of
the attribute.
How does CART algorithm work?
A Classification and Regression Tree(CART) is a predictive algorithm used
in machine learning. It explains how a target variable's values can be
predicted based on other values. It is a decision tree where each fork is a
split in a predictor variable and each node at the end has a prediction for
the target variable
What is CART technique?
Decision Trees
Classification and Regression Trees or CART for short is a term introduced
by Leo Breiman to refer to Decision Tree algorithms that can be used for
classification or regression predictive modeling problems.
What is a cart analysis?
Classification and regression tree (CART) analysis recursively
partitions observations in a matched data set, consisting of a
categorical (for classification trees) or continuous (for regression
trees) dependent (response) variable and one or more independent
(explanatory) variables, into progressively smaller groups
What is Naive Bayes classifier
algorithm?
Naive Bayes classifiers are a collection of
classification algorithms based on Bayes' Theorem. It is not a
single algorithm but a family of algorithms where all of them share a
common principle, i.e. every pair of features being classified is
independent of each other.
What is K means clustering used for?
Business Uses
The K-means clustering algorithm is used to find groups which have
not been explicitly labelled in the data. ... Once the algorithm has
been run and the groups are defined, any new data can be easily
assigned to the correct group. This is a versatile algorithm that can
be used for any type of grouping.
Mis End Term Exam Theory Concepts
What is K means clustering in data
mining?
Data mining with the K-Means algorithm
The k-means clustering algorithm is a data mining and machine
learning tool used to cluster observations into groups of related
observations without any prior knowledge of those relationships. ...
The term "k-means" was coined in 1967 by James McQueen
How is K means calculated?
K-Means Clustering
Select k points at random as cluster centers. Assign objects to
their closest cluster center according to the Euclidean distance
function. Calculate the centroid or mean of all objects in each
cluster. Repeat steps 2, 3 and 4 until the same points are assigned
to each cluster in consecutive rounds.
What is a supervised learning model?
Supervised learning is a learning model built to make prediction,
given an unforeseen input instance. A supervised
learning algorithm takes a known set of input dataset and its
known responses to the data (output) to learn the
regression/classification model.
How does supervised machine
learning work?
Supervised machine learning builds a model that makes predictions
based on evidence in the presence of uncertainty. A supervised
learning algorithm takes a known set of input data and known
responses to the data (output) and trains a model to generate
reasonable predictions for the response to new data.
What is meant by unsupervised
learning?
Unsupervised learning is a type of machine learning algorithm used
to draw inferences from datasets consisting of input data without
labeled responses. The most common unsupervised
learning method is cluster analysis, which is used for exploratory
data analysis to find hidden patterns or grouping in data.
How does unsupervised learning
work?
In unsupervised learning, an AI system is presented with unlabelled,
uncategorized data and the system's algorithms act on the data without
prior training. ... In essence, unsupervised learning can be thought of
as learning without a teacher. In case of supervised learning, the system
has both the inputs and the outputs.
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory Concepts
What is meant by reinforcement
learning?
Reinforcement Learning is a type of Machine Learning, and thereby
also a branch of Artificial Intelligence. It allows machines and software
agents to automatically determine the ideal behaviour within a specific
context, in order to maximize its performance.
Mis End Term Exam Theory Concepts
What is reinforcement learning used
for?
Reinforcement learning is a type of Machine Learning algorithms
which allows software agents and machines to automatically determine
the ideal behaviour within a specific context, to maximize its
performance. An extensive blog post, which you can read here - How
businesses can leverage reinforcement learning?
What is bias and variance?
Bias is the simplifying assumptions made by the model to make the
target function easier to approximate. Variance is the amount that
the estimate of the target function will change given different
training data. Trade-off is tension between the error introduced by
the bias and the variance.
Mis End Term Exam Theory Concepts
What is bias and variance in
Supervised learning?
The bias–variance dilemma or bias–variance problem is the conflict in
trying to simultaneously minimize these two sources of error that
prevent supervised learning algorithms from generalizing beyond their
training set: The bias error is an error from erroneous assumptions in
the learning algorithm.
What is bias and variance in Decision
Trees?
Q: Explain the bias vs. variance trade-off in statistical learning. A:
The bias-variance trade-off is an important aspect of data
science projects based on machine learning. ... The error due to
squared bias is the amount by which the expected model prediction
differs from the true value or target, over the training data.
What is Underfitting and Overfitting in
Decision Trees ?
Overfitting: Good performance on the training data, poor
generalization to other data. Underfitting: Poor performance on the
training data and poor generalization to other data.
Mis End Term Exam Theory Concepts
What is meant by Overfitting of data?
Overfitting is a modelling error which occurs when a function is too
closely fit to a limited set of data points. Overfitting the model
generally takes the form of making an overly complex model to
explain idiosyncrasies in the data under study.
How do you deal with Underfitting?
According to Andrew Ng, the best methods of dealing with
an underfitting model is trying a bigger neural network (adding new layers
or increasing the number of neurons in existing layers) or training the
model a little bit longer.
What are the differences between
Overfitting and Underfitting?
Overfitting is a modelling error which occurs when a function is too
closely fit to a limited set of data points. Underfitting refers to a model
that can neither model the training data nor generalize to new data. ...
Intuitively, underfitting occurs when the model or the algorithm does
not fit the data well enough.
What is Ensemble Learning?
Ensemble Learning is a process using which multiple machine
learning models (such as classifiers) are strategically constructed to
solve a particular problem. Let's take a real example to build the
intuition. Suppose, you want to invest in a company XYZ. You are not
sure about its performance though.
Mis End Term Exam Theory Concepts
How do ensemble models work?
What is an ensemble method? Ensemble models in machine
learning combine the decisions from multiple models to improve the
overall performance. They operate on the similar idea as employed
while buying headphones. The main causes of error in
learning models are due to noise, bias and variance.
Random Forest Model
Random forests, otherwise known as the random forest model, is a
method for classification and other tasks. It operates from decision trees
and outputs classification of the individual trees. Random forests correct
for the habit of decision trees to overfit to their training set.
Mis End Term Exam Theory Concepts
What is test data and train data?
The training data set is a grouped set of examples that are used to fit
the parameters. The test data set is the last evaluation of the final
model fit on the training data set. So like the last test before it's the real
deal. Testing against each other ensures the machine learning model
will be more accurate.
What is meant by test data?
Test data is the data that is used in tests of a software system. ...
When test data is entered the expected result should come and
some test data is used to verify the software behaviour to invalid
input data. Test data is generated by testers or by automation tools
which support testing.
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory Concepts

More Related Content

PPTX
Machine Learning and Real-World Applications
PDF
Machine Learning - Deep Learning
PDF
Applications in Machine Learning
PDF
Introduction to machine learning
PPTX
Machine Learning
PPT
MachineLearning.ppt
DOC
DagdelenSiriwardaneY..
PPTX
Machine learning
Machine Learning and Real-World Applications
Machine Learning - Deep Learning
Applications in Machine Learning
Introduction to machine learning
Machine Learning
MachineLearning.ppt
DagdelenSiriwardaneY..
Machine learning

What's hot (17)

PPT
Machine Learning: Foundations Course Number 0368403401
PPTX
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
PPTX
Machine Learning Unit 1 Semester 3 MSc IT Part 2 Mumbai University
PPTX
Learning in AI
PPT
Machine Learning presentation.
DOC
Chaptr 7 (final)
PPT
Learning
PPT
Basics of Machine Learning
PPTX
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
PPT
Data Mining
PPTX
Introduction to Machine Learning
PPTX
Internship project presentation_final_upload
PPT
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
PPT
LearningAG.ppt
PPT
Classification and prediction
PPTX
Machine learning
Machine Learning: Foundations Course Number 0368403401
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
Machine Learning Unit 1 Semester 3 MSc IT Part 2 Mumbai University
Learning in AI
Machine Learning presentation.
Chaptr 7 (final)
Learning
Basics of Machine Learning
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Data Mining
Introduction to Machine Learning
Internship project presentation_final_upload
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
LearningAG.ppt
Classification and prediction
Machine learning
Ad

Similar to Mis End Term Exam Theory Concepts (20)

PPTX
Machine Learning Approach.pptx
PPT
PPT
ai4.ppt
PPT
ai4.ppt
PPT
Introduction to Machine Learning Aristotelis Tsirigos
PPT
classification in data warehouse and mining
PPTX
Machine learning(UNIT 4)
PPTX
Primer on major data mining algorithms
PDF
Machine Learning Interview Questions and Answers
PPTX
20211229120253D6323_PERT 06_ Ensemble Learning.pptx
PDF
CPSC 340: Machine Learning and Data Mining K-Means Clustering Andreas Lehrman...
PPTX
Forms of learning in ai
PPTX
Introduction to Machine Learning Concepts
PPTX
DECISION TREE AND PROBABILISTIC MODELS.pptx
PPT
Supervised and unsupervised learning
PPTX
Machine learning and decision trees
PDF
Machine Learning Unit-5 Decesion Trees & Random Forest.pdf
PDF
IRJET- Performance Evaluation of Various Classification Algorithms
PDF
IRJET- Performance Evaluation of Various Classification Algorithms
PDF
BIM Data Mining Unit3 by Tekendra Nath Yogi
Machine Learning Approach.pptx
ai4.ppt
ai4.ppt
Introduction to Machine Learning Aristotelis Tsirigos
classification in data warehouse and mining
Machine learning(UNIT 4)
Primer on major data mining algorithms
Machine Learning Interview Questions and Answers
20211229120253D6323_PERT 06_ Ensemble Learning.pptx
CPSC 340: Machine Learning and Data Mining K-Means Clustering Andreas Lehrman...
Forms of learning in ai
Introduction to Machine Learning Concepts
DECISION TREE AND PROBABILISTIC MODELS.pptx
Supervised and unsupervised learning
Machine learning and decision trees
Machine Learning Unit-5 Decesion Trees & Random Forest.pdf
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification Algorithms
BIM Data Mining Unit3 by Tekendra Nath Yogi
Ad

Recently uploaded (20)

PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PPTX
Cell Structure & Organelles in detailed.
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Pharma ospi slides which help in ospi learning
PDF
01-Introduction-to-Information-Management.pdf
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Basic Mud Logging Guide for educational purpose
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Cell Structure & Organelles in detailed.
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
human mycosis Human fungal infections are called human mycosis..pptx
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Microbial diseases, their pathogenesis and prophylaxis
Pharmacology of Heart Failure /Pharmacotherapy of CHF
TR - Agricultural Crops Production NC III.pdf
Complications of Minimal Access Surgery at WLH
Pharma ospi slides which help in ospi learning
01-Introduction-to-Information-Management.pdf
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
VCE English Exam - Section C Student Revision Booklet
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Anesthesia in Laparoscopic Surgery in India
Basic Mud Logging Guide for educational purpose

Mis End Term Exam Theory Concepts

  • 1. MIS End Term Exam for AI and A2 on Thursday ,6th February 2020 Concepts Please read the handwritten scanned notes also
  • 2. End Term Syllabus • Decision Trees – Graphical Representation – Root Node , Internal Nodes , Leaf • Level ,Height and Depth of Node • Decision Tree – As Classifier • Classifier Case Study /Numerical using 3 Classifier Algorithms - ID3 - CART -Naïve Bayes
  • 3. End Term Syllabus Cont • Supervised , Unsupervised and Reinforced Learning • Unsupervised Learning – Clustering – use of K Means in numerical example • Bias & Variance , Overfitting and Underfitting Trees with reference to Training and Test Data • Ensemble Learning Technique using Bagging • Random Forest
  • 4. How does id3 algorithm work? The ID3 algorithm begins with the original set as the root node. On each iteration of the algorithm, it iterates through every unused attribute of the set and calculates the entropy or the information gain of that attribute. ... The set is then split or partitioned by the selected attribute to produce subsets of the data.
  • 5. What is id3 in decision tree? Very simply, ID3 builds a decision tree from a fixed set of examples. ... The leaf nodes of the decision tree contain the class name whereas a non-leaf node is a decision node. The decision node is an attribute test with each branch (to another decision tree) being a possible value of the attribute.
  • 6. How does CART algorithm work? A Classification and Regression Tree(CART) is a predictive algorithm used in machine learning. It explains how a target variable's values can be predicted based on other values. It is a decision tree where each fork is a split in a predictor variable and each node at the end has a prediction for the target variable
  • 7. What is CART technique? Decision Trees Classification and Regression Trees or CART for short is a term introduced by Leo Breiman to refer to Decision Tree algorithms that can be used for classification or regression predictive modeling problems.
  • 8. What is a cart analysis? Classification and regression tree (CART) analysis recursively partitions observations in a matched data set, consisting of a categorical (for classification trees) or continuous (for regression trees) dependent (response) variable and one or more independent (explanatory) variables, into progressively smaller groups
  • 9. What is Naive Bayes classifier algorithm? Naive Bayes classifiers are a collection of classification algorithms based on Bayes' Theorem. It is not a single algorithm but a family of algorithms where all of them share a common principle, i.e. every pair of features being classified is independent of each other.
  • 10. What is K means clustering used for? Business Uses The K-means clustering algorithm is used to find groups which have not been explicitly labelled in the data. ... Once the algorithm has been run and the groups are defined, any new data can be easily assigned to the correct group. This is a versatile algorithm that can be used for any type of grouping.
  • 12. What is K means clustering in data mining? Data mining with the K-Means algorithm The k-means clustering algorithm is a data mining and machine learning tool used to cluster observations into groups of related observations without any prior knowledge of those relationships. ... The term "k-means" was coined in 1967 by James McQueen
  • 13. How is K means calculated? K-Means Clustering Select k points at random as cluster centers. Assign objects to their closest cluster center according to the Euclidean distance function. Calculate the centroid or mean of all objects in each cluster. Repeat steps 2, 3 and 4 until the same points are assigned to each cluster in consecutive rounds.
  • 14. What is a supervised learning model? Supervised learning is a learning model built to make prediction, given an unforeseen input instance. A supervised learning algorithm takes a known set of input dataset and its known responses to the data (output) to learn the regression/classification model.
  • 15. How does supervised machine learning work? Supervised machine learning builds a model that makes predictions based on evidence in the presence of uncertainty. A supervised learning algorithm takes a known set of input data and known responses to the data (output) and trains a model to generate reasonable predictions for the response to new data.
  • 16. What is meant by unsupervised learning? Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses. The most common unsupervised learning method is cluster analysis, which is used for exploratory data analysis to find hidden patterns or grouping in data.
  • 17. How does unsupervised learning work? In unsupervised learning, an AI system is presented with unlabelled, uncategorized data and the system's algorithms act on the data without prior training. ... In essence, unsupervised learning can be thought of as learning without a teacher. In case of supervised learning, the system has both the inputs and the outputs.
  • 20. What is meant by reinforcement learning? Reinforcement Learning is a type of Machine Learning, and thereby also a branch of Artificial Intelligence. It allows machines and software agents to automatically determine the ideal behaviour within a specific context, in order to maximize its performance.
  • 22. What is reinforcement learning used for? Reinforcement learning is a type of Machine Learning algorithms which allows software agents and machines to automatically determine the ideal behaviour within a specific context, to maximize its performance. An extensive blog post, which you can read here - How businesses can leverage reinforcement learning?
  • 23. What is bias and variance? Bias is the simplifying assumptions made by the model to make the target function easier to approximate. Variance is the amount that the estimate of the target function will change given different training data. Trade-off is tension between the error introduced by the bias and the variance.
  • 25. What is bias and variance in Supervised learning? The bias–variance dilemma or bias–variance problem is the conflict in trying to simultaneously minimize these two sources of error that prevent supervised learning algorithms from generalizing beyond their training set: The bias error is an error from erroneous assumptions in the learning algorithm.
  • 26. What is bias and variance in Decision Trees? Q: Explain the bias vs. variance trade-off in statistical learning. A: The bias-variance trade-off is an important aspect of data science projects based on machine learning. ... The error due to squared bias is the amount by which the expected model prediction differs from the true value or target, over the training data.
  • 27. What is Underfitting and Overfitting in Decision Trees ? Overfitting: Good performance on the training data, poor generalization to other data. Underfitting: Poor performance on the training data and poor generalization to other data.
  • 29. What is meant by Overfitting of data? Overfitting is a modelling error which occurs when a function is too closely fit to a limited set of data points. Overfitting the model generally takes the form of making an overly complex model to explain idiosyncrasies in the data under study.
  • 30. How do you deal with Underfitting? According to Andrew Ng, the best methods of dealing with an underfitting model is trying a bigger neural network (adding new layers or increasing the number of neurons in existing layers) or training the model a little bit longer.
  • 31. What are the differences between Overfitting and Underfitting? Overfitting is a modelling error which occurs when a function is too closely fit to a limited set of data points. Underfitting refers to a model that can neither model the training data nor generalize to new data. ... Intuitively, underfitting occurs when the model or the algorithm does not fit the data well enough.
  • 32. What is Ensemble Learning? Ensemble Learning is a process using which multiple machine learning models (such as classifiers) are strategically constructed to solve a particular problem. Let's take a real example to build the intuition. Suppose, you want to invest in a company XYZ. You are not sure about its performance though.
  • 34. How do ensemble models work? What is an ensemble method? Ensemble models in machine learning combine the decisions from multiple models to improve the overall performance. They operate on the similar idea as employed while buying headphones. The main causes of error in learning models are due to noise, bias and variance.
  • 35. Random Forest Model Random forests, otherwise known as the random forest model, is a method for classification and other tasks. It operates from decision trees and outputs classification of the individual trees. Random forests correct for the habit of decision trees to overfit to their training set.
  • 37. What is test data and train data? The training data set is a grouped set of examples that are used to fit the parameters. The test data set is the last evaluation of the final model fit on the training data set. So like the last test before it's the real deal. Testing against each other ensures the machine learning model will be more accurate.
  • 38. What is meant by test data? Test data is the data that is used in tests of a software system. ... When test data is entered the expected result should come and some test data is used to verify the software behaviour to invalid input data. Test data is generated by testers or by automation tools which support testing.