Sanjivani Rural Education Society’s
Sanjivani College of Engineering, Kopargaon-423 603
(An Autonomous Institute, Affiliated to Savitribai Phule Pune University, Pune)
NAAC ‘A’ Grade Accredited, ISO 9001:2015 Certified
Department of Computer Engineering
(NBA Accredited)
Prof. S. A. Shivarkar
Assistant Professor
Contact No.8275032712
Email- shivarkarsandipcomp@sanjivani.org.in
Subject- Unsupervised Modeling for AIML (CO9301)
Content
 Lectures: 4 Hrs/Week
 Credits: 4
 Examination Scheme:
 CIA: 40
 End Semester: 60
Course Outcome
 Understand project management methodology and
Exploratory data analysis.
 Apply feature engineering techniques.
 Apply clustering techniques.
 Apply dimensionality reduction techniques
 Apply association rules and recommendation system
techniques.
 Apply text mining and NLP Techniques.
Course Objective
 To learn CRISP-ML(Q) method of machine learning models
 To understand Clustering, dimensionality reduction
 To learn Association rules and recommendation system
 To understand various NLP strategies
 To learn how to evaluate the models and performance metrics
Unit I: Requirement to Machine Learning
 Project management methodology (CRISP-ML(Q)), Prescriptive
Analytics, Predictive Analytics, Diagnostic Analytics, Descriptive
Analytics, introduction to data types, measurement levels,
measures of central tendency, expected value, exploratory data
analysis, number summary, boxplot, bar graph, histogram,
correlation graph, scatter plots, exploring two or more
variables, data sampling and its types, various types of bias.
Unit II: Feature Engineering Techniques
 Dummy variable conversion techniques, standardization and
normalization, outlier identification and outlier treatment
techniques, skewness identification and its treatment, finding
null values and their treatment.
Unit III: Unsupervised Learning-Clustering
 Supervised vs. unsupervised learning, clustering/segmentation
algorithms (hierarchical), distance metrics for categorical data,
distance metrics for continuous data, distance metrics for mixed
data, distance between clusters, k-means clustering, selecting k with
the elbow curve, drawbacks and comparison.
Unit IV: Unsupervised Learning -Dimensionality Reduction
 Need for dimensionality reduction, Principal Component
Analysis (PCA), applications of PCA, Singular Value
Decomposition (SVD), applications of SVD.
Unit V: Unsupervised Learning - Association rules and recommendation system
 Market basket analysis, association rules intuition, association
rules applications, association rules terminology, need for
recommendation systems, similarity measures, user-based
recommendation system, item-to-item collaborative filtering.
Unit VI: Text Mining-Sentiment Analysis and NLP
 Need for text mining, bag of words, terminology and
preprocessing, DTM and TDM, corpus-level word cloud.
Introduction to NLP, data preprocessing in the NLP context, NLP
terminology, feature extraction from text, topic modeling,
vector representation.
Project management methodology(CRISP-ML (Q))
 Overall, the CRISP-ML(Q) process model describes six phases:
1. Business and Data Understanding
2. Data Engineering (Data Preparation)
3. Machine Learning Model Engineering
4. Quality Assurance for Machine Learning Applications
5. Deployment
6. Monitoring and Maintenance.
 Business and Data Understanding:
 Developing machine learning applications starts with
identifying the scope of the ML application, defining the success
criteria, and verifying data quality.
 The goal of this first phase is to ensure the feasibility of the
project.
 Defining clear and measurable Key Performance Indicators
(KPI) such as “time savings per user and session” is required.
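A KPI like the one above only helps if it can be computed from logged data. The following is a minimal sketch, assuming hypothetical session logs and a hypothetical target of 10 seconds saved per session; neither the data nor the target comes from the slides.

```python
from statistics import mean

# Hypothetical session logs: (user_id, seconds_without_ml, seconds_with_ml).
sessions = [
    ("u1", 120.0, 90.0),
    ("u1", 100.0, 80.0),
    ("u2", 150.0, 135.0),
    ("u3", 90.0, 95.0),
]

def time_savings_per_session(logs):
    """Average seconds saved per session (positive means the ML feature helps)."""
    return mean(before - after for _, before, after in logs)

def time_savings_per_user(logs):
    """Average seconds saved per session, computed separately for each user."""
    per_user = {}
    for user, before, after in logs:
        per_user.setdefault(user, []).append(before - after)
    return {user: mean(deltas) for user, deltas in per_user.items()}

# Assumed success criterion: save at least 10 seconds per session on average.
TARGET_SECONDS_SAVED = 10.0

kpi = time_savings_per_session(sessions)
kpi_met = kpi >= TARGET_SECONDS_SAVED
print(f"time saved per session: {kpi:.2f}s, target met: {kpi_met}")
print(time_savings_per_user(sessions))
```

Fixing the target before modeling starts gives the later evaluation phase a concrete criterion to check against.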
 Machine Learning Model Engineering
 The modeling phase includes model selection, model
specialization, and model training tasks.
 Additionally, depending on the application, we might use a pre-trained
model, compress the model, or apply ensemble learning methods to get the
final ML model.
 Many phases in ML development are iterative.
 Sometimes, we might need to revisit the business goals, KPIs, and available
data from the previous steps in order to adjust the ML model and its results.
 Finally, we package the ML workflow in a pipeline to create repeatable model
training during the modeling phase.
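One way to picture a repeatable training pipeline is as a fixed sequence of steps that always runs in the same order. The sketch below is an illustration only: the nearest-centroid model, the function names, and the toy data are assumptions chosen to keep the example short, not the method prescribed by CRISP-ML(Q).

```python
from statistics import mean, pstdev

def standardize(rows):
    """Scale each column to zero mean and unit variance."""
    columns = list(zip(*rows))
    # Guard against constant columns by falling back to a scale of 1.0.
    stats = [(mean(c), pstdev(c) or 1.0) for c in columns]
    return [[(v - m) / s for v, (m, s) in zip(row, stats)] for row in rows]

def train_nearest_centroid(rows, labels):
    """Train a toy model: one mean vector (centroid) per class label."""
    grouped = {}
    for row, label in zip(rows, labels):
        grouped.setdefault(label, []).append(row)
    return {label: [mean(col) for col in zip(*members)]
            for label, members in grouped.items()}

def training_pipeline(rows, labels):
    """Run every step in a fixed order so each training run is repeatable."""
    prepared = standardize(rows)
    return train_nearest_centroid(prepared, labels)

# Hypothetical training data.
rows = [[1.0, 10.0], [2.0, 20.0], [8.0, 80.0], [9.0, 90.0]]
labels = ["low", "low", "high", "high"]

model_a = training_pipeline(rows, labels)
model_b = training_pipeline(rows, labels)
print(model_a == model_b)  # the same input yields the same model
```

Because preprocessing and training live in one function, a re-run after new data arrives applies exactly the same steps.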
 Evaluating Machine Learning Models
 Model training is followed by a model evaluation phase, also known as offline testing.
 During this phase, the performance of the trained model needs to be validated on a
test set.
 Additionally, the model robustness should be assessed using noisy or wrong input data.
 Finally, the model deployment decision should be made automatically based on success
criteria or manually by domain and ML experts. As in the modeling phase, all
outcomes of the evaluation phase need to be documented.
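The evaluation steps above can be sketched as an automatic deployment gate. Everything here is an assumption for illustration: the threshold "model", the test set, the two success criteria, and the fixed perturbation pattern, which stands in for random noise so that the run is reproducible.

```python
def predict(x, threshold=0.5):
    """Stand-in for a trained model: label a score 1 if it exceeds a threshold."""
    return 1 if x > threshold else 0

def accuracy(inputs, targets):
    """Fraction of inputs whose prediction matches the target."""
    correct = sum(predict(x) == y for x, y in zip(inputs, targets))
    return correct / len(inputs)

# Hypothetical held-out test set (offline testing).
test_inputs = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
test_targets = [0, 0, 0, 0, 1, 1, 1, 1]

# Robustness check: perturb every input by a fixed offset pattern.
offsets = [-0.15, 0.15] * 4
noisy_inputs = [x + d for x, d in zip(test_inputs, offsets)]

clean_acc = accuracy(test_inputs, test_targets)
noisy_acc = accuracy(noisy_inputs, test_targets)

# Assumed success criteria, fixed before evaluation.
MIN_ACCURACY = 0.9
MAX_ROBUSTNESS_DROP = 0.2

deploy = (clean_acc >= MIN_ACCURACY
          and (clean_acc - noisy_acc) <= MAX_ROBUSTNESS_DROP)
print(f"clean={clean_acc:.2f} noisy={noisy_acc:.2f} deploy={deploy}")
```

In this sketch the model passes the accuracy criterion on clean data but loses too much accuracy under perturbation, so the gate blocks deployment.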
 Deployment:
 The ML model deployment denotes a process of the ML model integration
into the existing software system.
 After passing the evaluation step in the ML development life cycle, the
ML model is promoted to the (pre-)production environment.
 ML model deployment includes the following tasks: defining the inference
hardware, evaluating the model in a production environment (online testing, e.g.,
A/B tests), providing user acceptance and usability testing, providing a fallback
plan for model outages, and setting up a deployment strategy to roll
out the new model gradually (e.g. canary or blue/green deployment).
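A gradual rollout with a fallback plan can be sketched as a small request router. This is a simplified illustration, assuming hypothetical stand-in models and a hash-based user split; real deployments usually rely on a serving platform or load balancer for this.

```python
import hashlib

def stable_model(x):
    """Stand-in for the model currently in production."""
    return x * 2

def canary_model(x):
    """Stand-in for the new model; fails on negative input to simulate an outage."""
    if x < 0:
        raise RuntimeError("canary model unavailable")
    return x * 2 + 1

def in_canary_group(user_id, canary_percent):
    """Deterministically assign a user to the canary group by hashing the id."""
    digest = hashlib.sha256(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % 100 < canary_percent

def serve(user_id, x, canary_percent):
    """Route a request; fall back to the stable model if the canary fails."""
    if in_canary_group(user_id, canary_percent):
        try:
            return "canary", canary_model(x)
        except RuntimeError:
            return "fallback", stable_model(x)
    return "stable", stable_model(x)

print(serve("user-1", 3, 0))     # nobody is in the canary group yet
print(serve("user-1", 3, 100))   # full rollout
print(serve("user-1", -1, 100))  # canary outage triggers the fallback
```

Hashing the user id keeps each user on the same model between requests, and raising `canary_percent` step by step only ever adds users to the canary group.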
 Monitoring and Maintenance
 Once the ML model has been put into production, it is essential to monitor its performance
and maintain it.
 When an ML model runs on real-world data, the main risk is the “model staleness”
effect, in which the performance of the ML model drops as it starts operating on unseen data.
 Furthermore, model performance is affected by hardware performance and the existing
software stack.
 Therefore, the best practice for guarding against a drop in model performance is to run
a monitoring task in which the model performance is continuously evaluated to decide
whether the model needs to be re-trained.
 This is known as the Continued Model Evaluation pattern.
 The decision from the monitoring task leads to the second task - updating the ML model.
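The monitoring task described above can be sketched as a sliding-window accuracy check that raises a retraining flag. The class name, window size, threshold, and prediction stream below are all hypothetical choices for illustration.

```python
from collections import deque

class ContinuedEvaluation:
    """Track accuracy over the most recent predictions and flag retraining."""

    def __init__(self, window_size, min_accuracy):
        # deque with maxlen drops the oldest outcome automatically.
        self.outcomes = deque(maxlen=window_size)
        self.min_accuracy = min_accuracy

    def record(self, prediction, actual):
        """Store whether a production prediction matched the observed outcome."""
        self.outcomes.append(prediction == actual)

    def accuracy(self):
        """Accuracy over the current window, or None before any data arrives."""
        if not self.outcomes:
            return None
        return sum(self.outcomes) / len(self.outcomes)

    def needs_retraining(self):
        """Flag retraining only once the window is full and accuracy is too low."""
        full = len(self.outcomes) == self.outcomes.maxlen
        return full and self.accuracy() < self.min_accuracy

monitor = ContinuedEvaluation(window_size=4, min_accuracy=0.75)

# Hypothetical stream of (prediction, actual) pairs; quality degrades over time.
stream = [(1, 1), (0, 0), (1, 1), (1, 1), (1, 0), (0, 1), (1, 0)]
alerts = []
for prediction, actual in stream:
    monitor.record(prediction, actual)
    alerts.append(monitor.needs_retraining())

print(alerts)
```

The flag is the output of the first task (monitoring); acting on it by retraining and redeploying is the second task (updating the ML model).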
Reference
 https://ml-ops.org/content/crisp-ml