SlideShare a Scribd company logo
Machine learning workshop using Orange datamining framework
Workshop on Orange :
Data mining framework, data visualization,
and data analytics.
Introduced by : Amr Rashed
Lecturer, Department of Computer Engineering,
College of Computers and Information Technology, Taif University
M.Sc., Electronics & Communication Engineering Faculty of Engineering,
Mansoura University
Agenda
Topics:
Introduction & overview
Application 1:
Fault Detection for Attaining Service
Continuity of Photovoltaic Power
System
Application 2:
Data Mining for Diagnosis of Breast
Cancer in Medical Ultrasonic Images
Project 1
Project 2
Applied
Machine
Learning
Process
Improve Improve Results
Present Present Results
Spot Spot Check Algorithms
Prepare Prepare Data
Define Define the Problem
Define the
Problem
Step 1: What is the problem?
Step 2: Why does the problem
need to be solved?
Step 3: How would I solve the
problem?
Data Preparation Techniques
1.Common Data
Preparation
Tasks
1.Data Cleaning
1.Feature
Selection
1.Data
Transforms
1.Feature
Engineering
1.Dimensionality
Reduction
Common Data Preparation Tasks
•Step 1:
Define
Problem.
•Step 2:
Prepare Data.
•Step 3:
Evaluate
Models.
•Step 4:
Finalize
Model.
Data
Cleaning
Data cleaning involves fixing systematic problems
or errors in “messy” data.
Using statistics to define normal data and identify
outliers.
Identifying columns that have the same value or no
variance and removing them
Identifying duplicate rows of data and removing
them.
Marking empty values as missing.
Imputing missing values using statistics or a learned
model.
Overview of data cleaning
Overview of
feature
selection
techniques
Overview of
Data
Transforms
Feature Engineering
Feature
engineering is
the process of
creating new
input variables
from the
available data.
Adding a Boolean flag variable for some state.
Adding a group or global summary statistic, such as a mean.
Adding new variables for each component of a compound variable, such as a
date-time.
Polynomial Transform: Create copies of numerical input variables that are
raised to a power(raising them to a power or multiplied with other input
variables).
Overview of
Dimensionality
Reduction
Techniques
Overview of data types
Orange installation
Cont.
Cont.
Cont.
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Create Dataset for the First Project
(250kw grid connected PV array)
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Example
Machine learning workshop using Orange datamining framework
MATLAB Script
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Project 2
• Data Mining for Diagnosis of Breast
Cancer in Medical Ultrasonic Images
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework
Machine learning workshop using Orange datamining framework

More Related Content

ODP
Computer Vision for Traffic Sign Recognition
PPTX
Orange Data Mining & Data Visualization Tool
PPTX
Traffic sign detection
PDF
Rangkuman Rumus Parabola, Elips, Hiperbola
PPSX
Data Mining Tools / Orange
PPTX
The publishing process
PPTX
Type i and type ii errors
Computer Vision for Traffic Sign Recognition
Orange Data Mining & Data Visualization Tool
Traffic sign detection
Rangkuman Rumus Parabola, Elips, Hiperbola
Data Mining Tools / Orange
The publishing process
Type i and type ii errors

What's hot (20)

PDF
Quantum Computing: The Why and How
PPT
Back propagation
PPTX
Support vector machine
PPTX
Practical Swarm Optimization (PSO)
DOC
Electronics in daily life
DOCX
Learning Methods in a Neural Network
PPTX
PPTX
Machine Learning
PPT
Ibm quantum computing
PDF
Smart Energy
PPTX
Random forest
PPTX
RNN-LSTM.pptx
PDF
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
PPTX
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
PPTX
Introduction to artificial neural network
DOCX
Solar power bank
PDF
Deep learning and Healthcare
PPTX
Artificial intelligence in power systems
PDF
Understanding random forests
PPTX
Lect6 Association rule & Apriori algorithm
Quantum Computing: The Why and How
Back propagation
Support vector machine
Practical Swarm Optimization (PSO)
Electronics in daily life
Learning Methods in a Neural Network
Machine Learning
Ibm quantum computing
Smart Energy
Random forest
RNN-LSTM.pptx
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Introduction to artificial neural network
Solar power bank
Deep learning and Healthcare
Artificial intelligence in power systems
Understanding random forests
Lect6 Association rule & Apriori algorithm
Ad

Similar to Machine learning workshop using Orange datamining framework (20)

PDF
IRJET- Breast Cancer Prediction using Deep Learning
PDF
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PDF
SEMI SUPERVISED BASED SPATIAL EM FRAMEWORK FOR MICROARRAY ANALYSIS
PDF
IRJET- Result on the Application for Multiple Disease Prediction from Symptom...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
Fault Detection in Mobile Communication Networks Using Data Mining Techniques...
PPT
PACS strategic plan and needs assessment, technical Issues, PACS architecture.
PDF
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
PDF
IRJET- Chest Abnormality Detection from X-Ray using Deep Learning
PDF
IRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
PPTX
Lecture-2 Applied ML .pptx
DOCX
STRATAGIES FOR DETECTING DATA POISONING IN DISTRIBUTED ML.docx
DOCX
STRATAGIES FOR DETECTING DATA POISONING IN DISTRIBUTED ML (1).docx
PPTX
Computer aid in medical instrument term paper PPT
PDF
IRJET- Chest Abnormality Detection from X-Ray using Deep Learning
PDF
Lung Cancer Detection using Decision Tree Algorithm
PDF
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
PPTX
TSEMINAR-COLLEGE-DEUCATION-TECHNICAL-SUBJ
PPTX
Skin Cancer Disease by image classification Slidesgo.pptx
PDF
IRJET- Fault Detection and Prediction of Failure using Vibration Analysis
IRJET- Breast Cancer Prediction using Deep Learning
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
SEMI SUPERVISED BASED SPATIAL EM FRAMEWORK FOR MICROARRAY ANALYSIS
IRJET- Result on the Application for Multiple Disease Prediction from Symptom...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
Fault Detection in Mobile Communication Networks Using Data Mining Techniques...
PACS strategic plan and needs assessment, technical Issues, PACS architecture.
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
IRJET- Chest Abnormality Detection from X-Ray using Deep Learning
IRJET- Breast Cancer Disease Prediction : Using Machine Learning Approach
Lecture-2 Applied ML .pptx
STRATAGIES FOR DETECTING DATA POISONING IN DISTRIBUTED ML.docx
STRATAGIES FOR DETECTING DATA POISONING IN DISTRIBUTED ML (1).docx
Computer aid in medical instrument term paper PPT
IRJET- Chest Abnormality Detection from X-Ray using Deep Learning
Lung Cancer Detection using Decision Tree Algorithm
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
TSEMINAR-COLLEGE-DEUCATION-TECHNICAL-SUBJ
Skin Cancer Disease by image classification Slidesgo.pptx
IRJET- Fault Detection and Prediction of Failure using Vibration Analysis
Ad

More from Amr Rashed (20)

PDF
Introduction to Deep Learning: Concepts, Architectures, and Applications
PPTX
Introduction to Autoencoders: Types and Applications
PPTX
Introduction to the Fundamentals of Computer Networks
PPTX
Introduction to analog communication system
PPTX
introduction to embedded system presentation
PPT
Discrete Math Ch5 counting + proofs
PPTX
Discrete Math Chapter: 8 Relations
PPTX
Discrete Math Chapter 1 :The Foundations: Logic and Proofs
PPTX
Discrete Math Chapter 2: Basic Structures: Sets, Functions, Sequences, Sums, ...
PPTX
Introduction to deep learning
PPTX
Discrete Structure Mathematics lecture 1
PPTX
Implementation of DNA sequence alignment algorithms using Fpga ,ML,and CNN
PPTX
امن نظم المعلومات وامن الشبكات
PPTX
مقدمة عن الفيجوال بيسك 9-2019
PPTX
Deep learning tutorial 9/2019
PPTX
Deep Learning Tutorial
PDF
Matlab plotting
PPT
License Plate Recognition
PDF
Introduction to FPGA, VHDL
PDF
Introduction to Matlab
Introduction to Deep Learning: Concepts, Architectures, and Applications
Introduction to Autoencoders: Types and Applications
Introduction to the Fundamentals of Computer Networks
Introduction to analog communication system
introduction to embedded system presentation
Discrete Math Ch5 counting + proofs
Discrete Math Chapter: 8 Relations
Discrete Math Chapter 1 :The Foundations: Logic and Proofs
Discrete Math Chapter 2: Basic Structures: Sets, Functions, Sequences, Sums, ...
Introduction to deep learning
Discrete Structure Mathematics lecture 1
Implementation of DNA sequence alignment algorithms using Fpga ,ML,and CNN
امن نظم المعلومات وامن الشبكات
مقدمة عن الفيجوال بيسك 9-2019
Deep learning tutorial 9/2019
Deep Learning Tutorial
Matlab plotting
License Plate Recognition
Introduction to FPGA, VHDL
Introduction to Matlab

Recently uploaded (20)

PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPT
Mechanical Engineering MATERIALS Selection
PPTX
Construction Project Organization Group 2.pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPT
Project quality management in manufacturing
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
web development for engineering and engineering
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
composite construction of structures.pdf
PDF
Well-logging-methods_new................
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
Sustainable Sites - Green Building Construction
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Foundation to blockchain - A guide to Blockchain Tech
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Mechanical Engineering MATERIALS Selection
Construction Project Organization Group 2.pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Project quality management in manufacturing
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
UNIT 4 Total Quality Management .pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
web development for engineering and engineering
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
composite construction of structures.pdf
Well-logging-methods_new................
CYBER-CRIMES AND SECURITY A guide to understanding
Sustainable Sites - Green Building Construction
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026

Machine learning workshop using Orange datamining framework

Editor's Notes

  • #13: Feature Selection: Select a subset of input features from the dataset. Unsupervised: Do not use the target variable (e.g. remove redundant variables). e.g., Correlation Supervised: Use the target variable (e.g . remove irrelevant variables). Wrapper: Search for well-performing subsets of features. e.g., RFE Filter: Select subsets of features based on their relationship with the target. Statistical Methods Feature Importance Methods Intrinsic: Algorithms that perform automatic feature selection during training. Decision Trees Dimensionality Reduction: Project input data into a lower-dimensional feature space.
  • #14: Discretization Transform: Encode a numeric variable as an ordinal variable. Ordinal Transform: Encode a categorical variable into an integer variable. One-Hot Transform: Encode a categorical variable into binary variables. Normalization Transform: Scale a variable to the range 0 and 1. Standardization Transform: Scale a variable to a standard Gaussian. Power Transform: Change the distribution of a variable to be more Gaussian. Quantile Transform: Impose a probability distribution such as uniform or Gaussian.
  • #16: Principal Component Analysis (PCA) Singular Value Decomposition (SVD) Linear Discriminant Analysis (LDA) self-organizing maps(SOM)
  • #17: Numeric Data Type: Number values. Integer: Integers with no fractional part. Real: Floating point values. Categorical Data Type: Label values. Ordinal: Labels with a rank ordering. Nominal: Labels with no rank ordering. Boolean: Values True and False.
  • #18: https://orangedatamining.com/download/#windows