SlideShare a Scribd company logo
Data mining 2004
2
Using Xpress-Mosel for Modeling and
Solving Data Mining Problems
Alkis Vazacopoulos
Dash Optimization
3
Agenda
• New Customers
• Mosel (modeling environment)
• IVE (Integrated Visual Env.)
• Optimization Technologies
• Data Mining Problems & applications
• Applications
4
New Customers
• Frito-Lay
• Carmen Systems
• Du Pont
• Deutsche Bank
• Siemens
• Toyota
5
Mosel: key features
• Integration of modeling and solving
• Programming facilities for pre/post processing, algorithms
– No separation between modeling statement and procedure
to solve the problem
• Open, modular architecture
• Highly flexible and extensible
6
Embedding a Mosel model
Problem
Solving
Program
Starts
Program
Terminates
Model
Execution
Result
Retrieval
Data
Input
Output
Results
7
Mosel: A modeling language
• Decision variables, linear constraints
• Arrays, (index-) sets
• Operators: standard arithmetic, aggregate and set
operators e.g. sum, prod, max, and, or, union, inter
• Loops, Selections e.g. forall-[do], if-then-[elif-then]
• Subroutines: functions and procedures
8
Architecture
LP
Xpress-Mosel
e.g. decision
variables and
constracts, etc.
Enterprise
dataEnterprise
data
• Customer history
• Available products
• Profitability models
• Content
• …Pre- and
Post-
processing
Algorithms
e.g.Optimal solutions
MIP
Constraint
Programming
Stochastic
Programming
9
Mosel: components and interfaces
• Mosel Language: to implement problems and solution algorithms
→ Model or Mosel program
• Mosel Model Compiler and Run-time Libraries: to compile,
execute and access models from a programming language
→ C, C++ , Java or VB program
• Mosel Native Interface (NI): to provide new or extend existing
functionality of Mosel Language
→ module
10
XPRESS-IVE
Benchmarking Solving
Modeling
Programming
11
Xpress - IVE
• Development environment
• Enables rapid prototyping and testing
• Entity tree for data, variables and constraints
• Matrix visualization
• Branch and Bound tree visualization
• LP, MIP and user defined charts
12
13
Matrix Visualization
14
Optimization Technologies
Quadratic P
(QP)
Mixed
Integer
Linear
MIQP
Stochastic
Constrained
Programmin
gHeuristics
Nonlinear
15
Data Mining Application Areas
Extracting useful information from large datasets
of various nature and origin arising in
• Finance
• Manufacturing
• Biomedicine
• Telecommunications
• Military Systems
• Other areas
16
Problems
• Revealing internal structure and
patterns of the data:
–Classification
–Regression
–Clustering
17
Approaches
• LP, MIP
• QP, MIQP
• Network Optimization
• Statistical Preprocessing
• Combinations of these Approaches
18
Classification Problems:
general setup
• “Training dataset”: N elements (xi, yi),
i = 1,…,N.
xi is an n-dimensional vector of
element’s attributes (features)
yi denotes the class attribute
(the number of classes is specified)
19
Classification Problems:
general setup
• A new element with known attributes x,
but unknown class attribute y
• The problem is to determine, which class
this element belongs to
• The classification model is “trained” on
the training dataset and applied to new
elements
20
Classification Problems:
general setup
• Main Idea: Constructing separating
surfaces in the n-dimensional space that
would divide it into several regions
• Each region corresponds to a certain
class
• The new element is classified according
to its geometrical location in the vector
space
21
Classification Problems:
example
22
Classification Problems:
LP approach
• Consider binary classification, one
separating plane
• The plane is represented by the standard
equation
• The problem is to find the optimal values
of the parameters w and Îł
23
Classification Problems:
LP approach
• Suppose that vectors xi from the training
dataset are stored in two matrices
A(m×n) and B(k×n) corresponding to m
elements of the 1st class and k elements
of the 2nd class.
• The plane will perfectly separate
elements in A and elements in B if
24
Classification Problems:
LP approach
• Extra variables y and z are introduced to
model classification errors:
• The parameters w and γ are determined from
the LP problem of minimizing the total
misclassification error
25
Classification Problems:
LP formulation
26
Classification Problems:
generalized approaches
• Using multiple, non-linear separating
surfaces (e.g., polynomial, exponential,
logarithmic)
– Finding parameters of these surfaces can also
be reduced to LP
• Selecting a minimum number of attributes
(features) that are taken into account in
classification – feature selection
27
Classification Problems:
Application Examples
• Cancer Diagnosis
(Mangasarian et al, 1995 –
linear separating surfaces)
• Classification of Credit Card Applications,
Bonds Rating
28
Regression Problems:
General Setup
• N elements (xi, yi), i = 1,…,N, xi is a vector in
Rn, yi is a scalar in R
• Find a linear relationship between xi and yi,
i.e., find a vector β in Rn, such that
• We need to minimize
or
29
Regression Problems: LP
formulation
• The problem
can be reformulated as LP:
30
Clustering Problems
• Given a dataset, we need to assign the
elements to K clusters, according to an
appropriate similarity criteria. The number of
clusters K is usually not known a priori.
• Standard algorithms for fixed number of
clusters:
– K-median
– K-mean
31
Integer Programming approach to
classification and regression using
clustering techniques
• CRIO software package (Bertsimas & Shioda, 2002)
• Similar approaches for both classification and
regression
• Outline
– Preprocess data by assigning points to small clusters to
reduce the dimensionality
– Solve a mixed integer problem that assigns clusters to groups
and removes outliers. In the case of regression the model also
selects the regression coefficients for each group.
– Solve continuous optimization problems (quadratic
optimization problems for classification and linear optimization
problems for regression) that assign groups to polyhedral
regions.
32
Extending MOSEL-Native Int.
• Modular environment and open
architecture
• Module = dynamic libraries
• Not dedicated to any particular use:
– Solvers: Xpress-Optimizer, CHIP, OptQuest
– Database access: ODBC
– System commands
33
Stochastic
Modeling
Uncertainty
Stochastic
Solvers
Solution
Techniques
34
Stochastic Programming (SP)
• Stochastic Programming: Decision
making under uncertainty
– Model future uncertainty into mathematical
programming as scenarios
– Make optimal decisions to hedge against
future
35
Available features
New Types
• Svalue: Stochastic values that take different
values with certain probability e.g demand
• Smpvar: Stochastic decision variables that
take different values under different scenarios
• Slinctr: Stochastic constraints built on linear
expressions containing real,Svalue and
Smpvar
36
Example
1 32stage
Svalue
Dem1=
2 w.p 0.6
8 w.p 0.4
Dem2=
3 w.p 0.3
7 w.p 0.6
9 w.p 0.1
Smpvar x1 x2 x3
Slinctr x1+x2+x3<=Inventory
x1>=Dem1
x2>=Dem2
37
Advantages
• Automatic scenario tree generation
2
8
3
7
1
3
7
1
Scenario w.p
1 .18
2 .36
3 .06
4 .12
5 .24
6 .04
38
Advantages
• Elimination of scenario indexed entities e.g
T=3
x: array(1..T) of Smpvar
Dem:array(1..T-1) of Svalue
c:Slinctr
c:=sum(t in 1..T) x(t)<=Inventory
instead of
Scenarios=1..6
x: array(1..T,Scenarios) of mpvar
Dem:array(1..T-1 ,Scenarios) of real
c: arrray(Scenarios) of linctr
forall(s in Scenarios )
c(s):=sum(t in 1..T) x(t,s)<= Inventory
39
Advantages
• Elimination of writing Non-Anticipative Constraints
Scenarios=1..6
x: array(1..T,Scenarios) of mpvar
x(t,s)=x(t,s’) t=1; s,s’ {1..6} :s s’
x(t,s)=x(t,s’) t=2; s,s’ {1..3} :s s’
x(t,s)=x(t,s’) t=2; s,s’ {4..6} :s s’

 






1
2
3
4
5
6
t: 1 2 3
40
Statistical Preprocessing of the
Data
• In many cases, it is helpful to use
statistical preprocessing of the data
before applying mathematical
programming techniques

More Related Content

PPT
Alexander Mikov - Program Tools for Dynamic Investigation of Social Networks
 
PPT
Introduction
PPTX
The statistical physics of learning revisted: Phase transitions in layered ne...
PDF
Session II - Estimation methods and accuracy Li-Chun Zhang Discussion: Sess...
PPSX
The statistical physics of learning - revisited
PPT
PPT
Machine learning and deep learning algorithms
PPTX
Keynote at IWLS 2017
Alexander Mikov - Program Tools for Dynamic Investigation of Social Networks
 
Introduction
The statistical physics of learning revisted: Phase transitions in layered ne...
Session II - Estimation methods and accuracy Li-Chun Zhang Discussion: Sess...
The statistical physics of learning - revisited
Machine learning and deep learning algorithms
Keynote at IWLS 2017

Similar to Data mining 2004 (20)

PDF
30thSep2014
PPT
lecture.ppt
PPTX
Computational Giants_nhom.pptx
PPTX
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
PPTX
Machine learning and linear regression programming
PDF
Stochastic Optimization: Solvers and Tools
PDF
Machine learning meetup
PDF
Lecture 1 (bce-7)
PDF
6 data envelopment_analysis
 
PPTX
Final Presentation - Edan&Itzik
PPTX
Deep learning with keras
PPTX
Manufacturing Data Analytics
PDF
General Tips for participating Kaggle Competitions
PDF
AOFW CHG433_Part I_wip_Nov2023.pdf presentation
PDF
Automated Testing of Autonomous Driving Assistance Systems
PPTX
lecture_16.pptx
PDF
AlphaPy: A Data Science Pipeline in Python
PDF
AlphaPy
PDF
Machine Learning and AI: Core Methods and Applications
PPTX
Supervised Learning.pptx
30thSep2014
lecture.ppt
Computational Giants_nhom.pptx
UNIT-2 Quantitaitive Anlaysis for Mgt Decisions.pptx
Machine learning and linear regression programming
Stochastic Optimization: Solvers and Tools
Machine learning meetup
Lecture 1 (bce-7)
6 data envelopment_analysis
 
Final Presentation - Edan&Itzik
Deep learning with keras
Manufacturing Data Analytics
General Tips for participating Kaggle Competitions
AOFW CHG433_Part I_wip_Nov2023.pdf presentation
Automated Testing of Autonomous Driving Assistance Systems
lecture_16.pptx
AlphaPy: A Data Science Pipeline in Python
AlphaPy
Machine Learning and AI: Core Methods and Applications
Supervised Learning.pptx
Ad

More from Alkis Vazacopoulos (20)

PPT
Automatic Fine-tuning Xpress-MP to Solve MIP
PPTX
Amazing results with ODH|CPLEX
PPTX
Bia project poster fantasy football
PPT
NFL Game schedule optimization
PDF
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
PDF
Posters 2017
PDF
Very largeoptimizationparallel
PDF
Retail Pricing Optimization
PDF
Optimization Direct: Introduction and recent case studies
PDF
Informs 2016 Solving Planning and Scheduling Problems with CPLEX
PDF
ODHeuristics
DOCX
Missing-Value Handling in Dynamic Model Estimation using IMPL
DOCX
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
DOCX
Industrial Modeling Service (IMS-IMPL)
DOCX
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
DOCX
PPTX
Distillation Curve Optimization Using Monotonic Interpolation
DOCX
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
DOCX
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB) Indust...
PDF
Hybrid Dynamic Simulation (HDS) Industrial Modeling Framework (HDS-IMF)
Automatic Fine-tuning Xpress-MP to Solve MIP
Amazing results with ODH|CPLEX
Bia project poster fantasy football
NFL Game schedule optimization
2017 Business Intelligence & Analytics Corporate Event Stevens Institute of T...
Posters 2017
Very largeoptimizationparallel
Retail Pricing Optimization
Optimization Direct: Introduction and recent case studies
Informs 2016 Solving Planning and Scheduling Problems with CPLEX
ODHeuristics
Missing-Value Handling in Dynamic Model Estimation using IMPL
Finite Impulse Response Estimation of Gas Furnace Data in IMPL Industrial Mod...
Industrial Modeling Service (IMS-IMPL)
Dither Signal Design Problem (DSDP) for Closed-Loop Estimation Industrial Mod...
Distillation Curve Optimization Using Monotonic Interpolation
Multi-Utility Scheduling Optimization (MUSO) Industrial Modeling Framework (M...
Advanced Parameter Estimation (APE) for Motor Gasoline Blending (MGB) Indust...
Hybrid Dynamic Simulation (HDS) Industrial Modeling Framework (HDS-IMF)
Ad

Recently uploaded (20)

PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Fluorescence-microscope_Botany_detailed content
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Introduction to Business Data Analytics.
PPTX
Database Infoormation System (DBIS).pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Introduction-to-Cloud-ComputingFinal.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Reliability_Chapter_ presentation 1221.5784
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Fluorescence-microscope_Botany_detailed content
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Acceptance and paychological effects of mandatory extra coach I classes.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Introduction to Knowledge Engineering Part 1
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Introduction to Business Data Analytics.
Database Infoormation System (DBIS).pptx

Data mining 2004

  • 2. 2 Using Xpress-Mosel for Modeling and Solving Data Mining Problems Alkis Vazacopoulos Dash Optimization
  • 3. 3 Agenda • New Customers • Mosel (modeling environment) • IVE (Integrated Visual Env.) • Optimization Technologies • Data Mining Problems & applications • Applications
  • 4. 4 New Customers • Frito-Lay • Carmen Systems • Du Pont • Deutsche Bank • Siemens • Toyota
  • 5. 5 Mosel: key features • Integration of modeling and solving • Programming facilities for pre/post processing, algorithms – No separation between modeling statement and procedure to solve the problem • Open, modular architecture • Highly flexible and extensible
  • 6. 6 Embedding a Mosel model Problem Solving Program Starts Program Terminates Model Execution Result Retrieval Data Input Output Results
  • 7. 7 Mosel: A modeling language • Decision variables, linear constraints • Arrays, (index-) sets • Operators: standard arithmetic, aggregate and set operators e.g. sum, prod, max, and, or, union, inter • Loops, Selections e.g. forall-[do], if-then-[elif-then] • Subroutines: functions and procedures
  • 8. 8 Architecture LP Xpress-Mosel e.g. decision variables and constracts, etc. Enterprise dataEnterprise data • Customer history • Available products • Profitability models • Content • …Pre- and Post- processing Algorithms e.g.Optimal solutions MIP Constraint Programming Stochastic Programming
  • 9. 9 Mosel: components and interfaces • Mosel Language: to implement problems and solution algorithms → Model or Mosel program • Mosel Model Compiler and Run-time Libraries: to compile, execute and access models from a programming language → C, C++ , Java or VB program • Mosel Native Interface (NI): to provide new or extend existing functionality of Mosel Language → module
  • 11. 11 Xpress - IVE • Development environment • Enables rapid prototyping and testing • Entity tree for data, variables and constraints • Matrix visualization • Branch and Bound tree visualization • LP, MIP and user defined charts
  • 12. 12
  • 15. 15 Data Mining Application Areas Extracting useful information from large datasets of various nature and origin arising in • Finance • Manufacturing • Biomedicine • Telecommunications • Military Systems • Other areas
  • 16. 16 Problems • Revealing internal structure and patterns of the data: –Classification –Regression –Clustering
  • 17. 17 Approaches • LP, MIP • QP, MIQP • Network Optimization • Statistical Preprocessing • Combinations of these Approaches
  • 18. 18 Classification Problems: general setup • “Training dataset”: N elements (xi, yi), i = 1,…,N. xi is an n-dimensional vector of element’s attributes (features) yi denotes the class attribute (the number of classes is specified)
  • 19. 19 Classification Problems: general setup • A new element with known attributes x, but unknown class attribute y • The problem is to determine, which class this element belongs to • The classification model is “trained” on the training dataset and applied to new elements
  • 20. 20 Classification Problems: general setup • Main Idea: Constructing separating surfaces in the n-dimensional space that would divide it into several regions • Each region corresponds to a certain class • The new element is classified according to its geometrical location in the vector space
  • 22. 22 Classification Problems: LP approach • Consider binary classification, one separating plane • The plane is represented by the standard equation • The problem is to find the optimal values of the parameters w and Îł
  • 23. 23 Classification Problems: LP approach • Suppose that vectors xi from the training dataset are stored in two matrices A(m×n) and B(k×n) corresponding to m elements of the 1st class and k elements of the 2nd class. • The plane will perfectly separate elements in A and elements in B if
  • 24. 24 Classification Problems: LP approach • Extra variables y and z are introduced to model classification errors: • The parameters w and Îł are determined from the LP problem of minimizing the total misclassification error
  • 26. 26 Classification Problems: generalized approaches • Using multiple, non-linear separating surfaces (e.g., polynomial, exponential, logarithmic) – Finding parameters of these surfaces can also be reduced to LP • Selecting a minimum number of attributes (features) that are taken into account in classification – feature selection
  • 27. 27 Classification Problems: Application Examples • Cancer Diagnosis (Mangasarian et al, 1995 – linear separating surfaces) • Classification of Credit Card Applications, Bonds Rating
  • 28. 28 Regression Problems: General Setup • N elements (xi, yi), i = 1,…,N, xi is a vector in Rn, yi is a scalar in R • Find a linear relationship between xi and yi, i.e., find a vector β in Rn, such that • We need to minimize or
  • 29. 29 Regression Problems: LP formulation • The problem can be reformulated as LP:
  • 30. 30 Clustering Problems • Given a dataset, we need to assign the elements to K clusters, according to an appropriate similarity criteria. The number of clusters K is usually not known a priori. • Standard algorithms for fixed number of clusters: – K-median – K-mean
  • 31. 31 Integer Programming approach to classification and regression using clustering techniques • CRIO software package (Bertsimas & Shioda, 2002) • Similar approaches for both classification and regression • Outline – Preprocess data by assigning points to small clusters to reduce the dimensionality – Solve a mixed integer problem that assigns clusters to groups and removes outliers. In the case of regression the model also selects the regression coefficients for each group. – Solve continuous optimization problems (quadratic optimization problems for classification and linear optimization problems for regression) that assign groups to polyhedral regions.
  • 32. 32 Extending MOSEL-Native Int. • Modular environment and open architecture • Module = dynamic libraries • Not dedicated to any particular use: – Solvers: Xpress-Optimizer, CHIP, OptQuest – Database access: ODBC – System commands
  • 34. 34 Stochastic Programming (SP) • Stochastic Programming: Decision making under uncertainty – Model future uncertainty into mathematical programming as scenarios – Make optimal decisions to hedge against future
  • 35. 35 Available features New Types • Svalue: Stochastic values that take different values with certain probability e.g demand • Smpvar: Stochastic decision variables that take different values under different scenarios • Slinctr: Stochastic constraints built on linear expressions containing real,Svalue and Smpvar
  • 36. 36 Example 1 32stage Svalue Dem1= 2 w.p 0.6 8 w.p 0.4 Dem2= 3 w.p 0.3 7 w.p 0.6 9 w.p 0.1 Smpvar x1 x2 x3 Slinctr x1+x2+x3<=Inventory x1>=Dem1 x2>=Dem2
  • 37. 37 Advantages • Automatic scenario tree generation 2 8 3 7 1 3 7 1 Scenario w.p 1 .18 2 .36 3 .06 4 .12 5 .24 6 .04
  • 38. 38 Advantages • Elimination of scenario indexed entities e.g T=3 x: array(1..T) of Smpvar Dem:array(1..T-1) of Svalue c:Slinctr c:=sum(t in 1..T) x(t)<=Inventory instead of Scenarios=1..6 x: array(1..T,Scenarios) of mpvar Dem:array(1..T-1 ,Scenarios) of real c: arrray(Scenarios) of linctr forall(s in Scenarios ) c(s):=sum(t in 1..T) x(t,s)<= Inventory
  • 39. 39 Advantages • Elimination of writing Non-Anticipative Constraints Scenarios=1..6 x: array(1..T,Scenarios) of mpvar x(t,s)=x(t,s’) t=1; s,s’ {1..6} :s s’ x(t,s)=x(t,s’) t=2; s,s’ {1..3} :s s’ x(t,s)=x(t,s’) t=2; s,s’ {4..6} :s s’          1 2 3 4 5 6 t: 1 2 3
  • 40. 40 Statistical Preprocessing of the Data • In many cases, it is helpful to use statistical preprocessing of the data before applying mathematical programming techniques