Using Qualitative Knowledge in Numerical Learning

USING QUALITATIVE KNOWLEDGE IN NUMERICAL LEARNING Ivan Bratko Faculty of Computer and Info. Sc. University of Ljubljana Slovenia

THIS TALK IS ABOUT: AUTOMATED MODELLING FROM DATA WITH MACHINE LEARNING COMBINING NUMERICAL AND QUALITATIVE REPRESENTATIONS

BUILING MODELS FROM DATA Observed system Machine learning, numerical regression Model of system Data

EXAMPLE: POPULATION DYNAMICS A lake with zooplankton, phytoplankton and nutrient nitrogen Variables in system: Nut Phyto Zoo

POPULATION DYNAMICS Observed behaviour in time Data provided by Todorovski&D žeroski

PRIOR KNOWLEDGE We would like our modelling methods to make use of expert’s prior knowledge (possibly qualitative) Phytoplankton feeds on Nutrient, Zooplankton feeds on Phytoplankton Nutrient Phyto Zoo

QUALITATIVE DIFFICULTIES OF NUMERICAL LEARNING Learn time behavior of water level: h = f( t, initial_outflow) Level h outflow t h

TIME BEHAVIOUR OF WATER LEVEL Initial_ouflow =12.5

VARYING INITIAL OUTFLOW Initial_ouflow =12.5 11.25 10.0 8.75 6.25

PREDICTING WATER LEVEL WITH M5 11.25 10.0 8.75 6.25 7.5 Initial_ouflow =12.5 Qualitatively incorrect – water level cannot increase M5 prediction

QUALITATIVE ERRORS OF NUMERICAL LEARNERS E xperiments with regression (model) trees ( M5; Quinlan 92 ) , LWR (Atkenson et.al. 97) in Weka ( Witten & Frank 2000 ) , neural nets, ... Qualitative errors: water level should never increase water level should not be negative An expert might accept numerical errors, but such qualitative errors are particularly disturbing

Q 2 LEARNING AIMS AT OVERCOMING THESE DIFFICULTIES

Q 2 LEARNING Š uc, Vladu š i č , Bratko; IJCAI’03, AIJ 2004, IJCAI’05 Aims at overcoming these difficulties of numerical learning Q 2 = Q ualitatively faithful Q uantitative learning Q 2 makes use of qualitative constraints

QUALITATIVE CONSTRAINTS FOR WATER LEVEL For any initial outflow: Level is always decreasing with time For any time point: Greater the initial outflow, greater the level

SUMMARY OF Q 2 LEARNING Standard numerical learning approaches make qualitative errors. As a result, numerical predictions are qualitatively inconsistent with expectations Q 2 learning (Qualitatively faithful Quantitative prediction); A method that enforces qualitative consistency Resulting numerical models enable clearer interpretation, and also significantly improve quantitative prediction

IDEA OF Q 2 First find qualitative laws in data Respect these qualitative laws in numerical learning

CONTENTS OF REST OF TALK Building blocks of Q 2 learning: Ideas from Qualitative Reasoning, Algorithms QUIN, QFILTER, QCGRID Experimental analysis Applications: Car modelling, ecological modelling, behavioural cloning (operating a crane, flying an aircraft)

HOW CAN WE DESCRIBE QUALITATIVE PROPERTIES ? We can use concepts from field of qualitative reasoning in AI Related terms: Qualitative physics, Naive physics, Qualitative modelling

QUALITATIVE MODELLING IN AI Naive physics, as opposed to "proper physics“ Qualitative modelling, as opposed to quantitative modelling

ESSENCE OF NAIVE PHYSICS Describe physical processes qualitatively , without numbers or exact numerical relations “ Naive physics”, as opposed to "proper physics“ Close to common sense descriptions

EXAMPLE: BATH TUB What will happen? Amount of water will keep increasing, so will level, until the level reaches the top.

EXAMPLE: U-TUBE What will happen? La Lb Level La will be decreasing, and Lb increasing, until La = Lb.

QUALITATIVE REASONING ABOUT U-TUBE Total amount of water in system constant If La > Lb then flow from A to B Flow causes amount in A to decrease Flow causes amount in B to increase All changes in time happen continuously and smoothly Level La Level Lb A B

QUALITATIVE REASONING ABOUT U-TUBE In any container: the greater the amount, the greater the level So, La will keep decreasing, Lb increasing Level La Level Lb

QUALITATIVE REASONING ABOUT U-TUBE La will keep decreasing, Lb increasing, until they equalise Level La Level Lb La Lb Time

THIS REASONING IS VALID FOR ALL CONTAINERS OF ANY SHAPE AND SIZE, REGARDLESS OF ACTUAL NUMBERS!

QHY REASON QUALITATIVELY? Because it is easier than quantitatively Because it is easy to understand - facilitates explanation We want to exploit these advantages in ML

RELATION BETWEEN AMOUNT AND LEVEL The greater the amount, the greater the level A = M + (L) A is a monotonically increasing function of L

MONOTONIC FUNCTIONS Y = M + (X) specifies a family of functions X Y

MONOTONIC QUALITATIVE CONSTRAINTS, MQCs Generalisation of monotonically increasing functions to several arguments Example: Z = M +,- ( X, Y) Z increases with X, and decreases with Y More precisely: if X increases and Y stays unchanged then Z increases

EXAMPLE: BEHAVIOUR OF GAS Pressure = M +,- (Temperature, Volume) Pressure increases with Temperature Pressure decreases with Volume

Q 2 LEARNING Induce qualitative constraints ( QUIN ) Qualitative to Quantitative Transformation (Q2Q) Numerical predictor : respects qualitative constraints fits data numerically Numerical data One possibility: QFILTER

PROGRAM QUIN INDUCING QUALITATIVE CONSTRAINTS FROM NUMERICAL DATA Šuc 2001 ( PhD Thesis , also as book 2003 ) Šuc and Bratko, ECML ’ 01

QUIN QUIN = Qualitative Induction Numerical examples QUIN Qualitative tree Qualitative tree : similar to decision tree, qualitative constraints in leaves

EXAMPLE PROBLEM FOR QUIN Noisy examples: z = x 2 - y 2 + noise(st.dev. 50)

EXAMPLE PROBLEM FOR QUIN In this region: z = M +,+ (x,y)

INDUCED QUALITATIVE TREE FOR z = x 2 - y 2 + noise z= M -,+ ( x,y) z= M -,- ( x,y) z= M +,+ ( x , y) z= M +,- ( x,y)  0 > 0 > 0  0 > 0  0 y x y

QUIN ALGORITHM: OUTLINE Top-down greedy algorithm (similar to induction of decision trees) For every possible split, find the “most consistent” MQC (min. error-cost) for each subset of examples Select the best split according to MDL

Q2Q Qualitative to Quantitative Transformation

Q2Q EXAMPLE X < 5 y n Y = M + (X) Y = M - (X) 5 X Y

QUALITATIVE TREES IMPOSE NUMERICAL CONSTRAINTS MQCs impose numerical constraints on class values, between pairs of examples y = M + (x) requires: If x 1 > x 2 then y 1 > y 2

RESPECTING MQCs NUMERICALLY z = M +,+ (x,y) requires: If x 1 < x 2 and y 1 < y 2 then z 1 < z 2 (x 2 , y 2 ) (x 1 , y 1 ) x y

QFILTER AN APPROACH TO Q2Q TRANSFORMATION Šuc and Bratko, ECML ’03

TASK OF QFILTER Given: qualitative tree points with class predictions by arbitrary numerical learner learning examples (optionally) Modify class predictions to achieve consistency with qualitative tree

QFILTER IDEA Force numerical predictions to respect qualitative constraints : find minimal changes of predicted values so that qualitative constraints become satisfied “ minimal” = min. sum of squared changes a quadratic programming problem

RESPECTING MQCs NUMERICALLY Y = M + (X) X Y

QFILTER APPLIED TO WATER OUTFLOW Qualitative constraint that applies to water outflow: h = M -,+ (time, InitialOutflow) This could be supplied by domain expert, or induced from data by QUIN

PREDICTING WATER LEVEL WITH M5 7.5 M5 prediction

QFILTER’S PREDICTION QFILTER predictions T rue values

POPULATION DYNAMICS Aquatic ecosystem with zooplankton, phytoplankton and nutrient nitrogen Phyto feeds on Nutrient, Zoo feeds on Phyto Nutrient Phyto Zoo

POPULATION DYNAMICS WITH Q 2 Behaviour in time

PREDICTION PROBLEM Predict the change in zooplankton population: ZooChange(t) = Zoo(t + 1) - Zoo(t) Biologist’s rough idea: ZooChange = Growth - Mortality M +,+ (Zoo, Phyto) M + (Zoo)

APPROXIMATE QUALITATIVE MODEL OF ZOO CHANGE Induced from data by QUIN

EXPERIMENT WITH NOISY DATA All results as MSE (Mean Squared Error) 2.269 ; 1.889 0.112 ; 0.102 0.015 ; 0.008 ZooChange 20 % noise LWR; Q 2 5 % noise LWR; Q 2 no noise LWR; Q 2 Domain

APPLICATIONS OF Q 2 FROM REAL ECOLOGICAL DATA Growth of algae Lagoon of Venice Plankton in Lake Glumsoe

Lake Glumsø Location and properties: Lake Glumsø is located in a sub-glacial valley in Denmark Average depth 2 m Surface area 266000 m 2 Pollution Receives waste water from community with 3000 inhabitants (mainly agricultural) High nitrogen and phosphorus concentration in waste water caused hypereutrophication No submerged vegetation low transparency of water oxygen deficit at the bottom of the lake

Lake Glumsø – data Relevant variables for modelling are: phytoplankton phyto zooplankton zoo soluble nitrogen ns soluble phosphorus ps water temperature temp

PREDICTION ACCURACY Over all (40) experiments. Q 2 better than LWR in 75% (M5, 83%) of the test cases The differences were found significant (t-test) at 0.02 significance level

OTHER ECOLOGICAL MODELLING APPLICATIONS Predicting ozone concentrations in Ljubljana and Nova Gorica Predicting flooding of Savinja river Q2 model by far superior to any predictor so far used in practice

CASE STUDY INTEC’S CAR SIMULATION MODELS Goal: simplify INTEC’s car models to speed up simulation Context: Clockwork European project (engineering design)

Learning Manouvres Learning manouvres were very simple: Sinus bump Turning left Turning right Road excitation: Steering position

WHEEL MODEL : PREDICTING TOE ANGLE 

WHEEL MODEL : PREDICTING TOE ANGLE  Qualiative errors Q 2 predicted alpha Q 2

BEHAVIOURAL CLONING Given a skilled operator, reconstruct the human’s sub cognitive skill

EXAMPLE: GANTRY CRANE Control force Load Carriage

USE MACHINE LEARNING: BASIC IDEA Controller System Observe Execution trace Learning program Reconstructed controller (“clone”) Actions States

CRITERIA OF SUCCESS Induced controller description has to: Be comprehensible Work as a controller

WHY COMPREHENSIBILITY? To help the user’s intuition about the essential mechanism and causalities that enable the controller achieve the goal

SKILL RECONSTRUTION IN CRANE Control forces: F x , F L State: X, d X,  , d  , L, d L

CARRIAGE CONTROL QUIN: dX des = f(X,  , d  ) M - ( X ) M + (  ) X < 20.7 X < 60.1 M + ( X ) yes yes no no First the trolley velocity is increasing From about middle distance from the goal until the goal the trolley velocity is decreasing At the goal reduce the swing of the rope (by acceleration of the trolley when the rope angle increases)

CARRIAGE CONTROL: dX des = f(X,  , d  ) M - ( X ) M + (  ) X < 20.7 X < 60.1 X < 29.3 M + ( X ) d  < -0.02 M - ( X ) M -,+ ( X ,  ) M +,+,- ( X,  , d  ) yes yes yes yes no no no no Enables reconstruction of individual differences in control styles Operator S Operator L

CASE STUDY IN REVERSE ENGINEERING: ANTI-SWAY CRANE

ANTI-SWAY CRANE Industrial crane controller minimising load swing, “anti-sway crane” Developed by M. Valasek (Czech Technical University, CTU) Reverse engineering of anti-sway crane: a case study in the Clockwork European project

ANTI-SWAY CRANE OF CTU Crane parameters: travel distance 100m height 15m, width 30m 80-120 tons In daily use at Nova Hut metallurgical factory, Ostrava

EXPLAINING HOW CONTROLLER WORKS Load swinging to right; Accelerate cart to right to reduce swing

EMPIRICAL EVALUATION Compare errors of base-learners and corresponding Q 2 learners differences btw. a base-learner and a Q 2 learner are only due to the induced qualitative constraints Experiments with three base-learners: Locally Weighted Regression (LWR) Model trees Regression trees

Robot Arm Domain Y 1 Y 2 Two-link, two-joint robot arm Link 1 extendible: L 1  [2, 10] Y 1 = L 1 sin(  1 ) Y 2 = L 1 sin(  1 ) + 5 sin(  1 +  2 )  1  2 Four learning problems: A: Y 1 = f(L 1 ,  1 ) B: Y 2 = f(L 1 ,  1 ,  2 ,  sum , Y 1 ) C: Y 2 = f(L 1 ,  1 ,  2 ,  sum ) D: Y 2 = f(L 1 ,  1 ,  2 ) L 1 Derived attribute  sum =  1 +  2 Difficulty for Q 2

Robot Arm: LWR and Q 2 at different noise levels Q 2 outperforms LWR with all four learning problems (at all three noise levels) A 0, 5, 10% n. | B 0, 5, 10% n. | C 0, 5, 10% n. | D 0, 5, 10% n.

UCI and Dynamic Domains Five smallest regression data sets from UCI Dynamic domains: typical domains where QUIN was applied so far to explain the control skill or control the system until now was not possible to measure accuracy of the learned concepts (qualitative trees) AntiSway logged data from an anti-sway crane controller CraneSkill1, CraneSkill2: logged data of experienced human operators controlling a crane

UCI and Dynamic Domains: LWR compared to Q 2 Similar results with other two base-learners. Q 2 significantly better than base-learners in 18 out of 24 comparisons (24 = 8 datasets * 3 base-learners)

Q 2 - CONCLUSIONS A novel approach to numerical learning Can take into account qualitative prior knowledge Advantages: qualitative consistency of induced models and data – important for interpretation of induced models improved numerical accuracy of predictions

Q 2 TEAM + ACKNOWLEDGEMENTS Q 2 learning, QUIN, Qfilter , QCGRID (AI Lab, Ljubljana): Dorian Š uc Daniel Vladušič Car modelling data Wolfgan Rulka (INTEC, Munich) Zbinek Šika (Czech Technical Univ.) Population dynamics data Sašo Džeroski, Ljupčo Todorovski (J. Stefan Institute, Ljubljana) Lake Glumsoe Sven Joergensen Boris Kompare, Jure Žabkar, D. Vladušič

RELEVANT PAPERS Clark and Matwin 93: also used qualitative constraints in numerical predictions Š uc, Vladu š i č and Bratko; IJCAI’03 Š uc, Vladu š i č and Bratko; Artificial Intelligence Journal, 2004 Š uc and Bratko; ECML’03 Š uc and Bratko; IJCAI’05

Using Qualitative Knowledge in Numerical Learning

More Related Content

Similar to Using Qualitative Knowledge in Numerical Learning (20)

More from butest (20)

Using Qualitative Knowledge in Numerical Learning

Editor's Notes