SlideShare a Scribd company logo
"Apprentissage non supervisé" de la
       théorie à la pratique


      Miguel Arturo Barreto Sánz
Outline
● Introduction

 The unsupervised learning


● The   Self-Organizing Map
 The biological inspiration
 The algorithm
 Characteristics
 Examples


● Practical   examples using MATLAB




                              1
Introduction
Unsupervised learning is a way to form “natural groupings”
or clusters of patterns.

Unsupervised learning seeks to determine how the data are
organized.

It is distinguished from supervised learning in that the
learner is given only unlabeled examples.
.
Among neural network models, the Self-Organizing Map
(SOM) are commonly used unsupervised learning
algorithms.

The SOM is a topographic organization in which nearby
locations in the map represent inputs with similar properties.


                          2
The Self-Organizing Map
               The biological inspiration

                    Sensory information is processed in the
                    neocortex by highly ordered neuronal
                    networks.

                    • Tangential to the cortical surface,
W. Penfield         representations of the sensory periphery
                    are organized into well-ordered maps.

                    • Taste maps in gustatory cortex (Accolla
                    et al., 2007)

                    • Somatotopic maps in primary
                    somatosensory cortex (Kaas, 1991).

                             3
The Self-Organizing Map
 The biological inspiration

                Other prominent cortical maps
                are the tonotopic organization
                of auditory cortex (Kalatsky et
                al., 2005),




                The most intensely studied
                example is the primary visual
                cortex, which is arranged with
                superimposed maps of
                retinotopy, ocular dominance
                and orientation (Bonhoeffer
                and Grinvald, 1991).
            4
The Self-Organizing Map
 The biological inspiration




                       Humunculus




            5
The Self-Organizing Map
         The biological inspiration




Somatosensory cortex dominated by the representation
of teeth in the naked mole-rat brain
Kenneth C. Catania, and Michael S. Remple.


                                        6
The Self-Organizing Map
      The biological inspiration




A remarkably high degree of organization is obvious in the
primary somatosensory cortex, in which a clear pattern of
cytoarchitectonic units termed ‘barrels’ are observed in
perfect match with the arrangement of the whiskers on the
snout of the mouse (Woolsey and Van der Loos, 1970)
                         7
The Self-Organizing Map
  The biological inspiration



Mapping functionally related sensory
information onto nearby cortical regions is
thought to minimize axonal wiring length and
simplify the synaptic circuits underlying
correlation-based associational plasticity.




                  8
The Self-Organizing Map

                In a topology-preserving map, units located physically
                next to each other will respond to classes of input vectors
                that are likewise next to each other.

                Although it is easy to visualize units next to each other in a
Teuvo Kohonen
                two-dimensional array, it is not so easy to determine
                which classes of vectors are next to each other in a high-
                dimensional space.

                Large-dimensional input vectors are, in a sense, projected
                down on the two dimensional map in a way that maintains
                the natural order of the input vectors.

                This dimensional reduction could allow us to visualize
                 easily important relationships among the data that
                 otherwise might go unnoticed.
                                     9
The Self-Organizing Map

A SOM is formed of neurons located on a
regular, usually 1- or 2-dimensional grid.

The neurons are connected to adjacent
neurons by a neighborhood relation
dictating the structure of the map.

In the 2-dimensional case the neurons of
the map can be arranged either on a
rectangular or a hexagonal lattice

 2                        2
     1                     1
                               0             Input   Input
         0



                                   10
The algorithm
The weights of the neurons
are initialized
                   t=0




                             2
The algorithm




Example
          2
The algorithm
The training utilizes               BMU
competitive learning.

The neuron with weight
vector most similar to the
input is called the best
matching unit (BMU).

The weights of the BMU
and neurons close to it in
the SOM lattice are
adjusted towards the
input vector.

The magnitude of the
change decreases with
time and with distance
from the BMU.
                             2
The algorithm




Next example


               2
The algorithm




     2
The algorithm




     2
The algorithm




     2
Characteristics




Inputs: State of health,   Quality of life word map
nutrition, educational
services etc.

                               2
Characteristics
    Input 3 Dimentions             Output 2 dimentions


      z



                x
                                                 x
y

                                                         y




                               2
Visualization




      2
2
Introduction




     2
Visualization




      2
Clusters of sites with similar
                   characteristics

  Soil     What crops or varieties are likely to perform well where and
           when.


Climate




Genotype




                     Homologues places for Colombian coffee production.
                     Brazil, Equator, East Africa, and New Guinea.
                                   14
                                   2
Clusters of sites with similar
           characteristics

For commercial (mass production) crops (rice, corn) it is known the
“when” and “where”

For native crops (guanabana, lulo) or special types of crops (coffee
varieties) it is not the case.

                     When and what I must cultivate ?
                     Market demand




                                               DAPA
                                               (Diversification
                                               Agriculture Project
                 The COCH project              Alliance)


                             16
                             2
1. Large database
                           The challenges
2. Multivariable problem
                                              1 point

                                                        1 Km



                                               1 Km


                                            1 336,025 points




                                 2
The challenges
                               Introduction
   1. Large datasets
   2. Multivariate problem
   Climate, management, variety, climate estimates, soil etc.

   Example. BIOCLIM is a bioclimatic prediction system which uses surrogate
   terms (bioclimatic parameters) derived from mean monthly climate
   estimates, to approximate energy and water balances at a given location

B1. Annual Mean Temperature                              B11. Mean Temperature of Coldest Quarter
B2. Mean Diurnal Range(Mean(period max-min))             B12. Annual Precipitation
B3. Isothermality (P2/P7)                                B13. Precipitation of Wettest Period
B4. Temperature Seasonality (Coefficient of Variation)   B14. Precipitation of Driest Period
B5. Max Temperature of Warmest Period                    B15. Precipitation Seasonality
B6. Min Temperature of Coldest Period                    (Coefficient of Variation)
B7. Temperature Annual Range (P5-P6)                     B16. Precipitation of Wettest Quarter
B8. Mean Temperature of Wettest Quarter                  B17. Precipitation of Driest Quarter
B9. Mean Temperature of Driest Quarter                   B18. Precipitation of Warmest Quarter
B10. Mean Temperature of Warmest Quarter                 B19. Precipitation of Coldest Quarter


                                               2
Clusters of sites with similar
                   characteristics
How to work ?
How to obtain Prototypes, Clustering and Visualization at the same
time ?

Approach
Unsupervised learning
Self-organizing maps

Two flavors of SOMs

Self-organizing maps                       Growing hierarchical map
Static map – Just one representation       Different representations to different levels




                                       2
Clusters of sites with similar
                        characteristics
Self-Organizing Map (SOM)



                                           The clusters found in the
                                           feature space in many
                                           cases are not the same as
                                           those found in geographic
                                           space.

                                           Represent clusters of a
                                           multidimensional space:
                                           map multidimensional data
                                           onto a two-dimensional
                                           lattice of cells.

                                         Similarity of sugarcane
                                         growing environmental
                                         conditions (1999-2005)
                                         using Self-organizing

                                2        maps
                                    29
Approaches
 GHSOM
             P




    2
P1. Annual Mean Temperature
                       P2. Mean Diurnal Range(Mean(period max-min))



            Introduction
                       P3. Isothermality (P2/P7)
                       P4. Temperature Seasonality (Coefficient of Variation)
                       P5. Max Temperature of Warmest Period
                       P6. Min Temperature of Coldest Period
                       P7. Temperature Annual Range (P5-P6)
                       P8. Mean Temperature of Wettest Quarter
                       P9. Mean Temperature of Driest Quarter
                       P10. Mean Temperature of Warmest Quarter
                       P11. Mean Temperature of Coldest Quarter
                       P12. Annual Precipitation
                       P13. Precipitation of Wettest Period
                       P14. Precipitation of Driest Period
                       P15. Precipitation Seasonality(Coefficient of Variation)
                       P16. Precipitation of Wettest Quarter
                       P17. Precipitation of Driest Quarter
                       P18. Precipitation of Warmest Quarter
                       P19. Precipitation of Coldest Quarter




GHSOM
Component
planes




                 2
Merci !

More Related Content

PPTX
Kohonen self organizing maps
PPTX
Self Organizing Maps
PDF
Lecture 9 Markov decision process
PDF
Brief Introduction to Boltzmann Machine
PDF
Introduction to MCMC methods
PPT
Variants of Turing Machine
PDF
Self-Organising Maps for Customer Segmentation using R - Shane Lynn - Dublin R
PDF
Ridge regression, lasso and elastic net
Kohonen self organizing maps
Self Organizing Maps
Lecture 9 Markov decision process
Brief Introduction to Boltzmann Machine
Introduction to MCMC methods
Variants of Turing Machine
Self-Organising Maps for Customer Segmentation using R - Shane Lynn - Dublin R
Ridge regression, lasso and elastic net

What's hot (20)

PDF
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
PPTX
PRML Chapter 6
PPTX
DBSCAN (2014_11_25 06_21_12 UTC)
PDF
[DL輪読会]Adversarial Learning for Zero-shot Domain Adaptation
PDF
Mean shift and Hierarchical clustering
PPTX
PDF
Introduction to Bayesian Methods
PPTX
Self-organizing map
PPT
3.1 clustering
PDF
DMTM Lecture 13 Representative based clustering
PDF
Support vector-machines-presentation
PPTX
移動ロボットのナビゲーション
PDF
Density Based Clustering
PPTX
Resampling methods
PPT
K means Clustering Algorithm
PDF
異常検知と変化検知 9章 部分空間法による変化点検知
PDF
Artificial Neural Networks Lect1: Introduction & neural computation
PPTX
Branch and bound method
PPTX
Lect5 principal component analysis
PPT
My8clst
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
PRML Chapter 6
DBSCAN (2014_11_25 06_21_12 UTC)
[DL輪読会]Adversarial Learning for Zero-shot Domain Adaptation
Mean shift and Hierarchical clustering
Introduction to Bayesian Methods
Self-organizing map
3.1 clustering
DMTM Lecture 13 Representative based clustering
Support vector-machines-presentation
移動ロボットのナビゲーション
Density Based Clustering
Resampling methods
K means Clustering Algorithm
異常検知と変化検知 9章 部分空間法による変化点検知
Artificial Neural Networks Lect1: Introduction & neural computation
Branch and bound method
Lect5 principal component analysis
My8clst
Ad

Viewers also liked (20)

PPT
Sefl Organizing Map
PDF
Neural Networks: Self-Organizing Maps (SOM)
PPT
Neural networks Self Organizing Map by Engr. Edgar Carrillo II
PDF
Customer Clustering For Retail Marketing
PPTX
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
PDF
Artificial neural networks
PPT
Cluster analysis for market segmentation
PDF
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
PPTX
Application of Clustering in Data Science using Real-life Examples
PDF
Cluster Analysis for Dummies
PPTX
The Transformation of Systems Biology Into A Large Data Science
PPTX
Intrusion Detection Model using Self Organizing Maps.
PDF
Neural network-toolbox
DOC
Intro matlab-nn
PPT
learning Vector Quantization LVQ2 LVQ3
PDF
project_report
PPTX
Matlab Neural Network Toolbox
PDF
Timo Honkela: Kohonen's Self-Organizing Maps for Intelligent Systems Developm...
PDF
Timo Honkela: Turning quantity into quality and making concepts visible using...
PDF
Kohonen SOM dan Learning Vector Quantization (LVQ)
Sefl Organizing Map
Neural Networks: Self-Organizing Maps (SOM)
Neural networks Self Organizing Map by Engr. Edgar Carrillo II
Customer Clustering For Retail Marketing
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
Artificial neural networks
Cluster analysis for market segmentation
MATLAB IMPLEMENTATION OF SELF-ORGANIZING MAPS FOR CLUSTERING OF REMOTE SENSIN...
Application of Clustering in Data Science using Real-life Examples
Cluster Analysis for Dummies
The Transformation of Systems Biology Into A Large Data Science
Intrusion Detection Model using Self Organizing Maps.
Neural network-toolbox
Intro matlab-nn
learning Vector Quantization LVQ2 LVQ3
project_report
Matlab Neural Network Toolbox
Timo Honkela: Kohonen's Self-Organizing Maps for Intelligent Systems Developm...
Timo Honkela: Turning quantity into quality and making concepts visible using...
Kohonen SOM dan Learning Vector Quantization (LVQ)
Ad

Similar to Self-organizing maps - Tutorial (20)

PPT
Bio inspired computational techniques applied to the analysis and visualizati...
DOC
Data Mining Assignment Sample Online - PDF
PDF
Implementing kohonen's som with missing data in OTB
PDF
Migue final presentation_v28
PDF
Machine_Learning_Project_Report
PPT
Improving the correlation hunting in a large quantity of SOM component planes
PDF
10.1.1.29.5089
PDF
Geographic Information: Aspects of Phenomenology and Cognition
PDF
Intrusion Detection System Using Self Organizing Map Algorithms
PDF
Image Morphing: A Literature Study
PDF
Intrusion Detection System Using Self Organizing Map Algorithms
PDF
Budthimedhee - Input2012
PDF
Van hulle springer:som
PDF
Algortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
PPTX
Self-organzing maps in Earth Observation Data Cube Analysis
KEY
20090521 Dv Brief
PDF
Map as a Service: A Framework for Visualising and Maximising Information Retu...
 
PPT
Research Data Sharing LERU
PDF
Cm31588593
Bio inspired computational techniques applied to the analysis and visualizati...
Data Mining Assignment Sample Online - PDF
Implementing kohonen's som with missing data in OTB
Migue final presentation_v28
Machine_Learning_Project_Report
Improving the correlation hunting in a large quantity of SOM component planes
10.1.1.29.5089
Geographic Information: Aspects of Phenomenology and Cognition
Intrusion Detection System Using Self Organizing Map Algorithms
Image Morphing: A Literature Study
Intrusion Detection System Using Self Organizing Map Algorithms
Budthimedhee - Input2012
Van hulle springer:som
Algortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
Self-organzing maps in Earth Observation Data Cube Analysis
20090521 Dv Brief
Map as a Service: A Framework for Visualising and Maximising Information Retu...
 
Research Data Sharing LERU
Cm31588593

More from askroll (7)

PDF
Microsoft power point curso-2006_sesion2_kohonen
PPT
Fuzzy Growing Hierarchical Self-organizing Networks
PPT
Classification of similar productivity zones in the sugar cane culture using ...
PPT
Mapas de Kohonen como una herramienta visual de apoyo al soporte de decisione...
PPT
Curso 2006 Sesion 1 Kohonen
PPT
The COCH project
PDF
Bio-inspired techniques and their application to precision agriculture (Andre...
Microsoft power point curso-2006_sesion2_kohonen
Fuzzy Growing Hierarchical Self-organizing Networks
Classification of similar productivity zones in the sugar cane culture using ...
Mapas de Kohonen como una herramienta visual de apoyo al soporte de decisione...
Curso 2006 Sesion 1 Kohonen
The COCH project
Bio-inspired techniques and their application to precision agriculture (Andre...

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPT
Teaching material agriculture food technology
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Spectroscopy.pptx food analysis technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Electronic commerce courselecture one. Pdf
PPTX
Cloud computing and distributed systems.
PDF
Approach and Philosophy of On baking technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
Digital-Transformation-Roadmap-for-Companies.pptx
The AUB Centre for AI in Media Proposal.docx
“AI and Expert System Decision Support & Business Intelligence Systems”
Teaching material agriculture food technology
Building Integrated photovoltaic BIPV_UPV.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Network Security Unit 5.pdf for BCA BBA.
Machine learning based COVID-19 study performance prediction
Spectroscopy.pptx food analysis technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Electronic commerce courselecture one. Pdf
Cloud computing and distributed systems.
Approach and Philosophy of On baking technology
Review of recent advances in non-invasive hemoglobin estimation
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Encapsulation_ Review paper, used for researhc scholars
Diabetes mellitus diagnosis method based random forest with bat algorithm
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Dropbox Q2 2025 Financial Results & Investor Presentation

Self-organizing maps - Tutorial

  • 1. "Apprentissage non supervisé" de la théorie à la pratique Miguel Arturo Barreto Sánz
  • 2. Outline ● Introduction The unsupervised learning ● The Self-Organizing Map The biological inspiration The algorithm Characteristics Examples ● Practical examples using MATLAB 1
  • 3. Introduction Unsupervised learning is a way to form “natural groupings” or clusters of patterns. Unsupervised learning seeks to determine how the data are organized. It is distinguished from supervised learning in that the learner is given only unlabeled examples. . Among neural network models, the Self-Organizing Map (SOM) are commonly used unsupervised learning algorithms. The SOM is a topographic organization in which nearby locations in the map represent inputs with similar properties. 2
  • 4. The Self-Organizing Map The biological inspiration Sensory information is processed in the neocortex by highly ordered neuronal networks. • Tangential to the cortical surface, W. Penfield representations of the sensory periphery are organized into well-ordered maps. • Taste maps in gustatory cortex (Accolla et al., 2007) • Somatotopic maps in primary somatosensory cortex (Kaas, 1991). 3
  • 5. The Self-Organizing Map The biological inspiration Other prominent cortical maps are the tonotopic organization of auditory cortex (Kalatsky et al., 2005), The most intensely studied example is the primary visual cortex, which is arranged with superimposed maps of retinotopy, ocular dominance and orientation (Bonhoeffer and Grinvald, 1991). 4
  • 6. The Self-Organizing Map The biological inspiration Humunculus 5
  • 7. The Self-Organizing Map The biological inspiration Somatosensory cortex dominated by the representation of teeth in the naked mole-rat brain Kenneth C. Catania, and Michael S. Remple. 6
  • 8. The Self-Organizing Map The biological inspiration A remarkably high degree of organization is obvious in the primary somatosensory cortex, in which a clear pattern of cytoarchitectonic units termed ‘barrels’ are observed in perfect match with the arrangement of the whiskers on the snout of the mouse (Woolsey and Van der Loos, 1970) 7
  • 9. The Self-Organizing Map The biological inspiration Mapping functionally related sensory information onto nearby cortical regions is thought to minimize axonal wiring length and simplify the synaptic circuits underlying correlation-based associational plasticity. 8
  • 10. The Self-Organizing Map In a topology-preserving map, units located physically next to each other will respond to classes of input vectors that are likewise next to each other. Although it is easy to visualize units next to each other in a Teuvo Kohonen two-dimensional array, it is not so easy to determine which classes of vectors are next to each other in a high- dimensional space. Large-dimensional input vectors are, in a sense, projected down on the two dimensional map in a way that maintains the natural order of the input vectors. This dimensional reduction could allow us to visualize easily important relationships among the data that otherwise might go unnoticed. 9
  • 11. The Self-Organizing Map A SOM is formed of neurons located on a regular, usually 1- or 2-dimensional grid. The neurons are connected to adjacent neurons by a neighborhood relation dictating the structure of the map. In the 2-dimensional case the neurons of the map can be arranged either on a rectangular or a hexagonal lattice 2 2 1 1 0 Input Input 0 10
  • 12. The algorithm The weights of the neurons are initialized t=0 2
  • 14. The algorithm The training utilizes BMU competitive learning. The neuron with weight vector most similar to the input is called the best matching unit (BMU). The weights of the BMU and neurons close to it in the SOM lattice are adjusted towards the input vector. The magnitude of the change decreases with time and with distance from the BMU. 2
  • 19. Characteristics Inputs: State of health, Quality of life word map nutrition, educational services etc. 2
  • 20. Characteristics Input 3 Dimentions Output 2 dimentions z x x y y 2
  • 22. 2
  • 25. Clusters of sites with similar characteristics Soil What crops or varieties are likely to perform well where and when. Climate Genotype Homologues places for Colombian coffee production. Brazil, Equator, East Africa, and New Guinea. 14 2
  • 26. Clusters of sites with similar characteristics For commercial (mass production) crops (rice, corn) it is known the “when” and “where” For native crops (guanabana, lulo) or special types of crops (coffee varieties) it is not the case. When and what I must cultivate ? Market demand DAPA (Diversification Agriculture Project The COCH project Alliance) 16 2
  • 27. 1. Large database The challenges 2. Multivariable problem 1 point 1 Km 1 Km 1 336,025 points 2
  • 28. The challenges Introduction 1. Large datasets 2. Multivariate problem Climate, management, variety, climate estimates, soil etc. Example. BIOCLIM is a bioclimatic prediction system which uses surrogate terms (bioclimatic parameters) derived from mean monthly climate estimates, to approximate energy and water balances at a given location B1. Annual Mean Temperature B11. Mean Temperature of Coldest Quarter B2. Mean Diurnal Range(Mean(period max-min)) B12. Annual Precipitation B3. Isothermality (P2/P7) B13. Precipitation of Wettest Period B4. Temperature Seasonality (Coefficient of Variation) B14. Precipitation of Driest Period B5. Max Temperature of Warmest Period B15. Precipitation Seasonality B6. Min Temperature of Coldest Period (Coefficient of Variation) B7. Temperature Annual Range (P5-P6) B16. Precipitation of Wettest Quarter B8. Mean Temperature of Wettest Quarter B17. Precipitation of Driest Quarter B9. Mean Temperature of Driest Quarter B18. Precipitation of Warmest Quarter B10. Mean Temperature of Warmest Quarter B19. Precipitation of Coldest Quarter 2
  • 29. Clusters of sites with similar characteristics How to work ? How to obtain Prototypes, Clustering and Visualization at the same time ? Approach Unsupervised learning Self-organizing maps Two flavors of SOMs Self-organizing maps Growing hierarchical map Static map – Just one representation Different representations to different levels 2
  • 30. Clusters of sites with similar characteristics Self-Organizing Map (SOM) The clusters found in the feature space in many cases are not the same as those found in geographic space. Represent clusters of a multidimensional space: map multidimensional data onto a two-dimensional lattice of cells. Similarity of sugarcane growing environmental conditions (1999-2005) using Self-organizing 2 maps 29
  • 32. P1. Annual Mean Temperature P2. Mean Diurnal Range(Mean(period max-min)) Introduction P3. Isothermality (P2/P7) P4. Temperature Seasonality (Coefficient of Variation) P5. Max Temperature of Warmest Period P6. Min Temperature of Coldest Period P7. Temperature Annual Range (P5-P6) P8. Mean Temperature of Wettest Quarter P9. Mean Temperature of Driest Quarter P10. Mean Temperature of Warmest Quarter P11. Mean Temperature of Coldest Quarter P12. Annual Precipitation P13. Precipitation of Wettest Period P14. Precipitation of Driest Period P15. Precipitation Seasonality(Coefficient of Variation) P16. Precipitation of Wettest Quarter P17. Precipitation of Driest Quarter P18. Precipitation of Warmest Quarter P19. Precipitation of Coldest Quarter GHSOM Component planes 2