SlideShare a Scribd company logo
3
Most read
6
Most read
Professor Juha Karhunen
Applications of
Machine Learning
Personnel of the AML group
Group Head Docents Postdocs Doctoral Students Students
Juha
Karhunen
Miki Sirola Francesco
Corona
Mark van
Heeswijk
Matthieu
Molinier
Alexander
Grigorievskiy
Luiza
Sayfullina
Zhao Chen
Scope and Goals
The AML group carries out both theoretical and experimental work
on developing and applying new machine learning techniques for
solving various application problems.
More specific research topics:
● Time series analysis and prediction;
● Dimensionality reduction;
● Extreme learning machines;
● Environmental applications;
● Industrial applications;
● Classification of web sites based on images;
● Detection of malicious Android software.
see also http://research.ics.aalto.fi/eiml/publications.shtml
Time series analysis and prediction
● Formerly the main research topic of the group.
● Often one predicts only one step ahead.
● We have studied prediction farther away.
● Linear methods for time series prediction and analysis are well-
known.
● We have used nonlinear neural network and machine learning
methods.
● Lots of possible applications in various areas of life.
Time series analysis and forecasting
● Compare and combine various time
series methods: neural networks,
Gaussian Processes, State-Space
models.
● Focus on accuracy, computational
speed and probabilistic forecasting.
● Address the problems of missing
observations and unevenly sampled
time series.
● Currently we use Astronomical and
Electricity Consumption data.
Dimensionality reduction - Variable selection
● The data are often too high-dimensional for methods used.
● The computation time can explode.
● This curse of dimensionality can be handled by data compression
or variable selection.
● In variable selection, one selects the most important variables for
the task at hand.
● The other variables are discarded.
● We have studied different methods for variable selection.
● And tested them with many real-world data sets.
Dimensionality reduction - 2D Linear projections
Supervised distance preserving projections, SDPP.
● Local pairwise-match of squared distances in
projection and response or label space
● Optimisation via QSDP/SDLP or CG
● Kernel-SDPP for nonlinear problems
Stochastic discriminant analysis, SDA.
● Pairwise-match of Student’s t probabilities in
projection and label space (KL divergence)
● Gradient-based optimisation and regularisation
Neural Networks - Extreme Learning Machines
● Efficient and effective neural networks based on random
nonlinear feature extraction, scalable to large data sets due to fast
training.
Neural Networks - Extreme Learning Machines
● Many improvements have been explored in our group:
○ hidden layer pruning
○ proper and fast regularization
○ improved accuracy through ensembles
○ GPU-acceleration and parallelization
○ sparse binary/ternary features
○ feature selection
○ compressive training algorithms
Environmental and industrial applications
● Dr. Francesco Corona leads this part.
● Multivariate predictive control of wastewater treatment plants
(EU project, DIAMOND).
● Monitoring nitrate concentration in wastewater treatment plants
(Viikinmäki, Helsinki).
● Property prediction of fuels in oil refineries (Sarroch, Italy).
● Equipment aging related noise measurement with TVO Olkiluoto
nuclear power plant (Miki Sirola and a student making his Diploma
thesis).
Classification of web sites
● Web sites have been tried
to classify thus far only
based on the text they
have.
● We are using images on web sites for their classification.
● Trying to separate benign (harmless) web sites from undesirable
ones.
● The classes of these undesirable web sites are for example crime,
porn, racism, war, etc.
CloSe Project: Android Malware Detection
● The research is done in collaboration with F-Secure corporation
● They provided us a huge dataset of 120K malicious and benign
files
● Main research goal is how to efficiently reduce the dimensionality
of high-dimensional sparse binary data set for minimizing the
desired cost function
● First publication on this topic: “Efficient detection of zero-day
Android Malware using Normalized Bernoulli Naive Bayes”
● Graduate student Luiza Sayfullina works in this project
● Her instructor is Dr. Emil Eirola, and advisor Prof. Alex Jung
Dealing with high-dimensional sparse data
● Major issues are how to deal with sparsity, how to make a concise
representation of the data, and what properties of the dataset will
affect the choice of the dimensionality reduction.
● Below you can see sparse bag of words model sample from our
malware dataset. In practice, random projections work well for
sparse data compression.
Teaching responsibilities
● Prof. Juha Karhunen lectures the course T-61.5130 Machine
Learning and Neural Networks (autumn).
● The course was renovated in autumn 2015.
● The assistant Dr. Mark van Heeswijk of this course comes from our
AML research group.
● Juha Karhunen is the supervising professor for several Master’s
(Diploma) thesis made outside our department every year.
● Alexander Grigorevskiy is a teaching assistant of T-61.3050
Machine Learning: Basic Principles. Previously he has been
teaching assistant of T-61.3040 Statistical Signal Modeling.
Homepage
For more info, see our homepage
http://research.ics.aalto.fi/eiml/

More Related Content

PPTX
Supervised learning and Unsupervised learning
PPTX
Overview of Artificial Intelligence in Cybersecurity
PDF
Machine Learning and Applications
PPTX
Application of machine learning in industrial applications
PPTX
Supervised and unsupervised learning
PDF
Back Propagation Neural Network In AI PowerPoint Presentation Slide Templates...
PDF
Applications in Machine Learning
PPTX
Introduction To Machine Learning
Supervised learning and Unsupervised learning
Overview of Artificial Intelligence in Cybersecurity
Machine Learning and Applications
Application of machine learning in industrial applications
Supervised and unsupervised learning
Back Propagation Neural Network In AI PowerPoint Presentation Slide Templates...
Applications in Machine Learning
Introduction To Machine Learning

What's hot (20)

PPTX
Presentation on unsupervised learning
PPTX
Machine Learning
PPTX
Orange Data Mining and Data Visualization Tool
PPTX
Data mining technique (decision tree)
PDF
ML DL AI DS BD - An Introduction
PDF
Bias and variance trade off
PPTX
Decision Trees
PPT
2.4 rule based classification
PPT
Clustering
PDF
Anomaly Detection in Seasonal Time Series
PDF
Hierarchical clustering
PPTX
Dynamic Programming
PPT
Machine Learning
PPTX
Cryptography
PPTX
PDF
Transfer Learning
PDF
Machine Learning and its Applications
PPT
Machine learning
PDF
Machine Learning
PPTX
Semi-Supervised Learning
Presentation on unsupervised learning
Machine Learning
Orange Data Mining and Data Visualization Tool
Data mining technique (decision tree)
ML DL AI DS BD - An Introduction
Bias and variance trade off
Decision Trees
2.4 rule based classification
Clustering
Anomaly Detection in Seasonal Time Series
Hierarchical clustering
Dynamic Programming
Machine Learning
Cryptography
Transfer Learning
Machine Learning and its Applications
Machine learning
Machine Learning
Semi-Supervised Learning
Ad

Similar to Applications of Machine Learning (20)

PPT
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
PDF
Applications of machine learning
PDF
Machine learning pour les données massives algorithmes randomis´es, en ligne ...
PDF
A Compendium of Various Applications of Machine Learning
PDF
Introduction To Applied Machine Learning
PDF
Nonlinear image processing using artificial neural
PPT
Lec1-Into
PDF
Experts Vision- Portfolio Jan23
PDF
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
PPTX
Machine Learning Summary for Caltech2
PPTX
Lecture 5 ml
PPTX
And Then There Are Algorithms
PDF
Machine_Learning_Blocks___Bryan_Thesis
PDF
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
PDF
Machine learning and its parameter is discussed here
PPTX
rsec2a-2016-jheaton-morning
PDF
Time ser
PDF
Classifier Model using Artificial Neural Network
PPTX
machine learning in the age of big data: new approaches and business applicat...
PDF
Machine Learning: Past, Present and Future - by Tom Dietterich
Unexpected Challenges in Large Scale Machine Learning by Charles Parker
Applications of machine learning
Machine learning pour les données massives algorithmes randomis´es, en ligne ...
A Compendium of Various Applications of Machine Learning
Introduction To Applied Machine Learning
Nonlinear image processing using artificial neural
Lec1-Into
Experts Vision- Portfolio Jan23
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Machine Learning Summary for Caltech2
Lecture 5 ml
And Then There Are Algorithms
Machine_Learning_Blocks___Bryan_Thesis
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Machine learning and its parameter is discussed here
rsec2a-2016-jheaton-morning
Time ser
Classifier Model using Artificial Neural Network
machine learning in the age of big data: new approaches and business applicat...
Machine Learning: Past, Present and Future - by Tom Dietterich
Ad

More from Department of Computer Science, Aalto University (14)

PDF
Data strategy aija leiponen_01112016
PDF
Tiedon jakaminen: Case Mobility as a Service MaaS
PDF
MaaS Global to revolutionize the global transportation market with Whim
PDF
Jakamo - Supply chain collaboration platform
PPTX
Fingrid ja yhteiskäyttöinen tieto
PPTX
Digital Data-Driven Healthcare and Wellbeing
PPTX
Probabilistic Machine Learning
PPTX
Distributed Systems, Mobile Computing and Security
PPTX
Kernel-based machine learning methods
Data strategy aija leiponen_01112016
Tiedon jakaminen: Case Mobility as a Service MaaS
MaaS Global to revolutionize the global transportation market with Whim
Jakamo - Supply chain collaboration platform
Fingrid ja yhteiskäyttöinen tieto
Digital Data-Driven Healthcare and Wellbeing
Probabilistic Machine Learning
Distributed Systems, Mobile Computing and Security
Kernel-based machine learning methods

Recently uploaded (20)

PPTX
1.pptx 2.pptx for biology endocrine system hum ppt
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PPTX
famous lake in india and its disturibution and importance
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Microbiology with diagram medical studies .pptx
PPTX
Production technology of seed spices,,,,
PPTX
Cell Membrane: Structure, Composition & Functions
PPT
protein biochemistry.ppt for university classes
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
The scientific heritage No 166 (166) (2025)
1.pptx 2.pptx for biology endocrine system hum ppt
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
famous lake in india and its disturibution and importance
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Biophysics 2.pdffffffffffffffffffffffffff
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
neck nodes and dissection types and lymph nodes levels
ECG_Course_Presentation د.محمد صقران ppt
Microbiology with diagram medical studies .pptx
Production technology of seed spices,,,,
Cell Membrane: Structure, Composition & Functions
protein biochemistry.ppt for university classes
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
POSITIONING IN OPERATION THEATRE ROOM.ppt
AlphaEarth Foundations and the Satellite Embedding dataset
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
The scientific heritage No 166 (166) (2025)

Applications of Machine Learning

  • 2. Personnel of the AML group Group Head Docents Postdocs Doctoral Students Students Juha Karhunen Miki Sirola Francesco Corona Mark van Heeswijk Matthieu Molinier Alexander Grigorievskiy Luiza Sayfullina Zhao Chen
  • 3. Scope and Goals The AML group carries out both theoretical and experimental work on developing and applying new machine learning techniques for solving various application problems. More specific research topics: ● Time series analysis and prediction; ● Dimensionality reduction; ● Extreme learning machines; ● Environmental applications; ● Industrial applications; ● Classification of web sites based on images; ● Detection of malicious Android software. see also http://research.ics.aalto.fi/eiml/publications.shtml
  • 4. Time series analysis and prediction ● Formerly the main research topic of the group. ● Often one predicts only one step ahead. ● We have studied prediction farther away. ● Linear methods for time series prediction and analysis are well- known. ● We have used nonlinear neural network and machine learning methods. ● Lots of possible applications in various areas of life.
  • 5. Time series analysis and forecasting ● Compare and combine various time series methods: neural networks, Gaussian Processes, State-Space models. ● Focus on accuracy, computational speed and probabilistic forecasting. ● Address the problems of missing observations and unevenly sampled time series. ● Currently we use Astronomical and Electricity Consumption data.
  • 6. Dimensionality reduction - Variable selection ● The data are often too high-dimensional for methods used. ● The computation time can explode. ● This curse of dimensionality can be handled by data compression or variable selection. ● In variable selection, one selects the most important variables for the task at hand. ● The other variables are discarded. ● We have studied different methods for variable selection. ● And tested them with many real-world data sets.
  • 7. Dimensionality reduction - 2D Linear projections Supervised distance preserving projections, SDPP. ● Local pairwise-match of squared distances in projection and response or label space ● Optimisation via QSDP/SDLP or CG ● Kernel-SDPP for nonlinear problems Stochastic discriminant analysis, SDA. ● Pairwise-match of Student’s t probabilities in projection and label space (KL divergence) ● Gradient-based optimisation and regularisation
  • 8. Neural Networks - Extreme Learning Machines ● Efficient and effective neural networks based on random nonlinear feature extraction, scalable to large data sets due to fast training.
  • 9. Neural Networks - Extreme Learning Machines ● Many improvements have been explored in our group: ○ hidden layer pruning ○ proper and fast regularization ○ improved accuracy through ensembles ○ GPU-acceleration and parallelization ○ sparse binary/ternary features ○ feature selection ○ compressive training algorithms
  • 10. Environmental and industrial applications ● Dr. Francesco Corona leads this part. ● Multivariate predictive control of wastewater treatment plants (EU project, DIAMOND). ● Monitoring nitrate concentration in wastewater treatment plants (Viikinmäki, Helsinki). ● Property prediction of fuels in oil refineries (Sarroch, Italy). ● Equipment aging related noise measurement with TVO Olkiluoto nuclear power plant (Miki Sirola and a student making his Diploma thesis).
  • 11. Classification of web sites ● Web sites have been tried to classify thus far only based on the text they have. ● We are using images on web sites for their classification. ● Trying to separate benign (harmless) web sites from undesirable ones. ● The classes of these undesirable web sites are for example crime, porn, racism, war, etc.
  • 12. CloSe Project: Android Malware Detection ● The research is done in collaboration with F-Secure corporation ● They provided us a huge dataset of 120K malicious and benign files ● Main research goal is how to efficiently reduce the dimensionality of high-dimensional sparse binary data set for minimizing the desired cost function ● First publication on this topic: “Efficient detection of zero-day Android Malware using Normalized Bernoulli Naive Bayes” ● Graduate student Luiza Sayfullina works in this project ● Her instructor is Dr. Emil Eirola, and advisor Prof. Alex Jung
  • 13. Dealing with high-dimensional sparse data ● Major issues are how to deal with sparsity, how to make a concise representation of the data, and what properties of the dataset will affect the choice of the dimensionality reduction. ● Below you can see sparse bag of words model sample from our malware dataset. In practice, random projections work well for sparse data compression.
  • 14. Teaching responsibilities ● Prof. Juha Karhunen lectures the course T-61.5130 Machine Learning and Neural Networks (autumn). ● The course was renovated in autumn 2015. ● The assistant Dr. Mark van Heeswijk of this course comes from our AML research group. ● Juha Karhunen is the supervising professor for several Master’s (Diploma) thesis made outside our department every year. ● Alexander Grigorevskiy is a teaching assistant of T-61.3050 Machine Learning: Basic Principles. Previously he has been teaching assistant of T-61.3040 Statistical Signal Modeling.
  • 15. Homepage For more info, see our homepage http://research.ics.aalto.fi/eiml/