SlideShare a Scribd company logo
Neural Network Architectures
Martin Ockajak from Zürich
Software Engineer
Outline
●
Introduction
●
Feed-forward networks
●
Convolutional networks
●
Recurrent networks
●
Learning more
Introduction
Overview
●
Class of machine learning models
●
Inspired by brain biology
●
Connectionist AI approach
●
Highly parallel computation
●
Various learning types
●
Supervised
●
Reinforcement
●
Unsupervised
Applications
●
Character recognition
●
Medical diagnostics
●
Speech recognition
●
Machine translation
●
Text generation
●
Stock price prediction
●
Optimization problems
Advantages
●
Prediction accuracy
●
Complex non-linear relationships
●
Non-constantly variable data – heteroskedasticity
●
Hard to understand problems
●
Many possible architectures
Disadvantages
●
Large amount of training data
●
Long time to train
●
Computationally expensive
●
Hard to interpret - black box
●
Many possible architectures
Feed-forward networks
Perceptron
●
Simplified model of a neuron (1957)
●
Linear binary classifier
●
Multiple numeric inputs
●
One boolean output
●
Linearly separable classes only
Perceptron
0.5 1 1.5 2
-0.5
-1
-1.5
-2
0.5
1
t
f(t)
Perceptron
●
Inputs
●
Weights
●
Bias
●
w0
●
Sum
●
Activation function
●
Unit step
Multi-layer perceptron
Multi-layer perceptron
●
Nonlinear classification or regression
●
Inputs
●
Features
●
Hidden layers
●
Parallel neurons feeding the next layer
●
Dot product
●
Sigmoid activation function
●
Output layer
●
Arbitrary activation function
Training
●
Calculate the output
●
Apply differentiable loss function
●
Must be differentiable
●
Should be minimized – optimization problem
●
Gradient descent to update the weights
●
Proportional to the learning rate
●
Stochastic approximations
Training
●
Backpropagation (1974)
●
Derivative of the loss with regard to the weights
●
Apply to previous layers by using the chain rule
●
Regularization
●
Reduce overfitting
●
L1 or L2 norm
●
Dropout – ignore random neurons during training
Convolutional networks
Convolutional networks
●
Image classification (1998)
●
Image analysis
●
Object detection
●
Recommender systems
●
Text classification
●
Spatial patterns
Convolutional networks
Convolutional networks
●
Convolutional layer
●
Filter that scans the image – convolution matrix
●
Receptive field – filter size
●
Depth – number of filters
●
Space invariant
●
Pooling layer
●
Combine cluster of neurons into one
●
Non-linear down-sampling
Convolutional networks
●
Fully connected layer
●
Dense
●
Just like in multi-layer perceptron
●
Activation function
●
Rectifier – linear but remove negative values
●
Trains faster and reduces the vanishing gradient problem
●
Output activation function
●
Softmax - single-class
●
Sigmoid - multi-class
Convolutional networks
Recurrent networks
Recurrent networks
●
Sequence prediction (1986)
●
Natural language processing
●
Speech recognition
●
Machine translation
●
Generative models
●
Temporal patterns
Recurrent networks
Recurrent networks
●
Multi-layer perceptron with back-connections
●
Topology is a directed graph
●
Internal state – memory
●
Variable length sequence with dependencies within
●
Training
●
Backpropagation through time
●
Vanishing gradient problem reduction via gated state
●
Long short-term memory (1997)
●
Gated recurrent unit (2014)
Long short-term memory
Gated recurrent unit
Learning more
Materials
●
Deep Learning @ MIT Press
●
Neural Networks and Deep Learning @ Michael Nielsen
●
Practical Deep Learning @ Coursera
●
Deep Learning Specialization @ Coursera
●
Deep Learning Courses @ edX
Libraries
●
Keras
●
Tensorflow
●
MXNet
●
Theano
●
CNTK
●
PyTorch
●
Deeplearning4j
Thank you :-)

More Related Content

PDF
PyTorch 2 Internals
PPTX
Convolution Neural Network (CNN)
PPTX
Wheatstone bridge
PDF
Unlocking the Power of Generative AI An Executive's Guide.pdf
PPT
CAN (Controller Area Network) Bus Protocol
PDF
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
PPTX
A* algorithm
PDF
Lti system
PyTorch 2 Internals
Convolution Neural Network (CNN)
Wheatstone bridge
Unlocking the Power of Generative AI An Executive's Guide.pdf
CAN (Controller Area Network) Bus Protocol
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
A* algorithm
Lti system

What's hot (20)

PPTX
Single Layer Rosenblatt Perceptron
PDF
If then rule in fuzzy logic and fuzzy implications
PDF
Training Neural Networks
PDF
An introduction to Deep Learning
PPS
Neural Networks
PPTX
Perceptron & Neural Networks
PDF
Artificial Neural Network
PDF
Deep Feed Forward Neural Networks and Regularization
PPT
Fuzzy logic control
PPTX
Reasoning in AI
PDF
L7 fuzzy relations
PPT
Artificial Neural Networks - ANN
PPTX
Statistical learning
PDF
Introduction to AI & ML
PPSX
Perceptron (neural network)
PDF
Introduction to Recurrent Neural Network
PDF
Machine Learning: Introduction to Neural Networks
PPTX
Introduction Of Artificial neural network
PDF
Naive Bayes
Single Layer Rosenblatt Perceptron
If then rule in fuzzy logic and fuzzy implications
Training Neural Networks
An introduction to Deep Learning
Neural Networks
Perceptron & Neural Networks
Artificial Neural Network
Deep Feed Forward Neural Networks and Regularization
Fuzzy logic control
Reasoning in AI
L7 fuzzy relations
Artificial Neural Networks - ANN
Statistical learning
Introduction to AI & ML
Perceptron (neural network)
Introduction to Recurrent Neural Network
Machine Learning: Introduction to Neural Networks
Introduction Of Artificial neural network
Naive Bayes
Ad

Similar to Neural Network Architectures (20)

PPTX
Jyduydufyuyf8yfiyfiyfifiyfiyviyviyfiugiuy8f7dd64d4yrsxyfhgdhfjhvjhv
PDF
From neural networks to deep learning
PDF
Introduction to deeplearning engineering
PDF
Deep learning introduction basic information
PPTX
Introduction to artificial neural network.pptx
PDF
Recurrent Neural Networks, LSTM and GRU
PPTX
Productionizing dl from the ground up
PDF
Taskerman - a distributed cluster task manager
PPTX
Deep Learning Tutorial
PDF
Intro to TensorFlow and PyTorch Workshop at Tubular Labs
PPTX
Deep learning for smart manufacturing
PPTX
Introduction to deep learning
PDF
Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)
PDF
Convolutional and Recurrent Neural Networks
PDF
Automatic Machine Learning, AutoML
PDF
An End to Order
PDF
Netflix machine learning
PDF
Cassandra - A Decentralized Structured Storage System
PDF
State of the art time-series analysis with deep learning by Javier Ordóñez at...
PDF
An End to Order (many cores with java, session two)
Jyduydufyuyf8yfiyfiyfifiyfiyviyviyfiugiuy8f7dd64d4yrsxyfhgdhfjhvjhv
From neural networks to deep learning
Introduction to deeplearning engineering
Deep learning introduction basic information
Introduction to artificial neural network.pptx
Recurrent Neural Networks, LSTM and GRU
Productionizing dl from the ground up
Taskerman - a distributed cluster task manager
Deep Learning Tutorial
Intro to TensorFlow and PyTorch Workshop at Tubular Labs
Deep learning for smart manufacturing
Introduction to deep learning
Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)
Convolutional and Recurrent Neural Networks
Automatic Machine Learning, AutoML
An End to Order
Netflix machine learning
Cassandra - A Decentralized Structured Storage System
State of the art time-series analysis with deep learning by Javier Ordóñez at...
An End to Order (many cores with java, session two)
Ad

Recently uploaded (20)

PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PDF
medical staffing services at VALiNTRY
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
System and Network Administration Chapter 2
PPTX
Odoo POS Development Services by CandidRoot Solutions
PPTX
L1 - Introduction to python Backend.pptx
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
history of c programming in notes for students .pptx
PDF
Designing Intelligence for the Shop Floor.pdf
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
Digital Strategies for Manufacturing Companies
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PPTX
Computer Software and OS of computer science of grade 11.pptx
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
medical staffing services at VALiNTRY
Understanding Forklifts - TECH EHS Solution
Odoo Companies in India – Driving Business Transformation.pdf
Upgrade and Innovation Strategies for SAP ERP Customers
System and Network Administration Chapter 2
Odoo POS Development Services by CandidRoot Solutions
L1 - Introduction to python Backend.pptx
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
history of c programming in notes for students .pptx
Designing Intelligence for the Shop Floor.pdf
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Operating system designcfffgfgggggggvggggggggg
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Wondershare Filmora 15 Crack With Activation Key [2025
Digital Strategies for Manufacturing Companies
wealthsignaloriginal-com-DS-text-... (1).pdf
Computer Software and OS of computer science of grade 11.pptx

Neural Network Architectures