Efficient Convolutional Neural Network Architecture for Image Classification
Presented By: Yogendra Tamang (MSCS-070-670)
Supervisor: Prof. Dr. Sashidhar Ram Joshi
Outline
• Background
• Convolutional Neural Network
• Objectives
• Methodology
• Work Accomplished
• Work Remaining
• References
Background
• Learning
  • Supervised
  • Unsupervised
• AI Tasks
  • Classification and Regression
  • Clustering
Machine Learning Problem
• Supervised: Regression, Classification
• Unsupervised: Clustering
Background
• Classification
  • Classifies data into one of a set of discrete classes
  • E.g. classifying digits
  • The cost function for a classification task may be the logistic (cross-entropy) loss or the negative log-likelihood
• Regression
  • Predicts a continuous real-valued output
  • E.g. stock price prediction
  • The cost function for a regression problem is typically the MSE (Mean Squared Error)
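The two cost functions named on this slide can be sketched in a few lines. This is only an illustration: the function names and the numbers are made up for the example and do not come from the project code.

```python
import math

def mean_squared_error(predictions, targets):
    """MSE for regression: the average of the squared differences."""
    n = len(targets)
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / n

def cross_entropy(class_probs, true_class):
    """Negative log-likelihood of the true class for a single example."""
    return -math.log(class_probs[true_class])

# Regression: predicted vs. actual values (made-up numbers).
mse = mean_squared_error([2.0, 4.0], [1.0, 6.0])   # (1 + 4) / 2

# Classification: predicted probabilities over 3 classes, true class is 1.
loss = cross_entropy([0.25, 0.5, 0.25], 1)         # -log(0.5)
```

The classification loss grows without bound as the probability assigned to the true class approaches zero, while the MSE grows quadratically with the prediction error.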
Multi-Layer Perceptrons (MLPs)
[Figure: an MLP with an input layer, a hidden layer, and an output layer]
Convolutional Neural Networks
• One or more convolutional layers
• Followed by one or more fully connected layers
• Resulting in networks that are easier to train and have far fewer parameters than fully connected networks of comparable size.
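A rough count shows where the parameter saving comes from. The sizes below (a 28x28 grayscale input, 100 hidden units, 20 filters of 5x5) are assumptions chosen for illustration, not figures from the slides, and the two layers are not equivalent in what they compute.

```python
def dense_params(n_inputs, n_units):
    """Weights plus biases of a fully connected layer."""
    return n_inputs * n_units + n_units

def conv_params(in_channels, n_filters, kernel_size):
    """Weights plus biases of a convolution layer; each filter is shared
    across every spatial position, so the image size does not appear."""
    return n_filters * (in_channels * kernel_size * kernel_size) + n_filters

# Hypothetical layer sizes for a 28x28 single-channel image.
dense = dense_params(28 * 28, 100)   # 78,500 parameters
conv = conv_params(1, 20, 5)         # 520 parameters
```

The convolution layer's count depends only on the filter size and the number of channels, which is why it stays small as the input image grows.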
Objectives
• To classify images using a CNN
• To design an effective CNN architecture for the image classification task.
Convolutional Neural Networks
• Receptive fields (RFs)
• Apply filters to the image.
• Pooling and subsampling layers
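Applying a filter over receptive fields can be sketched as below. This is a minimal pure-Python illustration with no padding and stride 1; the function name, image, and kernel are made up for the example. Like most CNN libraries, it computes cross-correlation (the kernel is not flipped).

```python
def conv2d_valid(image, kernel):
    """Slide `kernel` over `image` (lists of rows), no padding, stride 1."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # The receptive field of output (r, c) is the kh x kw patch
            # of the image whose top-left corner is at (r, c).
            total = 0.0
            for i in range(kh):
                for j in range(kw):
                    total += image[r + i][c + j] * kernel[i][j]
            row.append(total)
        output.append(row)
    return output

image = [
    [1, 2, 3, 0],
    [4, 5, 6, 0],
    [7, 8, 9, 0],
]
kernel = [
    [1, 0],
    [0, -1],
]
feature_map = conv2d_valid(image, kernel)   # 3x4 input -> 2x3 output
```

Every output value is computed with the same four kernel weights, which is the weight sharing that keeps the parameter count low.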
Convolutional Neural Network
Methodology
• Dataset split: Training Set, Validation Set, Testing Set
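A three-way split of this kind might look like the sketch below. The 80/10/10 proportions, the function name, and the fixed seed are assumptions for illustration; the slides do not state the split that was used.

```python
import random

def split_dataset(examples, val_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle once, then carve out test, validation, and training sets."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test

# Integers stand in for (image, label) pairs.
train_set, val_set, test_set = split_dataset(range(100))
```

The validation set is used to monitor training and choose hyperparameters, while the testing set is held back for the final evaluation.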
Methodology
• Convolution Layer Design
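One design quantity worth checking is the spatial output size of a convolution layer. The standard formula is sketched below; the example values (32x32 input, 5x5 filter, padding 2, stride 1) follow the configuration described in the editor's notes at the end of this deck, and the function name is made up.

```python
def conv_output_size(input_size, kernel_size, padding=0, stride=1):
    """Output size along one spatial dimension of a convolution."""
    return (input_size + 2 * padding - kernel_size) // stride + 1

# With padding 2, a 5x5 filter at stride 1 preserves the 32x32 size.
same = conv_output_size(32, kernel_size=5, padding=2, stride=1)

# Without padding, the same filter shrinks the output to 28x28.
valid = conv_output_size(32, kernel_size=5)
```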
Methodology
• Pooling Layer Design
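Max pooling over non-overlapping windows can be sketched as follows. The 2x2 window matches the pooling size given in the editor's notes; the function name and the sample feature map are made up for illustration.

```python
def max_pool2d(feature_map, size=2):
    """Non-overlapping max pooling (stride equal to the window size)."""
    pooled = []
    for r in range(0, len(feature_map) - size + 1, size):
        row = []
        for c in range(0, len(feature_map[0]) - size + 1, size):
            window = [feature_map[r + i][c + j]
                      for i in range(size) for j in range(size)]
            row.append(max(window))
        pooled.append(row)
    return pooled

feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 5],
    [0, 1, 7, 2],
    [6, 2, 3, 4],
]
pooled = max_pool2d(feature_map)   # 4x4 -> 2x2
```

Each 2x2 window is reduced to its largest value, so the width and height are halved while the number of channels is unchanged.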
Methodology
Example CNN Architecture
Learning a Classifier
• Gradient Descent Algorithm
  • Calculate the cost (loss) function J(w)
  • Calculate the gradient ∂J(w)/∂w
  • Update the weights
• Stochastic Gradient Descent: updates the weights after each example.
• Minibatch SGD: updates the weights after each batch of examples.
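The loop of computing the loss gradient and updating the weights can be sketched on a toy problem. The example below fits a line with minibatch SGD; the model, learning rate, batch size, and data are all assumptions for illustration and stand in for the much larger CNN training loop.

```python
import random

def minibatch_sgd(data, lr=0.1, batch_size=4, epochs=500, seed=0):
    """Fit y = w * x + b by minimizing mean squared error with minibatch SGD."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    data = list(data)
    for _ in range(epochs):
        rng.shuffle(data)
        for start in range(0, len(data), batch_size):
            batch = data[start:start + batch_size]
            # Gradient of J(w, b) = mean((w*x + b - y)^2) over the batch.
            grad_w = sum(2 * (w * x + b - y) * x for x, y in batch) / len(batch)
            grad_b = sum(2 * (w * x + b - y) for x, y in batch) / len(batch)
            # Update step: move against the gradient.
            w -= lr * grad_w
            b -= lr * grad_b
    return w, b

# Noise-free toy data drawn from y = 2x + 1.
points = [(i / 4, 2 * (i / 4) + 1) for i in range(8)]
w, b = minibatch_sgd(points)
```

Setting `batch_size=1` gives stochastic gradient descent with one update per example, and setting it to `len(points)` gives full-batch gradient descent.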
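Learning a Classifier: Negative Log-Likelihood
$$\mathrm{NLL}(\theta, \mathcal{D}) = -\sum_{i=0}^{|\mathcal{D}|} \log P\left(Y = y^{(i)} \mid x^{(i)}, \theta\right)$$
where $\mathcal{D}$ is the dataset, $\theta$ is the weight parameter, $(x^{(i)}, y^{(i)})$ is the $i$-th training example, and $Y$ is the target.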
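The negative log-likelihood can be computed directly from a model's class scores. The sketch below assumes a softmax output layer, which the slides do not state explicitly; the function names and the scores are made up for illustration.

```python
import math

def softmax(scores):
    """Turn raw class scores into probabilities (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def negative_log_likelihood(all_scores, labels):
    """Sum of -log P(Y = y_i | x_i, theta) over the dataset."""
    nll = 0.0
    for scores, label in zip(all_scores, labels):
        probs = softmax(scores)
        nll -= math.log(probs[label])
    return nll

# Two examples, three classes; the scores stand in for a model's outputs.
scores = [[0.0, 0.0, 0.0], [0.0, math.log(3.0), 0.0]]
labels = [2, 1]
nll = negative_log_likelihood(scores, labels)   # log(3) + log(5/3) = log(5)
```

Minimizing this quantity is the same as maximizing the probability the model assigns to the correct labels.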
Work Accomplished
1. GPU configuration to support CUDA.
2. CNN architecture for the CIFAR-10 dataset
3. CNN architecture for the MNIST dataset
INPUT -> CONV -> MAXPOOL -> CONV -> MAXPOOL -> FULL -> OUTPUT
MNIST Dataset Training and Output
Training Loss, Validation Loss, and Validation Accuracy on MNIST Dataset
[Figure: CNN running over the MNIST dataset. Training loss, validation loss, and validation accuracy (y-axis, 0 to 1.2) plotted against epochs 1 to 10.]
Work Remaining
• Dropout Implementation
• Parameter Changing
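As a pointer for the remaining dropout work, one common formulation is inverted dropout, sketched below. This is an assumption about how it could be done, not the project's implementation; the function name, drop probability, and activations are hypothetical.

```python
import random

def dropout(activations, drop_prob, training, rng):
    """Inverted dropout: during training, zero each unit with probability
    drop_prob and rescale the survivors so the expected value is unchanged.
    At evaluation time the activations pass through untouched."""
    if not training or drop_prob == 0.0:
        return list(activations)
    keep_prob = 1.0 - drop_prob
    return [a / keep_prob if rng.random() < keep_prob else 0.0
            for a in activations]

rng = random.Random(0)   # fixed seed so the example is repeatable
hidden = [1.0, 2.0, 3.0, 4.0]
train_out = dropout(hidden, drop_prob=0.5, training=True, rng=rng)
eval_out = dropout(hidden, drop_prob=0.5, training=False, rng=rng)
```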
Time Schedule


Editor's Notes

  • #17: The layers in the CNN are: input layer, convolution layer, pooling layer, dense layer, and output layer. The input layer takes 32x32 images with 3 colour channels, i.e. 3x32x32. The first convolution layer consists of 20 filters of size 5x5 with padding 2 and stride 1, so it preserves the spatial size and produces a 20x32x32 output. The pooling layer after the first convolution applies the max function over 2x2 patches, reducing the output to 20x16x16. The second convolution layer has the same properties as the first and produces an output of size 20x16x16. The max pooling layer after the second convolution reduces this to 20x8x8. The fully connected layer after pooling consists of 1000 units, and it is connected to an output layer with 10 units, one for each class.
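The sizes quoted in this note can be checked by tracing the tensor shape through the layers. The sketch below uses only the figures given in the note (3x32x32 input, 20 filters of 5x5 with padding 2 and stride 1, 2x2 max pooling, 1000 dense units, 10 outputs); the function names are made up for illustration.

```python
def conv_out(size, kernel, padding, stride):
    """Output size along one spatial dimension of a convolution."""
    return (size + 2 * padding - kernel) // stride + 1

def trace_shapes():
    """Return (layer name, output shape) pairs for the described network."""
    channels, size = 3, 32                       # input: 3x32x32
    shapes = [("input", (channels, size, size))]
    for name in ("conv1", "conv2"):
        size = conv_out(size, kernel=5, padding=2, stride=1)
        channels = 20                            # 20 filters per conv layer
        shapes.append((name, (channels, size, size)))
        size = size // 2                         # 2x2 max pooling halves each side
        shapes.append((name.replace("conv", "pool"), (channels, size, size)))
    shapes.append(("full", (1000,)))
    shapes.append(("output", (10,)))
    return shapes

shapes = trace_shapes()
flattened = 20 * 8 * 8   # number of inputs to the fully connected layer

for layer_name, shape in shapes:
    print(layer_name, shape)
```

The trace reproduces the 20x16x16 and 20x8x8 sizes stated above, and shows that the fully connected layer receives 1280 values per image.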