HANDWRITTEN
DIGIT
RECOGNITION
(A Convolutional Neural Network Approach)
Rishabh Tyagi
(Maharaja Agrasen
Institute of Technology)
MAIN GOAL & APPLICATIONS
• Handwritten digit recognition is the task of
recognizing digits that are written by hand.
• A handwritten digit recognition system is a common
way to demonstrate and visualize artificial neural networks.
• It is already widely used in the automatic
processing of bank cheques and postal addresses,
and in mobile phones.
Introduction
• Scientists believe that the most intelligent
device is the human brain.
• No computer can match the efficiency of the
human brain. This shortcoming of conventional
computers has led to the development of the
“Artificial Neural Network”.
• Neural networks differ from conventional systems
in that, rather than being programmed, they
learn to recognize patterns.
What are Neural Networks?
• Artificial neural networks, usually called neural networks
(NNs), are interconnected systems composed of many simple
processing elements (neurons) operating in parallel, whose
function is determined by:
1) Network structure
2) Connection strengths
3) The processing performed at the computing elements or
nodes.
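The three ingredients listed above can be made concrete with a very small sketch. It is only an illustration: the sigmoid activation, the weights, and the biases are assumptions chosen for the example and do not come from the slides.

```python
import math

def neuron(inputs, weights, bias):
    """One processing element: weighted sum of the inputs, then a sigmoid.

    The weights are the "connection strengths"; the sigmoid is an assumed
    choice for the processing performed at the node.
    """
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def layer(inputs, weight_rows, biases):
    """A layer is several neurons reading the same inputs in parallel."""
    return [neuron(inputs, w, b) for w, b in zip(weight_rows, biases)]

# Network structure: 2 inputs feeding one layer of 2 neurons.
# All numbers below are hypothetical.
outputs = layer([1.0, 0.0], [[0.0, 0.0], [2.0, -1.0]], [0.0, 0.0])
print(outputs)
```

Changing the structure (how many neurons and layers), the weights, or the activation changes what the network computes, which is the point of the three items in the list.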
A neural cell in the brain
Training Dataset
• The network is trained on the MNIST dataset.
• MNIST has a training set of 60,000
examples and a test set of 10,000 examples.
• All the images in the dataset are 28x28
pixels.
• It is a good database for people who want to try learning
techniques and pattern recognition methods on real-world
data while spending minimal effort on preprocessing and
formatting.
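Even with MNIST, a little preparation is usually done before training. The sketch below shows two common steps, scaling 8-bit grayscale pixels to the range [0, 1] and turning a digit label into a one-hot vector. The slides do not say which steps were used, so both functions and the synthetic image are assumptions for illustration only.

```python
def scale_pixels(image):
    """Map 8-bit grayscale values (0..255) to floats in [0, 1]."""
    return [[p / 255.0 for p in row] for row in image]

def one_hot(label, num_classes=10):
    """Encode a digit label 0..9 as a vector with a single 1.0."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]

# A synthetic 28x28 image (not real MNIST data) with one white pixel.
image = [[0] * 28 for _ in range(28)]
image[14][14] = 255

scaled = scale_pixels(image)
target = one_hot(3)
print(len(scaled), len(scaled[0]), scaled[14][14], target)
```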
Why Convolutions?
Convolution is a simple mathematical operation
between two matrices (a filter and an equally sized
patch of the image) in which one is multiplied by
the other element-wise and the sum of all these
products is calculated.
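The multiply-and-sum step can be written out directly. This is a minimal plain-Python sketch, assuming no padding and a stride of 1; like most CNN libraries it does not flip the kernel, so strictly speaking it computes a cross-correlation. The image and kernel values are made up for the example.

```python
def multiply_and_sum(patch, kernel):
    """Element-wise product of two equally sized matrices, then the sum."""
    return sum(p * k
               for patch_row, kernel_row in zip(patch, kernel)
               for p, k in zip(patch_row, kernel_row))

def convolve2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # Cut out the patch currently under the kernel.
            patch = [image_row[c:c + kw] for image_row in image[r:r + kh]]
            row.append(multiply_and_sum(patch, kernel))
        out.append(row)
    return out

image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
kernel = [[1, 0],
          [0, -1]]
result = convolve2d(image, kernel)
print(result)  # [[-4, -4], [-4, -4]]
```

Each output value is one multiply-and-sum; for the top-left position it is 1*1 + 2*0 + 4*0 + 5*(-1) = -4.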
Convolutions are used for several reasons:
• Convolutions provide better feature extraction.
• They save a lot of computation compared to fully connected ANNs.
• Fewer parameters are created than in
pure fully connected layers.
• Because fewer parameters are required,
fewer fully connected layers are needed.
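The parameter saving can be checked with simple arithmetic. The sketch below compares a convolutional layer of 16 filters of size 5x5 on a 28x28 single-channel image with a fully connected layer that maps the same input to an output of the same size (16x28x28). The helper names are hypothetical, and one bias per filter or per output node is assumed.

```python
def conv_params(in_channels, out_channels, kernel_size):
    """Weights of every filter plus one bias per filter (assumed)."""
    return out_channels * (in_channels * kernel_size * kernel_size) + out_channels

def dense_params(in_features, out_features):
    """One weight per input/output pair plus one bias per output (assumed)."""
    return in_features * out_features + out_features

conv_layer = conv_params(in_channels=1, out_channels=16, kernel_size=5)
dense_layer = dense_params(in_features=28 * 28, out_features=16 * 28 * 28)

print(conv_layer)   # 416
print(dense_layer)  # 9847040
```

The convolutional layer reuses the same 25 weights of each filter at every image position, which is why it needs a few hundred parameters where the fully connected layer needs millions.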
Architecture of a Convolutional Neural Network
Images are taken using a webcam
• OpenCV functions are used to capture
images from the webcam.
Pre-Processing of images
Pre-processing of images is done using a Python library called OpenCV.
It provides functions that make the necessary changes to an image
before it is passed to the network.
• Gaussian blur
– Gaussian blur smooths an image.
• Adaptive threshold
– In adaptive thresholding, the algorithm calculates the threshold for small
regions of the image. We therefore get different thresholds for different regions of
the same image, which gives better results for images with varying
illumination.
• Dilation
– Dilation makes the digits bigger (thicker).
– Dilation is very useful where digits contain holes caused by noise.
• Erosion
– Erosion makes the digits smaller or thinner.
– This reduces noise, since thin specks of noise vanish after erosion.
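OpenCV provides these operations as `cv2.adaptiveThreshold`, `cv2.dilate` and `cv2.erode`. To show what they do, the sketch below re-implements the ideas in plain Python on tiny images. It is an illustration under simplifying assumptions (a square neighbourhood, a mean-based threshold, borders handled by clipping), not OpenCV's implementation, and all pixel values are made up. Gaussian blur is left out to keep the example short.

```python
def window(img, r, c, radius):
    """Pixels in the square neighbourhood of (r, c), clipped at the border."""
    rows = range(max(0, r - radius), min(len(img), r + radius + 1))
    cols = range(max(0, c - radius), min(len(img[0]), c + radius + 1))
    return [img[i][j] for i in rows for j in cols]

def adaptive_threshold(img, radius=1, offset=0):
    """Set a pixel to 1 if it is brighter than the mean of its own region."""
    out = []
    for r in range(len(img)):
        row = []
        for c in range(len(img[0])):
            local = window(img, r, c, radius)
            threshold = sum(local) / len(local) - offset
            row.append(1 if img[r][c] > threshold else 0)
        out.append(row)
    return out

def dilate(binary, radius=1):
    """A pixel becomes 1 if any neighbour is 1: strokes grow, holes close."""
    return [[max(window(binary, r, c, radius)) for c in range(len(binary[0]))]
            for r in range(len(binary))]

def erode(binary, radius=1):
    """A pixel stays 1 only if all neighbours are 1: thin specks vanish."""
    return [[min(window(binary, r, c, radius)) for c in range(len(binary[0]))]
            for r in range(len(binary))]

# Dark left half (background 10, stroke 60), bright right half
# (background 100, stroke 200): no single global threshold separates both.
row = [10, 10, 60, 10, 100, 200, 100, 100]
uneven = [list(row) for _ in range(3)]
thresholded = adaptive_threshold(uneven)

# A stroke with a one-pixel hole in the middle, and a one-pixel speck.
holed = [[0, 0, 0, 0, 0],
         [0, 1, 1, 1, 0],
         [0, 1, 0, 1, 0],
         [0, 1, 1, 1, 0],
         [0, 0, 0, 0, 0]]
speck = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]

print(thresholded[1])       # [0, 0, 1, 0, 0, 1, 0, 0]
print(dilate(holed)[2][2])  # 1 (the hole is filled)
print(erode(speck))         # all zeros (the speck is gone)
```

Both strokes in `uneven` are found even though the left stroke (60) is darker than the right background (100), which is the benefit of a per-region threshold.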
Image after Gaussian Blur
Image after Adaptive Threshold
Image after Dilate and Erode
Segmentation
• Segmentation of the image is done using
contours in OpenCV.
• Contours
– A contour is simply a curve joining
all the continuous points along a boundary
that have the same color or intensity.
– Contours are a useful tool for shape analysis
and for object detection and recognition.
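The goal of segmentation is to get one box per digit. The sketch below reaches the same goal with a different, simpler technique than contour tracing: it flood-fills each connected group of foreground pixels and records its bounding box. It is a stand-in to illustrate the idea, not OpenCV's contour algorithm, and the tiny `page` image is made up.

```python
def bounding_boxes(binary):
    """Return (left, top, right, bottom) for each 8-connected blob of 1s."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if binary[r][c] != 1 or seen[r][c]:
                continue
            # Flood fill from this unvisited foreground pixel.
            stack = [(r, c)]
            seen[r][c] = True
            top, bottom, left, right = r, r, c, c
            while stack:
                y, x = stack.pop()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            stack.append((ny, nx))
            boxes.append((left, top, right, bottom))
    # Sort left to right so digits come out in reading order.
    return sorted(boxes)

# Two separate shapes on one tiny binary "page".
page = [
    [0, 1, 0, 0, 0, 1, 1],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 1],
]
boxes = bounding_boxes(page)
print(boxes)  # [(1, 0, 1, 2), (5, 0, 6, 2)]
```

Each box can then be cropped out and resized to 28x28 before being passed to the network.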
Image after contour extraction
Convolutional Neural Network
Architecture
This model’s architecture consists of three main parts: two convolutional
blocks and one fully connected neural network layer.
The inputs to this model are 28x28 images.
First Convolutional Block:
A 28x28 image is taken as input to this block. A padding of 2 units is added to
the image so that its dimensions are retained after convolution with
16 5x5 filters/kernels.
The convolution outputs a 16x28x28 volume, which is then passed to
a ReLU activation function followed by a MaxPool operation. ReLU activation
is used to introduce non-linearity.
This block outputs a 16x14x14 volume.
Second Convolutional Block:
The first step is again a convolution, this time on the 16x14x14 volume
with 32 5x5 kernels and a padding of 2 units, giving a 32x14x14
volume.
It is passed through a ReLU activation followed by a MaxPool
operation.
The second convolutional block outputs a 32x7x7 volume.
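The volume sizes quoted for the two blocks can be reproduced with the standard output-size formula, output = (input + 2 × padding − kernel) / stride + 1. The sketch below traces the shapes only; it is not a trainable model. A convolution stride of 1 and a 2x2 MaxPool with stride 2 are assumptions, inferred from the sizes halving from 28 to 14 to 7, and the function names are hypothetical.

```python
def conv_output_size(size, kernel, padding=0, stride=1):
    """Spatial size after a convolution along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1

def conv_block(shape, out_channels, kernel=5, padding=2, pool=2):
    """Return the (channels, height, width) after the conv and after the pool."""
    _, height, width = shape
    height = conv_output_size(height, kernel, padding)
    width = conv_output_size(width, kernel, padding)
    after_conv = (out_channels, height, width)
    # ReLU does not change the shape; an assumed 2x2 MaxPool halves each side.
    after_pool = (out_channels, height // pool, width // pool)
    return after_conv, after_pool

conv1, block1 = conv_block((1, 28, 28), out_channels=16)
conv2, block2 = conv_block(block1, out_channels=32)

print(conv1, block1)  # (16, 28, 28) (16, 14, 14)
print(conv2, block2)  # (32, 14, 14) (32, 7, 7)
```

With a 5x5 kernel, a padding of 2 gives (28 + 4 − 5) + 1 = 28, which is why the convolution keeps the image size.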
Fully Connected Neural Layer:
Here, a single layer of 10 nodes, one per digit class, is taken as
the fully connected layer.
Finally, the output of the fully connected layer is passed to a
softmax function to obtain the recognition result.
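The last step can be sketched as follows: the 32x7x7 volume is flattened to 32 × 7 × 7 = 1568 values, a fully connected layer turns them into 10 scores, and softmax converts the scores into probabilities. The weights, biases, and feature values below are placeholders chosen so the example has a clear winner; they are not trained values from the project.

```python
import math

def fully_connected(features, weights, biases):
    """One score per output node: weighted sum of all features plus a bias."""
    return [sum(f * w for f, w in zip(features, row)) + b
            for row, b in zip(weights, biases)]

def softmax(scores):
    """Turn scores into probabilities that sum to 1."""
    top = max(scores)  # subtracting the maximum avoids overflow in exp
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

num_features = 32 * 7 * 7  # flattened output of the second block
features = [0.0] * num_features                      # placeholder activations
weights = [[0.0] * num_features for _ in range(10)]  # placeholder weights
biases = [0.0] * 10
biases[7] = 2.0  # hypothetical value so that class 7 gets the top score

probabilities = softmax(fully_connected(features, weights, biases))
predicted_digit = probabilities.index(max(probabilities))
print(num_features, predicted_digit)  # 1568 7
```

The recognized digit is the index of the largest probability.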
Conclusion
• Handwritten digit recognition using a
convolutional neural network has proved to
be fairly accurate.
• It generally works better than other algorithms
for this task, including plain artificial neural networks.
THANK YOU
Rishabh Tyagi
(Maharaja Agrasen
Institute of Technology)
