Image recognition applications and dataset preparation - DevFest Baghdad 2018

Image Recognition
Applications & Dataset Preparation

Did you see this before ?
•Cover violence or
nudity images
facebook app
AppChief

•Suggests sharing
photos with
recognized facebook
friends
Moments app
AppChief

•Search for photo
content in iOS and
Android
Photos
AppChief

•Cutting out a person
from an image
Sticky app

What is image recognition ?
How to build my own app ?
Why I need it ?
Is the ability of software to identify objects, places, people, writing and
actions in images.
labeling the content of images with meta-tags, performing image content
search and guiding autonomous robots, self-driving cars and accident
avoidance systems…etc
Next slides

How to build my own app ?
TRAIN THE MODEL
PREPARE DATASET
BUILD AND RUN
…
…

What do you need ?
PREPARE DATASET
Classify image ? Detect multiple objects
inside image ?
or

Image recognition types
&
IMAGE
CLASSIFICATION
OBJECT
DETECTION
PREPARE DATASET

vs
OBJECT
DETECTION
input
output
Image Image
Class labelsClass label
OBJECT
DETECTION
INSTANCE
SEGMENTATION
types
+
bounding box
+
bounding boxes
+
segmentation
PREPARE DATASET
-
IMAGE
CLASSIFICATION
CLASSIFICATION
CLASSIFICATION
+ LOCALIZATION

vs
IMAGE
CLASSIFICATION
OBJECT
DETECTION
CLASSIFICATION
CLASSIFICATION
+ LOCALIZATION
OBJECT
DETECTION
INSTANCE
SEGMENTATION
types
PREPARE DATASET
Example

vs
IMAGE
CLASSIFICATION
OBJECT
DETECTION
CLASSIFICATION
CLASSIFICATION
+ LOCALIZATION
OBJECT
DETECTION
INSTANCE
SEGMENTATION
types
PREPARE DATASET
WE
WILL
CONTINUE
WITH
OBJECT
DETECTION

Let’s create a money reader model
STEP 1. Naming objects (Object labels)
PREPARE DATASET / OBJECT DETECTION
1 IQD_50000_ar
3 IQD_25000_ar
5 IQD_10000_ar
7 IQD_5000_ar
9 IQD_1000_ar
11 IQD_500_ar
13 IQD_250_ar
2 IQD_50000_en
4 IQD_25000_en
6 IQD_10000_en
8 IQD_5000_en
10 IQD_1000_en
12 IQD_500_en
14 IQD_250_en

STEP 2. Take photos for each money face AS MUCH AS YO CAN
For better accuracy take hundreds of photos with
• Different backgrounds

• Different positions

• Different light conditions

• Different orientations

• Different backgrounds

• Different positions

• Different light conditions

• Different orientations

2. Take about 300 photos for each money face

STEP 3. Labeling objects inside images
Label : IQD_50000_en
x : 6
y : 120
width : 150
heigh : 370
Objects :
Object #1

STEP 3. Labeling objects inside images
Label : IQD_50000_en
x : 90
y : 125
width : 313
heigh : 313
Objects :
Object #1

CAUTION : Some training libraries prefers diﬀerent coordinates system in labeling
X, Y, Width , Height midX, midY, Width , Height
minX, minY, maxX , maxY
It’s recommended to check the library needs you want to use for training before start labeling

How many photos do you think we need
for each label ?
10 ?
20 ?
50 ?
100 ?
200 ?
300 ?

Assuming 300 photos
is good for our model
let’s calculate time required
300 image x 14 Label x (5 sec) taking photo x (30 sec) labeling
175 hours !!! 7 Days !

We made a timesaving app
Only 1 hour

1 2 3Create labels Capture Generate

3 Transfer
Easy Dataset

Demo time
Easy Dataset

Money Reader - ‫العملة‬ ‫قارئ‬
Final live app
Final dataset
kaggle.com/husamaamer/iraqi-currency-
~1 GB
Iraqi Money ‫العراقية‬ ‫العملة‬
Object detection dataset for Iraqi currency

MODEL TRAINING
import turicreate as tc
import os
# Define all images annotations with bounding box details (I am showing only 1)
annotations = tc.SArray([
[{
“label”:”5000ar",
“type":"rectangle",
“coordinates”:{“y":188.5,"x":207,"width":304,"height":152}
}],
… , … …
])

MODEL TRAINING
import os
[{
}],
… , … …
])
# 1. Load images (Note: you can ignore 'Not a JPEG file' errors)
data = tc.image_analysis.load_images('mr_turi_ic', with_path=True)
data['label'] = data['path'].apply(lambda path: os.path.basename(os.path.dirname(path)))
data['annotations'] = tc.SArray(data=annotations, dtype=list)

MODEL TRAINING
import os
[{
}],
… , … …
])
# Make a train-test split
train_data, test_data = data.random_split(0.8)

MODEL TRAINING
import os
[{
}],
… , … …
])
# Create a model using Turi Create's object detector API
model = tc.object_detector.create(train_data, max_iterations=1000)
# Save the predictions to an SArray
predictions = model.predict(test_data)
# Evaluate the model and save the results into a dictionary
metrics = model.evaluate(test_data)
print('Precision' , metrics['mean_average_precision'])

MODEL TRAINING
import os
[{
}],
… , … …
])
# Create a model using Turi Create's object detector API
model = tc.object_detector.create(train_data, max_iterations=1000)
# Save the predictions to an SArray
predictions = model.predict(test_data)
# Evaluate the model and save the results into a dictionary
metrics = model.evaluate(test_data)
print('Precision' , metrics['mean_average_precision'])
# Save the model for later use in Turi Create
model.save(‘turi_ic.model')
# Export for use in Core ML file to the current directory
model.export_coreml('turi_ic.mlmodel')

Image recognition applications and dataset preparation - DevFest Baghdad 2018

More Related Content

What's hot (11)

Similar to Image recognition applications and dataset preparation - DevFest Baghdad 2018 (20)

Recently uploaded (20)

Image recognition applications and dataset preparation - DevFest Baghdad 2018