Webカメラ画像のリアルタイム識別 / Web Cam Image Recognition with Inception V3

Webカメラ画像のリアルタイム識別
Inception v3を動かす
開発倶楽部
Hideki Tsujimoto (moto2g)

内容
 Webカメラから取り込んだ画像をInception v3で識別する。
 学習済みモデルを使うので、高性能GPUが無くても大丈夫。

サンプルあるある
 Pythonやライブラリのバージョンアップにより、この内容のままでは動
かないかも…。今回も参考にしたサンプルを修正する必要があった。

Inception v3とは
 Googleが開発した画像識別の学習済みモデル
 ものすごい量の画像を学習しており、識別精度が非常に高い

実行したときの環境
 Windows 10 Home
 Webカメラ (Logicool C270m)
 NVIDIA GTX1080Ti
 Python 3.6.5
 Tensorflow-GPU 1.11.0
 Keras 2.2.4
 Open CV 3.4.3.18 … Webカメラ画像の処理用

環境構築 CUDA(GPUを使う場合だけ)
 GPUのドライバ、CUDA、cuDNNをインストールしておく。
※ Tensorflowが対応しているCUDA、cuDNNのバージョンを調べて
おくこと。
Software requirements
https://www.tensorflow.org/install/gpu

環境構築 Pythonライブラリ
> pip install tensorflow-gpu ※GPUを使う場合はこっち
> pip install tensorflow ※GPUを使わない場合はこっち
> pip install keras
> pip install opencv-python
> pip install pillow ※Python 3.x系ではPILの代わりにPillowを使う

Inception v3のサンプルソース取得
以下をクローン
https://github.com/fchollet/deep-learning-models
※"inception_v3.py"さえあればOK

動作確認用に識別する画像を配置
 識別したい画像を"inception_v3.py"と同じフォルダに
“elephant.jpg”という名前で配置する。
 象じゃなくても何の画像でも良い。
サンプルプログラムが“elephant.jpg”を読み込むようになっているの
で、ひとまずそのままにしているだけ。

サンプルプログラムの修正
下記の環境ではプログラム修正が必要だった。
Keras==2.2.4
Keras-Applications==1.0.6

inception_v3.py 35行目あたり
from keras_applications.imagenet_utils import decode_predictions
※こっちは変えない
from keras_applications.imagenet_utils import _obtain_input_shape

inception_v3.py 157行目あたり
# Determine proper input shape
input_shape = _obtain_input_shape(
input_shape,
default_size=299,
min_size=139,
data_format=K.image_data_format(),
require_flatten=include_top)

Webカメラ無しで動作確認
> python inception_v3.py
下記のように識別した結果が表示される。
Predicted: [[('n02504458', 'African_elephant', 0.58458555),
('n01871265', 'tusker', 0.35385665), ('n02504013',
'Indian_elephant', 0.01978532), ('n02134084', 'ice_bear',
0.0005529468), ('n02437312', 'Arabian_camel', 0.0005336157)]]

以降、"inception_v3.py"を"inception_v3_with_cam.py"とい
う名前でコピーしてそちらを修正する。

学習済みモデルをローカルに保存
 元のサンプルプログラムはモデルを毎回ダウンロードしている。
予めダウンロードしたファイルを読み込むようにプログラムを修正する。
 “inception_v3_with_cam.py”と同じ場所に”models”フォルダ
を作成し、下記からダウンロードしたものをその中に配置する。
https://github.com/fchollet/deep-learning-
models/releases/download/v0.5/inception_v3_weights_tf_dim_ordering_tf_kernels.h5
https://github.com/fchollet/deep-learning-
models/releases/download/v0.5/inception_v3_weights_tf_dim_ordering_tf_kernels_notop.h5
https://s3.amazonaws.com/deep-learning-models/image-models/imagenet_class_index.json

学習済みモデルをローカルに保存
 kerasのget_fileにmd5_hashを渡すと、モデルがキャッシュされて
いてもダウンロードしてチェックする。md5_hashを渡さなくする。
inception_v3_with_cam.py 378行目あたり、384行目あたり
if include_top:
weights_path = get_file(
'inception_v3_weights_tf_dim_ordering_tf_kernels.h5',
WEIGHTS_PATH,
cache_subdir='models',
md5_hash=None)
else:
weights_path = get_file(
'inception_v3_weights_tf_dim_ordering_tf_kernels_notop.h5',
WEIGHTS_PATH_NO_TOP,
cache_subdir='models',
md5_hash=None)

Webカメラの画像を読み込むように修正
 import cv2
 inception_v3.pyのmainをガッツリ変える。
スライドでは書ききれないので以下の内容を参照
https://github.com/moto2g/image_recognition

Webカメラ対応版の動作確認
> python inception_v3_with_cam.py
カメラ画像上に、最も可能性の高い識別結果とパーセンテージを表示する。
カメラ画像のウインドウをアクティブにして"q"キーで終了。

Webカメラ画像のリアルタイム識別 / Web Cam Image Recognition with Inception V3

More Related Content

What's hot (20)

Similar to Webカメラ画像のリアルタイム識別 / Web Cam Image Recognition with Inception V3 (20)

Webカメラ画像のリアルタイム識別 / Web Cam Image Recognition with Inception V3