More than 5 years have passed since last update.

Paper > link > Image Orientation Estimation with Convolutional Networks > 画像の傾き補正 / COCO dataset

Last updated at 2017-06-17Posted at 2017-02-24

Image Orientation Estimation with Convolutional Networks
Philipp Fischer et al.
http://lmb.informatik.uni-freiburg.de/Publications/2015/FDB15/image_orientation.pdf

画像の本来あるべき方向への補正をConvNet(CNN)を用いて学習するというものらしい。
realtimeでの補正に使えるとのこと。

学習データセット

学習に関しては3.1 Dataに記載がある。

Hence, we rather use the publicly available Microsoft COCO dataset [1] as training set and apply rotations artificially.

COCO dataset自体はノイズデータ（方向が不明なデータ）を除くと、本来あるべき方向の画像となっている。
それらの画像を意図的に回転させたものをinputに使って、outputに回転前画像を使うということのようだ。

回転の角度

3.2 Tasks and networksに記載がある。

±３０度、±４５度、３６０度回転などを考慮している。

networks

3.2 Tasks and networksに記載がある。

For all three tasks we built upon the AlexNet architecture from Krizhevsky et al. [8] implemented in Caffe [6] and pretrained on ImageNet. This architecture consists of 5 convolutional layers, followed by 3 fully connected layers. After each fully connected layer, a rectified linear unit is used as nonlinearity. Additionally normalization and dropout are applied. For more details see [8].

AlexNetの構造は以下にFigure　2として例示されている。
https://www.saagie.com/blog/object-detection-part1

You get articles that match your needs
You can efficiently read back useful information
You can use dark theme

What you can do with signing up