2024 dlib/face_recogonitionを始める時に知っておきたいこと

Posted at 2024-09-30

結論・コアの68点顔認識は商用利用できない

リリース直前に、調べて気づいて悲しい今日このごろです。

The license for this dataset excludes commercial use and Stefanos Zafeiriou, one of the creators of the dataset, asked me to include a note here saying that the trained model therefore can't be used in a commercial product. So you should contact a lawyer or talk to Imperial College London to find out if it's OK for you to use this model in a commercial product.

代案

どれも大変です。点を打って、hyperparameterをサーチするだけでいい。dlibが一番楽に思えてきます。

Mediapipe - https://huggingface.co/spaces/hysts/mediapipe-face-detection
OpenVINO - https://github.com/openvinotoolkit/openvino
Transformer DETR - https://huggingface.co/docs/transformers/main/ja/model_doc/detr
HRNet - https://github.com/HRNet/HRNet-Facial-Landmark-Detection
dlibで自分で作る

dlibについて

大雑把に３種類あります。

　まず、初期段階で、顔の範囲を取れないのは、デフォルトのHog顔 Detectionの問題で、Precitionをいくら訓練しても無駄です。顔の範囲取得に、dlib.simple_object_detectorを訓練するか、CNN版を使いましょう。

bool値を返す - recognition - 顔が存在するか判定
Rectを返す - detection - 顔の範囲を返す
Pointを返す - prediction - 顔のパーツの位置を返す

hyper-parameterです。

datasetの数と質で、最適な設定は異なります。
面白いことに、複雑にすればいいというわけじゃないです。

　一つ言えるのは、ランダムで多少変わる。split_testを1000ほどすると、時間かかるけど、かなり減らせる。

　一番いいのは、data数を増やす気がします。

You get articles that match your needs
You can efficiently read back useful information
You can use dark theme

What you can do with signing up