More than 3 years have passed since last update.

【3D Segmetation】PointNet : Point CloudでのSegmentaionを感覚的に理解してみる

Last updated at 2020-12-29Posted at 2020-10-09

PointNetってなんぞや？

![image.png](https://qiita-image-store.s3.ap-northeast-1.amazonaws.com/0/482094/439df62a-5a04-7299-ac19-97e1723c2fd2.png)

Point Cloudの3次元情報からClassification、Part Segmentation、Semantic Segmentation出来るPointNet。
2DのSemantic Segmentationをある程度知っていると理解しやすいかも。

アルゴリズム

![image.png](https://qiita-image-store.s3.ap-northeast-1.amazonaws.com/0/482094/3fd8cd12-9cf5-78e6-6cf8-0f24c0701466.png)

input transform

T-Netで3x3のRotation Matrixを推定し、掛ける事でPoint Cloud全体を回転させることが出来る。入力のPoint Cloudが回転しててもT-NetでRotation Matrixを求める事で、物体をいつも同じ姿勢(正面を向けるなど)に変換出来る。そうやって正規化するのね。

Feature transform

input transformの特徴量版。

max pool

point全体から特徴量の一番高い値だけを抽出する。

classificatiopn

図のglobal featureをMLPに入力すれば、入力Point Cloudがなんの物体かClassification出来るのね。

segmentation

global featureとpoint cloud1つ1つのlocal featureを結語してMLPで学習させれば、3DのSemantic segmentationが出来る。

結論

・適当な自分はFully Connectedを重ねれば良いんじゃねっと思っていたのだが、そんな単純ではないのね。・point cloudの順番によらなく、GlobalとLocalの情報を得られるようにする事がミソっぽい

参考文献

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation https://arxiv.org/pdf/1612.00593.pdf

You get articles that match your needs
You can efficiently read back useful information
You can use dark theme

What you can do with signing up