More than 1 year has passed since last update.

うおぉぉぉ！いつの間にかiOS形式に変換できるようになっていた消しゴムモデルLaMaをiPhoneで使う

Posted at 2023-12-27

あの消しゴム機能がついにiOS形式に変換できるようになった

以前はiOS形式にできなかったLaMaインペインティングモデル。
フーリエ変換か何かが変換できなかった。
CoreMLToolsのアップデートで変換できるようになったらしい。
というのは、変換スクリプトを提供してくれている人をGitHubで見つけた。

ありがとうマルマンさん。

変換方法

git clone https://github.com/mallman/CoreMLaMa.git
cd CoreMLaMa
pip install -r requirements.txt
python convert_lama.py

これだけでLaMa.mlpackageがゲットできる。

サイズも２００MBぐらいで、実行時間も2秒ぐらい（iPhone13pro）。
最高か。

ちなみに、変換スクリプトでは、LaMaのラッパーモデルを作って、出力を画像のピクセル値に非正規化している。

class CoreMLaMa(torch.nn.Module):
    def __init__(self, lama):
        super(CoreMLaMa, self).__init__()
        self.lama = lama

    def forward(self, image, mask):
        normalized_mask = ((mask > 0) * 1).byte()
        lama_out = self.lama(image, normalized_mask)
        output = torch.clamp(lama_out * 255, min=0, max=255)
        return output

うむ、勉強になる。

iOSで実行

let config = MLModelConfiguration()
let model = try LaMa(configuration: config)
let input = try! LaMaInput(imageWith: inputCGImage, maskWith: maskCGImage)
let out = try! model.prediction(input: input!)
let pixelBuffer = out.output