More than 1 year has passed since last update.

DatabricksでOpenDalleV1.1を動かしてみる

Last updated at 2024-05-26Posted at 2024-01-05

画像系生成AIにも食指を。ライセンスはNon-Commercial Personal Use License Agreementなのでご注意ください。

GPUクラスターを準備します。

ライブラリのインストール。

%pip install -U diffusers transformers accelerate
dbutils.library.restartPython()

from diffusers import AutoPipelineForText2Image
import torch
        
pipeline = AutoPipelineForText2Image.from_pretrained('dataautogpt3/OpenDalleV1.1', torch_dtype=torch.float16).to('cuda')        
image = pipeline('black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed').images[0]

image

おおー。

なお、画像はノートブックにレンダリングされるので確認が楽です。

別のプロンプトで。

image = pipeline('2 birds facing each other. The one is white crow. The another one is red love bird').images[0]
image

若干解釈が混じっているようですが、これはこれで面白い。

日本語プロンプトだとなかなか難しいです。

image = pipeline('賑わっている動物園').images[0]
image

賑わっている？動物園？

しかし、去年はMidJourneyなどを触って一喜一憂していた訳ですが、今では無料のモデルでこういうことができるようになったんだなとしみじみ感じる年始です。

Databricksクイックスタートガイド

Databricks無料トライアル

You get articles that match your needs
You can efficiently read back useful information
You can use dark theme

What you can do with signing up