[DL Reading Group] GLIDE: Guided Language to Image Diffusion for Generation and Editing
1. DEEP LEARNING JP
[DL Papers] GLIDE: Guided Language to Image Diffusion
for Generation and Editing
Xin Zhang, Matsuo Lab
http://deeplearning.jp/
2. Bibliographic Information
● Title:
○ GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided
Diffusion Models (arXiv)
● Authors: Alex Nichol, Prafulla Dhariwal, Aditya Ramesh et al. (OpenAI)
● 20 Dec 2021
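● Overview
○ A diffusion model that generates realistic images from text
○ An implementation that uses two kinds of conditioning methods and incorporates several additional techniques
○ Succeeded in generating high-quality images, and released a smaller version of the model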
20. Safety Considerations & Limitations
● Released a small model trained on a smaller, filtered dataset.
● Fails to capture certain prompts that describe highly unusual objects or
scenarios.