Models
| Creator | Model | Description | Access |
|---|---|---|---|
| OpenAI |
DALL.E V2
|
Hierarchical Text-Conditional Image Generation with CLIP Latents (Paper, Blog Post). |
API |
| Ludwig Maximilian University of Munich |
Stable-Diffusion V1
|
High-Resolution Image Synthesis with Latent Diffusion Models (Paper, Blog Post, Github). |
Open |
| Stability-AI |
Stable-Diffusion V2
|
High-Resolution Image Synthesis with Latent Diffusion Models ( Blog Post, Github). |
Open |
| University of California & Google |
Structure-Difussion
|
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis (Paper, Project Page, Github). |
Open |
| Tsinghua University |
CogView V2
|
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers (Paper, Github). |
Open |
| OpenAI |
Glide
|
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models (Paper, Github). |
Open |
| Technische Hochschule Ingolstadt |
Paella
|
Fast Text-Conditional Discrete Denoising on Vector-Quantized Latent Spaces (Paper, Github). |
Open |
| Community |
minDALL-E
|
minDALL-E (Github). |
Open |
| Community |
DALLEMini
|
DALLEMini (Github). |
Open |