FREN

#FF00AA


24 may 2022

I’m gonna refrain from retweeting this whole thread about a new text-to-image diffusion model (that appears to be better than Dall-E at photorealistic content) but TIL that none of these models can process “A horse riding an astronaut” correcly.

@_akhaliq

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

project page:

sota FID(7.27 on COCO), without ever training on COCO, human raters find Imagen samples to be on par with the COCO data itself in image-text alignment

Want to know when I post new content to my blog? It's a simple as registering for free to an RSS aggregator (Feedly, NewsBlur, Inoreader, …) and adding www.ff00aa.com to your feeds (or www.garoo.net if you want to subscribe to all my topics). We don't need newsletters, and we don't need Twitter; RSS still exists.

Legal information: This blog is hosted par OVH, 2 rue Kellermann, 59100 Roubaix, France, www.ovhcloud.com.

Personal data about this blog's readers are not used nor transmitted to third-parties. Comment authors can request their deletion by e-mail.

All contents © the author or quoted under fair use.