Text-to-Image AI That Can Actually Spell!? Meet DeepFloyd IF

If you've ever used Midjourney, Dall-E, Stable Diffusion, or another text-to-image generator, you'll know that words are a weakness: text in generated images (such as on signs) tends to come out as gibberish. DeepFloyd IF has started to solve that problem, and it's doing so as an open-source model.

Referenced in the video:
https://twitter.com/DeepFloydIF
https://twitter.com/EMostaque/status/1652295961404645376
https://stability.ai/blog/deepfloyd-if-text-to-image-model
https://twitter.com/hardmaru/status/1651822596844048385
https://the-decoder.com/deepfloyd-if-is-a-crazy-good-text-to-image-model-and-open-source/
https://wandb.ai/geekyrakshit/deepfloyd/reports/A-Gentle-Introduction-to-DeepFloydAI-s-New-Diffusion-Model-IF--VmlldzozNTY3Nzc4
https://twitter.com/javilopen/status/1652387049268297729
https://huggingface.co/DeepFloyd
https://twitter.com/DavidVorick/status/1652070967412129793

Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown
The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/

About the Podcast

A daily news analysis show on all things artificial intelligence. NLW looks at AI from multiple angles, from the explosion of creativity brought on by new tools like Midjourney and ChatGPT, to the potential disruptions to work and industries as we know them, to the great philosophical, ethical, and practical questions of advanced general intelligence, alignment, and x-risk.