
How hard is it really to spell?
In the age where humans live alongside autocorrect tools, some may think that the technology can do their things well. The thing is, they're wrong. Predictive text can make a lot of mistakes and sometimes, it takes more time to manually correct what's autocorrected, than typing everything in the first place.
Human still need to dance their fingers across their keyboards like poetry in motion.
The same goes with AI, as Large Language Models are still prone to mistakes, especially when it comes to spelling.
But with 'Imagen 4,' Google wants to change that.
Imagen 4, welcome to the @GeminiApp
Our latest and most capable image generation model is a big leap forward with better lifelike detail, better text output , and richer images with more nuanced colors and fine-grain details . Try it out starting today, and show us your… pic.twitter.com/haGo0IdHZF— Google (@Google) May 20, 2025
Google is rolling out a new image-generating AI model, Imagen 4, that the company claims delivers higher-quality results than its previous image generator, Imagen 3.
"Imagen 4 is [a] huge step forward in quality," said Josh Woodward, who leads Google’s Labs group, said during a press briefing.
Unveiled at Google I/O 2025, Imagen 4 is capable of rendering "fine details" like fabrics and water droplets. It can also generate extreme close-ups with richer colors, textures and gradients, like animal fur. It also masters diverse art styles, in which it can render images from photo realism and impressionism to abstract and illustration, generating images in a range of aspect ratios and up to 2K resolution.
In other words, Imagen 4 can handle images with sharper clarity, meaning that it can bring imagination to life faster than ever before.
All users have to do, is image something, and let Imagen 4 visualize it.
Get ready for Imagen 4 capable of creating richer images, with more nuanced colors, intricate details and superior typography.
Tap each photo below to see more. pic.twitter.com/W0vDYu4Z4R— Google DeepMind (@GoogleDeepMind) May 20, 2025
At this time, there’s no shortage of AI image generators out there, from ChatGPT to Google to Midjourney and so forth. Many of the big players in the industry are relatively sophisticated, customizable, and capable of creating high-quality AI artwork.
So what makes Imagen 4 stand out from the crowd?
What makes Imagen 4 unique, is its advanced spelling and typography capability.
You can change your image’s aspect ratio to 16:9, 9:16, or 2:3 just by asking Gemini in your prompt.
— Google Gemini App (@GeminiApp) May 20, 2025
Imagen 4 can create text with great clarity for comics, packaging and collectibles, and comes with improved spelling, longer text strings and new layouts and styles.
"We’ve also [paid] a lot of attention and fixes around how it generates text and topography, so it’s wonderful for creating slides or invitations, or any other thing where you might need to blend imagery and text," Woodward added.
Accessibility is another strong suit of Imagen 4. It's integrated into various Google platforms, including the Gemini app, Whisk, Vertex AI, and Workspace tools like Slides, Vids, and Docs, making it readily available for both individual creators and enterprise users.
You can also create comics , packaging , stylized stamps and more - all with improved spelling and new layouts. pic.twitter.com/snh15AuJdc
— Google DeepMind (@GoogleDeepMind) May 20, 2025
Google’s Imagen 4 is a major step forward in AI image generation, but it brings notable concerns.
Chief among them are privacy issues tied to Google’s data collection practices, ethical risks due to biased training data, and legal disputes over the use of copyrighted artwork without consent. While powerful, Imagen 4 must be approached with caution, balancing innovation with transparency, fairness, and respect for creators’ rights.
Regardless, Imagen 4 isn't just an incremental update; it's a substantial leap forward in AI-driven image generation.
By addressing long-standing challenges and enhancing creative flexibility, it empowers users to bring their visions to life with unprecedented clarity and speed.
Starting today, you can try Imagen 4 in @GeminiApp and Whisk, a @GoogleLabs experiment that brings text prompts and images together to visualize your ideas.
Find out more ↓ https://t.co/aLXqjbXO93 #GoogleIO— Google DeepMind (@GoogleDeepMind) May 20, 2025