Imagen 4 is now generally available
I have found Imagein to be a good general purpose editor and we use it to clean up bitmaps, and adjust black points and white points and curves on greyscale, so it is good for preparing B&W greyscale photographs for print to compensate for dot gain in halftone screens on laser printers. Its 'color separation' capability is rudimentary/first draft though and is ridiculously close to inverse RGB rather than CMYK. For good color seps we use Photoshop so I can control undercolor removal.
> the generally availability
One of the biggest corporations in the world and they can't re-read before posting a typo in the title.
Heads be shakin
Clicking on "Read the documentation" leads to a page that documents nothing about the latest Imagen models and only provides examples using Gemini 2.0 Flash.
I was going to nitpick the missing apostrophe in movie posters caption ("STARFALLS REVENGE") but its missing from the prompt, too.
I asked basically copilot the same and got a much better result lol
I guess it's kinda nicely genuine that the "four panel comic strip" has some errors in it (misunderstanding caption + cat high-fiving itself in the bonus fifth panel)
Looks so much better than the yellow tinted chatgpt output in my eyes
I am currently building an AI product which relies on Imagen 3 to generate a lot of photorealistic, cinematic or HDR images. I tried Imagen 4 during preview, but results were too "cartoonish". Did anyone else have the same experience?
>Image generation may not always trigger:
>The model may output text only. Try asking for image outputs explicitly (e.g. "generate an image", "provide images as you go along", "update the image").
>The model may stop generating partway through. Try again or try a different prompt.
Seriously?
Wasn't Imagen 4 released months ago?
Anyone know if this can be prompted with image to image?
The comments here are priceless. In less than five years time we have gone from "That's impossible" to "Meh, it doesn't solve P=NP if prompted.".
For those commenting in the latter category, it might be worthwhile to read a bit about the underlying technology and share your insights on why it does not deliver.
The webcomics is awful. It feels off, the characters look very fake, unsettling in the way they communicate. The prompt is shown bellow the image, but for me the result looks closer to a prompt "Create lifeless characters reciting marketing slop. They must fake an over exaggerated excitement but it should be clear they don't believe in what they're saying and have no souls".
Also, the prompt specifically ask "Panel 4 should show the cat and dog high-fiving" but the cat is high-fiving ... the cat. Personally I find this hallucinated plot twist good, it makes the ending a bit better. Although technically this is demonstrating a failure of the tool to follow the instructions from the prompt. Interesting choice of example for an official announcement.
As others have said, with so many errors, it's just more AI slop.
Does the world need yet another AI slop generator?
The way it totally disregards the many explicit instructions given in the "four panel" comic strip.