Much of the hype and fears around generative AI has been about text. But there have also been rapid and dramatic developments in systems that can generate images. In many cases, these share a similar structure to text-based generative AI, but they can also be much weirder — and lend themselves to some fun creative pursuits.
Image generators are trained on billions of images, which enable them to produce creations that were once the sole dominion of painters and artists. Sometimes experts can’t tell the difference between AI-created images and actual photographs (a circumstance that has fueled dangerous misinformation campaigns in addition to fun creations). And these tools are changing the way creative professionals do their jobs.
AI Photoshop
Recently, Adobe added a generative AI feature into a beta version of Photoshop, its iconic graphics software, and creators on social networks including TikTok and Instagram have been buzzing about it.
I have a fair amount of experience with Photoshop. When I tested the new feature, called “generative fill”, I was impressed with how quickly and competently the AI carried out tasks that would have taken me at least an hour to do on my own. In less than five minutes and with only a few clicks, I used the feature to remove objects, add objects and swap backgrounds.
(To experiment with these tools, start by signing up for a free trial of Adobe Creative Suite. Then, instal the new Adobe Photoshop beta, which includes generative fill.)
Once you have Photoshop beta installed, import a photo and try these tricks:
— To change a background, click the “object selection” icon (it has an arrow pointed at a box), then under the Select menu, click “inverse” to select the background.
Next click the “generative fill” box and type in a prompt — or leave it blank to let Photoshop come up with a new background concept for you.
— To remove objects, use the lasso tool. In a photo of my motorcycle, I wanted to erase a tractor behind a fence in the background. I traced around the tractor, and then I clicked the “generative fill” box and hit “generate” without entering a prompt. The software correctly removed the tractor and filled in the background while leaving the fence intact.
What I’ve done with generative fill is far less exciting than what others have been posting on social media. Lorenzo Green, who tweets about AI, posted a collage of famous album covers, including Michael Jackson’s Thriller and Adele’s 21 that were expanded with generative fill. The results were quite entertaining.
More image generators
Tools like DALL-E and Midjourney can create entirely new images in seconds. They work similarly to chatbots: you type in a text prompt — the more specific, the better.
To write a quality prompt, start with the medium you’d like to emulate, followed by the subject and any extra details. For example, typing “a photograph of a cat wearing a sweater in a brightly lit room” in the DALL-E prompt box will generate a photo just like that.
DALL-E, which is owned by Open AI, the maker of ChatGPT, was one of the first widely available AI image generators that was simple to use. For $15, you get 115 credits; one credit can be used to generate a set of four images.
Midjourney, another popular image generator, is a work in progress, so the user experience is not as polished. The service costs $10 a month, and entering prompts can be a little more complicated, because it requires joining a separate messaging app, Discord.
Nonetheless, the project can create high-quality, realistic images.
Whichever tool you use, bear in mind that the onus is on you to use this tech responsibly. Technologists warn that image generators can increase the spread of deepfakes and misinformation. But the tools can also be used in positive and constructive ways, such as making family photos look better and brainstorming artistic concepts.