DALL-E 2, Stable Diffusion, Midjourney: How do AI art generators work, and should artists fear them?

Matthew Ashe

30 December 2022 at 4:08 am·7-min read

Throughout human history, technological progress has made some workers obsolete while empowering others. Workers in industries such as transport and manufacturing have already been strongly impacted by advancements in automation and artificial intelligence.

Today, it's the creative sector that's on the line. Visual artists, designers, illustrators and many other creatives have watched the arrival of AI text-to-image generators with a mix of awe and apprehension.

This new technology has sparked debate around the role of AI in visual art and issues such as style appropriation. Its speed and efficiency have triggered fears of redundancy among some artists, while others have embraced it as an exciting new tool.

AI writing is here, and it’s worryingly good. Can writers and academia adapt?

What is an AI text-to-image generator?

An AI text-to-image generator is a software that creates an image from a user's text input, which is referred to as a prompt. These AI tools are trained on huge datasets of pairs of text and images.

DALL-E 2 and Midjourney have not yet made their datasets public. However, the popular open-source tool Stable Diffusion has been more transparent about what it trains its AI on.

“We did not go through the Internet and find the images ourselves. That is something that others have already done,” said Professor Björn Ommer, who heads the Computer Vision and Learning Group at Ludwig Maximilian University of Munich.

Ommer worked on the research underpinning Stable Diffusion.

Artists and human-like machines: Here are the zaniest robots of 2022

“There are now big data sets which have been scraped from the Internet, publicly available. And these we used, mainly the LAION datasets, which are out there, consisting of billions of images that we can train upon,” he told Euronews Next.

LAION is a non-profit organisation that collects image-text pairs on the Internet. It then organises them into datasets based on factors such as language, resolution, likelihood of having a watermark and predicted aesthetic score, such as the Aesthetic Visual Analysis (AVA) dataset which contains photographs that have been rated from 1 to 10.

LAION gets these image-text pairs from another non-profit organisation called Common Crawl. Common Crawl provides open access to its repository of web crawl data, to democratise access to web information. It does this by scraping billions of web pages monthly and releasing them as openly available datasets.

ChatGPT: Why the human-like AI chatbot suddenly has everyone talking

Training the AI

Once these datasets of image-text pairs are gathered and organised, the AI model is trained on them. The training process teaches the AI to make connections between the visual structure, composition and any discernible visual data within the image and how it relates to its accompanying text.

“So when this training then finally completes after lots and lots of time spent on training these models, you have a powerful model that makes the transition between text and images,” said Ommer.

The next step in the development of a text-to-image generator is called diffusion.

In this process, gaussian or “random” visual noise is incrementally added to an image, while the AI is trained on each iteration of the gradually more “noisy” image.

The process is then reversed and the AI is taught to construct, starting from random pixels, an image that is visually similar to the original training image.

“The end product of a thousand times adding a tiny bit of noise will look like you pulled the antenna cable from your TV set and (there’s) just static, just noise there – no signal left anymore,” Ommer explained.

Ai-Da makes history after becoming the first robot to be grilled by UK's House of Lords

The AI model is trained on billions of images in this way, going from an image to noise and then reversing the process each time.

After this stage of the training process, the AI can then begin to create, from noise, images that had never existed before.

In practice, this means that a user can now access a text-to-image generator, enter a text command into a simple text box, and the AI will generate an entirely new image based on the text input.

Each text-to-image AI has keywords that its users have discovered through trial and error. Keywords such as “digital art”, “4k” or “cinematic” can have a dramatic effect on the outcome, and users have shared online tips and tricks to generate art in a specific style. A typical prompt might read as “a digital illustration of an apple wearing a cowboy hat, 4k, detailed, trending in artstation”.

What is the Lensa app and why are artists worried about it?

Appropriation of art style

The ethics of AI text-to-image generators have been the subject of much debate. A key issue of concern has been the fact that these AIs can be trained on the work of real, living, working artists. This potentially allows anybody using these tools to create new work in these artists’ signature style.

“I think we're going to have to figure out either a way for artists to get compensated if their names or images come up in the datasets, or for them to just completely opt-out if they don't want to have anything to do with it,” video collage artist Erik Winkowski told Euronews Next.

We're going to have to figure out either a way for artists to get compensated if their names or images come up in the datasets, or for them to just completely opt-out

On the issue of stylistic appropriation for financial gain, he added that “if a brand campaign is obviously appropriated from a person's artwork, whether it was made with AI or otherwise, it's just not a good thing. And I hope that they'll be a public standing up against that”.

In November, the online art community Deviant Art announced that it would add its own AI text-to-image generation tool DreamUp to its website.

All of Deviant Arts users' artwork on the website would then be automatically available to train the AI.

However, within 24 hours of the announcement, facing strong pushback from its community, Deviant Art changed its policy. Instead, users would have to actively choose to opt in to train the AI.

Shutterstock, a stock image marketplace, now plans to integrate DALL-E’s text-to-image generator and compensate the creators whose work was used to train the AI.

Is AI changing the art world?

Unfair competition or powerful new tool?

At the 2022 Colorado state fair, Jason Allen’s AI-generated artwork ‘Théâtre D’opéra Spatial’ – which was created using Midjourney – won in the category of "emerging digital artists".

The award sparked much controversy and debate around the future of art. Amid the publicity, Allen launched a new company, AI Infinitum, which offers “luxury AI prints”.

Some artists are concerned about the speed and accuracy at which an AI text-to-image generator can create artwork. A tool like Stable Diffusion can, in a matter of seconds, create multiple artworks that would take artists hours or days to produce.

I've seen the goal of my research never wanting to replace human beings, human intelligence or the like

This has concerned some creatives who fear that their skills may be made obsolete by this technology.

“I've seen the goal of my research never wanting to replace human beings, human intelligence or the like,” Ommer told Euronews Next.

“I see Stable Diffusion much like a lot of other tools that we're seeing there, as just an enabling technology which enables the artist, the human being, the user utilising these tools to then do more or do the things that they were already doing better, but not replacing them from the best”.

Working alongside robots could contribute to burnout and fears over losing your job, new study finds

The next stage of AI art

AI text-to-image generators are continually being improved and some researchers and tech companies are developing the next stage of generative visual art.

Meta has released examples of its text-to-video AI currently in development, which can produce a video from a user's text input.

Meanwhile, Google has unveiled DreamFusion, a text-to-3D AI that builds upon the technology of text-to-image generators to generate 3D models without the need for datasets containing 3D assets.*

Meta unveils AI tool that creates GIF-like videos from text prompts

Some visual artists such as Winkowski have already started incorporating generative AI tools into their workflow and pushing the technology to create animated art.

In his recent short film titled ‘Leaving home’, Winkowski drew certain frames and allowed Stable Diffusion to generate the frames in between.

“It's almost like having a superpower as an artist, really,” he said.

“That's really exciting. And I think we're maybe going to be able to take on more ambitious projects than we ever thought possible”.

For more on this story, watch the video in the media player above.

OK! Magazine
King Charles' '4-word response' to Harry's request to meet during UK visit
Prince Harry has reportedly extended an olive branch to King Charles and asked if the pair can meet during his upcoming visit to the UK and he received a 'four-word response'
Yorkshire Live
Have a go hero dishes out life lesson after 'old man attacked in Keighley'
CCTV footage of old man 'being attacked in Yorkshire' goes viral online as have a go hero steps in
Birmingham Live
BBC Gardeners' World star Carol Klein announces shock health diagnosis and major surgery
The popular TV presenter has been presenting permanently on the show since 2005
Snopes
King Charles III's Funeral Plans Reportedly Updated, as He's 'Very Unwell.' Here's What We Found
Numerous rumors about the British monarch spread following his cancer diagnosis on Feb. 5, 2024.
Liverpool Echo
BBC Bargain Hunt expert brutally shut down after 'awful' admission
The buyers stepped in after the suggested from Phillip Serrell
Liverpool Echo
Melanie Sykes 'quit TV' after comment from MasterChef's Gregg Wallace
After decades on our screens, Melanie has vowed never to return to TV
Birmingham Live
ITV Coronation Street actress speaks out on husband's 'break-up' with co-star
Sally Carman-Duttine plays Abi Webster on the popular soap
Bristol Live
'Sad' Tesco shopper 'clears' supermarket shelves to make £1,000 profit
Sam, who has raked up an impressive 95,000 followers on TikTok, is one of many young Brits trying to make quick cash by reselling
Wales Online
Queuing Greggs customer cannot believe his eyes at man in shop's actions
'A man weaved through the line in front of me, heading to the sandwich section, and I thought nothing of this at first'
Liverpool Echo
Taxi spotted dropping off strangers 'at all hours of the day and night'
Taxi spotted dropping off strangers 'at all hours of the day and night'
Liverpool Echo
Urgent warning to gardeners over jail time and 'unlimited fine'
Gardeners could be slapped with a hefty fine or worse
OK! Magazine
Britain's Got Talent audience member exposes part of the show that's 'faked for viewers at home'
A member of the Britain’s Got Talent audience has commented on the long-running rumours that a part of the hit entertainment show is faked each week for TV viewers
Bristol Live
New rule leaves National Lottery winner who won £10,000 with nothing
Jennifer Gothard won £10,000 on a £3 "triple cashword" scratchcard in March - but is still waiting for her winnings due to a new rule introduced by operator Allwynn
The Northern Echo
I compared supermarket butters to Lurpak to see if it could be beaten - it was
I tested supermarket butter from Asda, Tesco, Aldi, and M&S against Lurpak and another branded butter to find out which was best.
The Telegraph
US shared ‘gobsmacking’ Covid lab leak file with UK
The US shared “gobsmacking” evidence with Britain at the height of the Covid pandemic suggesting a “high likelihood” that the virus had leaked from a Chinese lab, The Telegraph can reveal.
The Telegraph
Porsche driver ‘killed trying to swerve pothole’
A 74-year-old Porsche driver has died after swerving to avoid a pothole, police believe.
OK! Magazine
King Charles 'must sit down' with Meghan and Harry and say 'I'm sorry' in difficult conversation says Piers
TV presenter Piers Morgan has claimed King Charles should sit down with Prince Harry and Meghan Markle to discuss the pair's use of their Duke and Duchess of Sussex titles
Business Insider
Ukraine highlights Russia's 'line of hell.' Claim of dozens of tanks and military vehicles destroyed on one sector of the Donetsk front.
Fighting has intensified in the Donetsk region in recent months as Russia pushes to take more ground around Avdiivka.
The Telegraph
Beauty queen shot dead after octopus ceviche order led killers to restaurant
An Ecuadorian beauty queen was assassinated after her Instagram post of a plate of octopus ceviche led gunmen to the restaurant where she was dining.
Yorkshire Live
Kate McCann's life now from heartbreaking Madeleine traditions to new job
It's been 17 years since Madeleine McCann vanished during a family holiday in Portugal - and mum Kate McCann has never given up hope

Sunak suffers disastrous night - the big moments

DALL-E 2, Stable Diffusion, Midjourney: How do AI art generators work, and should artists fear them?

What is an AI text-to-image generator?

Training the AI

Appropriation of art style

Unfair competition or powerful new tool?

The next stage of AI art

Latest stories

King Charles' '4-word response' to Harry's request to meet during UK visit

Have a go hero dishes out life lesson after 'old man attacked in Keighley'

BBC Gardeners' World star Carol Klein announces shock health diagnosis and major surgery

King Charles III's Funeral Plans Reportedly Updated, as He's 'Very Unwell.' Here's What We Found

BBC Bargain Hunt expert brutally shut down after 'awful' admission

Melanie Sykes 'quit TV' after comment from MasterChef's Gregg Wallace

ITV Coronation Street actress speaks out on husband's 'break-up' with co-star

'Sad' Tesco shopper 'clears' supermarket shelves to make £1,000 profit

Queuing Greggs customer cannot believe his eyes at man in shop's actions

Taxi spotted dropping off strangers 'at all hours of the day and night'

Urgent warning to gardeners over jail time and 'unlimited fine'

Britain's Got Talent audience member exposes part of the show that's 'faked for viewers at home'

New rule leaves National Lottery winner who won £10,000 with nothing

I compared supermarket butters to Lurpak to see if it could be beaten - it was

US shared ‘gobsmacking’ Covid lab leak file with UK

Porsche driver ‘killed trying to swerve pothole’

King Charles 'must sit down' with Meghan and Harry and say 'I'm sorry' in difficult conversation says Piers

Ukraine highlights Russia's 'line of hell.' Claim of dozens of tanks and military vehicles destroyed on one sector of the Donetsk front.

Beauty queen shot dead after octopus ceviche order led killers to restaurant

Kate McCann's life now from heartbreaking Madeleine traditions to new job