OpenAI's New AI Model Draws Images From Text (axios.com) 24
The machine learning company OpenAI is developing models that improve computer vision and can produce original images from a text prompt. From a report: The new models are the latest steps in ongoing efforts to create machine learning systems that exhibit elements of general intelligence, while performing tasks that are actually useful in the real world -- without breaking the bank on computing power. OpenAI this week is announcing two new systems that attempt to do for images what its landmark GPT-3 model did last year for text generation. DALL-E is a neural network that can "take any text and make an image out of it," says Ilya Sutskever, OpenAI co-founder and chief scientist. That includes concepts it would never have encountered in training, like the drawing of an anthropomorphic daikon radish walking a dog. DALL-E operates somewhat similarly to GPT-3, the huge transformer model that can generate original passages of text based on a short prompt. CLIP, the other new neural network, "can take any set of visual categories and instantly create very strong and reliable visually classifiable text descriptions," says Sutskever, improving on existing computer vision techniques with less training and expensive computational power.
Hmmm (Score:3)
"take any text and make an image out of it"
How can I try this out?
"Halle Berry, Camilla Cabello and me in bed"
Re: (Score:2)
Division by zero. Length processing error while drawing Ray's romantic-category anatomy.
Re: (Score:2)
Re: (Score:3)
I tried it.
Didn't work.
The AI needs to know where Halle Berry and Camilla Cabello were while you were in bed.
AI creap-outs will increase (Score:1)
The first can of Skynet Soup looks absolutely disgusting.
Re: (Score:1)
Re: (Score:1)
Maybe it was, but trained on 4chan content.
Re: (Score:2)
Yes, but it can form a liquid metal spike and open itself.
Then destroy humankind.
Full circle (Score:3)
DALL-E is a neural network that can "take any text and make an image out of it,...CLIP, the other new neural network, "can take any set of visual categories and instantly create very strong and reliable visually classifiable text descriptions,"
I only have two questions:
Can DALL-E rebuild CLIP's input from CLIP's output?
Can CLIP rebuild DALL-E's input from DALL-E's output?
Artist (Score:2)
Vincent Van Deep GOgh.
100% pun intended.
How many words? (Score:1)
Does it take 1000 words to generate an image?
Re: (Score:1)
That whooshing sound you may hear is the joke going over your head.
For the Police? (Score:2)
So they can fire all the sketch-artists?
Been using this for 27 years. (Score:1)
Re: Been using this for 27 years. (Score:2)
But it always says "Syntax error near 'sucking my'."
Never again (Score:1)
Porn anyone? (Score:2)
Come on, imagine extending it to video, and the giving it a Siri interface.
Cease and Desist (Score:2)
Wonderful news! (Score:2)
I've been waiting years for this: Alexa, draw me a nice cock and balls!
Re: (Score:2)
It drew two politicians.
Can it draw: (Score:4, Funny)
Goat ... something ?
Re: (Score:1)
On behalf of 90% of slashdotters, Nnnoooo!