Algorithm Seamlessly Patches Holes In Images 198
Beetle B. writes in with research from Carnegie Mellon demonstrating a new way to replace arbitrarily shaped blank areas in an image with portions of images from a huge catalog in a totally seamless manner. From the abstract: "In this paper we present a new image completion algorithm powered by a huge database of photographs gathered from the Web. The algorithm patches up holes in images by finding similar image regions in the database that are not only seamless but also semantically valid. Our chief insight is that while the space of images is effectively infinite, the space of semantically differentiable scenes is actually not that large. For many image completion tasks we are able to find similar scenes which contain image fragments that will convincingly complete the image. Our algorithm is entirely data-driven, requiring no annotations or labelling by the user."
Finally... (Score:5, Funny)
Re:Finally... (Score:4, Insightful)
Where are you going to find such images?
Re: (Score:3, Funny)
Re: (Score:3, Funny)
I got your content right here!
What does "semantically" mean? (Score:4, Interesting)
Our chief insight is that while the space of images is effectively infinite, the space of semantically differentiable scenes is actually not that large. For many image completion tasks we are able to find similar scenes which contain image fragments that will convincingly complete the image. Our algorithm is entirely data-driven, requiring no annotations or labelling by the user.
What are the "semantics" here?
Is this like google images, where the nearby HTML text determines the classification of the image [i.e ASCII-text as meta-data for images]?
Or is this a great big neural net of wavelet data which classifies the images mathematically?
PS: I have the same question about that infamous Photosynth/Sea Dragon demonstration:
http://www.ted.com/index.php/talks/view/id/129 [ted.com]
Re: (Score:3, Informative)
Effectively that means that just as you personally can recognize a bunch of pictures as all being "japanese porn" or all being "pictures of boats", or all being "pictures of men in suits", so can the computer.
And that number of different categories that humans take pictures of is not that large, probably less than 200,000 different categorizable subjects.
So with 1 million pictures, you have 5 of any category and can
Re: (Score:2)
Don't know for sure I didn't RTFA, but:
1) Human categories are far more than 200000, and even if you reduce the number to a manageable level, they have no graphical definition (suppouse the 'face' category, how could you clasify a cubist picture of a 'face'?), and
2) The fact that they claim not to use annotations, probably means they are using clustering techniques to detect image groups with a high graphical intersimilarity factor, that's is, the class is not a concept but a bunch of similar pictures, I
Re: (Score:2)
Re: (Score:3, Insightful)
Re: (Score:2)
What are the "semantics" here? Is this like google images, where the nearby HTML text determines the classification of the image [i.e ASCII-text as meta-data for images]?
No. The matches are done in a purely data-driven manner, meaning by analyzing lots of images and guessing matches. Meta-data appears not to have been used.
Or is this a great big neural net of wavelet data which classifies the images mathematically?
Probably a lot closer to the truth.
Another paragraph gives a clue to how they're usi
Re: (Score:2, Informative)
Re: (Score:2)
You just set up some sort of heuristic mechanism to ascertain common elements in picture A and then scan through the set for matches ranking by number of hits.
It's a lot like a radix trie in that it relies on partially similar nodes/pictures to w
Re:Finally... (Score:5, Interesting)
More seriously, I can see this applied to "fixing" pictures of just about anyone you want to see naked.
Fake celeb slips will of course come first, but why stop there? That cute girl at the coffee shop? Snap her with the camera phone, erase all those pesky clothes, and let this algorithm do its thing.
Of course, I could also see this used for more nefarious (even "sick") purposes... Ex-GF cheated and you don't have any nude pics to release to the web? You do now. And if you "repaired" a fully-clothed original of someone underage, would it still count as child porn?
And I don't even want to think about how the furries would use this... Ugh.
Re:Finally... (Score:5, Funny)
You take a picture of that cute girl at the coffee shop. Snap her with the camera phone, erase all those pesky clothes, and let the algorithm do its thing.
You wait for the algorithm to finish, it says "Done", you get all excited and click the button to see the result, and.... * DOH *, it put all her clothes back on, albeit a different color and style.
Re: (Score:2)
Re: (Score:2, Insightful)
Re: (Score:2)
Will it be as smart of MS Word? (Score:5, Funny)
Re: (Score:2)
Re:Will it be as smart of MS Word? (Score:5, Funny)
Re: (Score:2)
If anyone from fchan reads this. My eyes, my precious, pure, virgin eyes!
Re:Will it be as smart of MS Word? (Score:4, Funny)
Wow! (Score:2)
And whoever thought we'd be glad to have clippy back?
Re:Finally... (Score:4, Funny)
You get photos of the celebrities, wearing japanese clothing!
I attended the talk... (Score:2)
It did work quite well, however, offering a choice of different image completions.
Re:Finally... (Score:4, Funny)
?
Re: (Score:2)
Re: (Score:2)
I thought the biker one was fake, but apparently its for real - one of my coworkers said he had seen a documentary, and the guy actually survived being thrown off his motorcycle and getting impaled ...
Still, if you DO come across some good pics, I'll put them in the contest.
Re: (Score:2, Funny)
Immediate Application (Score:5, Interesting)
Re:Immediate Application (Score:5, Insightful)
If, on the other hand, you were a movie producer and needed to get rid of the frame change holes after loosing the master print of a film, you perhaps would be able to use such a program to mend those holes in the upper corner.
Re: (Score:2)
For what you are trying to do, a better idea would be to copy data from the previous and next frames. That's what they do when they "digitally remaster" old films to get rid of scratche
w00t! (Score:5, Funny)
Dead (Score:5, Informative)
BBC News coverage of the story is here: http://news.bbc.co.uk/1/hi/technology/6936444.stm [bbc.co.uk]
Now we need a patch for dead servers (Score:5, Funny)
coral cache (Score:2)
http://graphics.cs.cmu.edu.nyud.net/projects/scen
Re: (Score:2)
Slashdot search feature (Score:2)
In many cases, it's a dupe of the current article.
Re: (Score:2)
Re: (Score:2, Funny)
My bet is on Nicholas Cage or John Travolta.
Re: (Score:2)
Damn, that was a bad movie.
ehhh.... (Score:4, Interesting)
GREYCstoration (Score:5, Interesting)
http://www.greyc.ensicaen.fr/~dtschump/greycstora
It's pretty impressive:
http://www.greyc.ensicaen.fr/~dtschump/greycstora
and works with the gimp.
Re: (Score:2)
This one looks bad...
http://www.greyc.ensicaen.fr/~dtschump/greycstorat ion/img/res_parrot.png [ensicaen.fr]
This one you can mouse-over and have the image change on you. Again, not that impressed.
http://www.greyc.ensicaen.fr/~dtschump/greycstorat ion/img/res_claudia16.html [ensicaen.fr]
Looks very blurry.
This one is actually okay, but its because its an owl. I'm sure all the blurring is there, its just hard to notice.
The eye's pupil isn't as rounded as it could have been.
http://ww [ensicaen.fr]
Re: (Score:3, Insightful)
Re: (Score:2)
Yup, too cynical (Score:2)
What will it do if... (Score:5, Funny)
Re: (Score:2, Funny)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
DO NOT use it on your Porn-Collection! (Score:5, Funny)
Image compression? (Score:5, Insightful)
I may have to actually RTFA this time.
Re:Image compression? (Score:5, Informative)
That only works if your patch addressing space takes less space than the bits you're replacing - and of course when you reload the image, you'll still get say a cat instead of an iguana in that window...
Also, how about replacing with patches of higher resolution than the original? I realize it would all be technically lossy as hell, but the compression artifacts should not be very noticable to the human eye, right?
I'm not sure you really understand the concepts here. Replacing a patch with a higher res would be possible (but you'd have to resample the image first, basically) - and would either be incredibly lossy or perfectly unlossy, depending on your viewpoint.
From a compression standpoint there's no reason to consider a high res replace as more lossy as anything else. From a recognition standpoint, whether you're doing it high res or not, this would be a method that throws out image details for others... but that doesn't have anything to do with the resolution. So this is a lossy image manipulation, but not really a compression...
And of course, none of that would cause any compression artifacts, so yeah the human eye wouldn't notice (assuming this software works as claimed)
So to go back over the concepts:
Lossy - a compression or manipulation to an image or other digital file from which you cannot reconstruct the original bits perfectly
Compression Artifact - a noticeable image tearing or other visual defect allowing one to differentiate between a lossy-compressed file and it's original
Additionally, how about using this for movie compression? Filling in based on info from previous and next frame.
That's how movie compression came about. The first moving-file format that was widely available was animated GIF - which quickly got onto the trick of using transparent pixels for non-changing parts of a scene.
MPEG (1) one upped it; one part of the spec specifies which blocks are to be sent in each frame; you can leave out any blocks you don't want... (they also smartly seperated the chrominance and luminance channels, and subsampled the chrominance channel - not only is it a smart compression as the human eye perceives luminosity at greater fidelity than chorminance, but it also ups the chances that you don't have to transmit some blocks)
Fast Forward to MPEG-4 (non-H.263) - same basic block structure, same ability to not draw blocks, and now you can even specify offsets for blocks - you have probably heard this technology referred to as motion compression - basically if something is moving on the screen but remains relatively the same pixel values regardless of motion, the movie file will record the motion without recording every pixel - the difference between a good MP4 compressor and a bad compressor mostly has to do with how well it identifies candidates for motion compression, is my understanding...
Re: (Score:2)
From a compression standpoint there's no reason to consider a high res replace as more lossy as anything else. From a recognition standpoint, whether you're doing it high res or not, this would be a method that throws out image details for others... but th
Don't (Score:2)
...I may have to actually RTFA this time.
Don't you do it! If you do, you can hand in your /. card, tazer, and key to the clubhouse, 'cause you're out of the club.
Next thing you know, you'll be reading the flippin' article and posting insightful things that the rest of us, who spend our 9-5 journey together every day, will have no way to counter unless we start to read the freakin' articles! This will have an impact beyond what you realize. For the good of the greater, don't RTFA!
Re: (Score:2)
They could just make Speed 2 a reference to Speed and be done with it, for example.
Product Placement (Score:2)
Cut out a car, and the algorithm will replace the blank spot with a car. Not necessarily the same car, but a car of about the same size, etc.
So for location shooting, you no longer need to go through extensive "scene sanitation" ahead of the shoot. Take select frames of the footage, point out what you want to snip, and let software propagate the snips through the rest of the footage. Then let this new stuff fill those holes with "approved product placement" items that fit the scene. Yan
Re: (Score:2)
pfft (Score:5, Funny)
Re: (Score:2, Interesting)
The film 'Blow Up' Starring David Hemmings was made in the 1960's (AFAICR, As far as I can remember...) It used this as the core of the film plot.
Now, as a photographer I am not overly worried by this until these doctored images find their way onto sites like Wikipedia and people start using them as the 'Real Thing'.
There are laws in some countries that stop advertisers using 'doctored' images in things like holiday Brochures. Once we start ge
oh, come on (Score:2)
doesn't generate new info (Score:4, Insightful)
Re: (Score:2)
Re: (Score:2)
So basically, most of us could only fix images missing pink/beige* areas.
I'll still take it.
* - Nothing racist intended; most online porn quite simply features caucasian or light-skinned asian models, like it or not.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
computer: What is this a picture of?
User: rabid chinchillas and celebrity porn
hilarity ensues
Dear entrants of the Fark Photoshop contest: (Score:4, Funny)
Yours,
some code and a database
Re: (Score:2)
Howabout... (Score:2, Interesting)
Re: (Score:3, Funny)
Re: (Score:2)
Replaced with (Score:3, Funny)
Not "Arbitrary" (Score:4, Informative)
Similar to Microsoft's Photosynth (Score:2, Interesting)
Re: (Score:2)
New problem-solving paradigm? (Score:5, Interesting)
This is already happening informally in the personal sphere, because of things like Google, recently amplified by the iPhone and its inevitable successors in the ubiquitous rapid-access web-tool field. As they say, these days, if you have a web browser, you hardly have to wonder about anything anymore.
Of course, problem solving by search isn't exactly a new paradigm, but it could be a newly-cheap paradigm.
Re: (Score:2)
I wonder if this is part of the beginning of a new, computationally-driven problem-solving paradigm. As more and more data is stored, and if search algorithms become more and more clever, the cost of "looking up" (computationally speaking) the answer to a problem might be lower than the cost of "remembering" (using local storage) or "figuring out" (using local CPU power) the answer.
No, it's part of an existing, computationally-driven problem solving field that has existed for decades.
And don't refer to a
Re: (Score:2)
conference spamming time (Score:2)
Now there's an algorithm to fill in the blind spot (Score:2)
Re: (Score:2)
you get if you unwisely follow the instructions on the other current front-page article about building your own high powered laser? That's uncanny.
Actually, your eyes already come pre-equipped with exactly such an algorithm [brynmawr.edu] (needed for the naturally occurring blind spot at the space where the optical nerve is attached to the eyes). And apparently, it happily works for the extra, laser-induced blind spots as well.
Browse around the site. Not only does it fill in uniform background color (easy), but also more complicated pattern (lines going through the blind spot), and even autocompletes repeating pattern (field of red circles).
Is this how the brain fills in the blind spot? (Score:5, Interesting)
For those who don't know: each eye has a surprisingly large blind spot at the place where the optic nerve enters the eye. At reading distance, in the right eye, it's about four or five inches to the right of the spot at which you are gaving, and many textbooks and "fun with optical illusions"-type books will have a diagram... like the one on this web page... [brynmawr.edu] and directions for finding it. The blind spot is much larger than the dot on that web page, incidentally. If you explore, you'll find that... at the distance at which the dot disappears... the blind spot is nearly an inch wide and an inch-and-a-half high.
Even allowing for the fact that each eye has the blind spot in a different place so they fill in for each other, once you discover how big the blind spot is... and how relatively close to your position of gaze it is... you'll be astonished that almost nobody notices it until it is pointed out.
The brain does something more or less like filling in the blind spot. I say "more or less like" because it is very hard to answer the question "what do you see in the blind spot." For example, if you hold a computer keyboard at the right distance so that you're looking at the "G" key and the "K" key is in your blind spot, what do you see? Certainly not a black spot, certainly not a white spot, certainly not a "hole" or emptiness. Probably you have an impression of computer keys. Do you see a letter K? Certainly not, yet somehow you don't see a blank key, either.
Incidentally, I used to suffer from migraine headaches, and one of the symptoms for some people is the formation of blind spots which can be even larger than the "normal" blind spot, and can appear in central vision. One one memorable occasion, I was looking at the cover of a hardbound book, and I can tell you that when I looked at the title, my perception was the stamped, printed title disappeared, yet I would have sworn in a court of law that I still saw the cloth texture extending across the blind spot.
Although he does not specifically refer to it as a migraine illusion, I believe Lewis Carroll was known to be a migraineur, and in Chapter V of Through the Looking-Glass, "Wool and Water," Alice notices that "The shop seemed to be full of all manner of curious things -- but the oddest part of it all was that, whenever she looked hard at any shelf, to make out exactly what it had on it, that particular shelf was always quite, empty, though the others round it were crowded as full as they could hold." Any migraineur who experiences central blind spots will recognize this description.
Hays and Efros' system--relatively-simple algorithm operating on a large database of previously-seen images--seems to me to be the sorta-kinda way in which one could imagine the brain working.
I wonder if there's any way to test this?
Re: (Score:2)
It doesn't consult a big archive or do "semantic" matching or anything tricky.
Huge database of images from WHERE? (Score:2, Interesting)
It would not seem to me to fall under fair use...
MadCow.
Massive increase in productivity at Minitrue (Score:2)
IGNORANCE IS STRENGTH
Grassy Knoll (Score:2, Interesting)
In other news... (Score:2)
Preliminary tests... (Score:2)
Re: (Score:2)
Great for Restoring Old Movies (Score:2)
Can I load this algorithm into my brain? (Score:2)
Re: (Score:2)
Re: (Score:2)
Everyday science (Score:3, Informative)
Re: (Score:3, Insightful)
I agreeth with your merriment regaurding such tinkery. I myself saweth the play "InfoTrek" about the 21st century, in which a character, upon wishing to lern of a song he herd a few worrdes fromme, simple provided those words to his "computer" and shortly thereafter, generated an image sequence of a bande performing that song! I neerly interrupted the play with my laughter!
Re: (Score:2)
Re: (Score:2)