Slashdot Log In
An Advance In Image Recognition Software
Posted by
kdawson
on Sat May 24, 2008 07:45 PM
from the needle-in-a-haystack dept.
from the needle-in-a-haystack dept.
Roland Piquepaille alerts us to work by US and Israeli researchers who have developed software that can identify the subject of an image characterized using only 256 to 1024 bits of data. The researchers said this "could lead to great advances in the automated identification of online images and, ultimately, provide a basis for computers to see like humans do." As an example, they've picked up about 13 million images from the Web and stored them in a searchable database of just 600 MB, making it possible to search for similar pictures through millions of images in less than a second on a typical PC. The lead researcher, MIT's Antonio Torralba, will be presenting the research next month at a conference on Computer Vision and Pattern Recognition.
Related Stories
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
There goes the neighborhood (Score:4, Funny)
Re: (Score:2)
Re:There goes the neighborhood (Score:5, Insightful)
Parent
Re:There goes the neighborhood (Score:5, Funny)
Parent
Re:There goes the neighborhood (Score:5, Informative)
Parent
Re: (Score:2)
tests (Score:1, Funny)
Like every other "advance" in image recognition... (Score:3, Insightful)
Until then, it's snake oil, as far as I'm concerned.
Re:Like every other "advance" in image recognition (Score:5, Funny)
No wonder those snakes are not only so quiet, but I never even see 'em coming!
Geez. We don't stand a chance.
Parent
Re: (Score:1)
Re:Like every other "advance" in image recognition (Score:5, Funny)
Parent
Re:Like every other "advance" in image recognition (Score:3, Informative)
From what I can tell, it's basically, "blur the image down to only a few hundred pixels and then you have less data to comb through!"
Re:Like every other "advance" in image recognition (Score:3, Interesting)
If they are claiming to have a general image recognition algorithm that'd be
Re:Like every other "advance" in image recognition (Score:5, Informative)
http://people.csail.mit.edu/torralba/tinyimages/ [mit.edu]
Parent
Another Roland Piquepaille story on Slashdot (Score:2)
Oh thats really simple to do... (Score:2)
Of course it helps if you read the papers... (Score:4, Informative)
I just finished reading "Small Codes and Large Image Databases for Recognition" written by the guy. All he did was implemented Geoff Hinton's idea of databasing images with a binarized coefficients produced by Restricted Boltzmann Machines.
Hinton himself gave a talk on it for Google here:
http://www.youtube.com/watch?v=AyzOUbkUf3M [youtube.com]
Actually I'm wondering, is he plagiarizing Hinton?
Re: (Score:3, Insightful)
Re:Of course it helps if you read the papers... (Score:4, Insightful)
This research involves the use of one of the largest image databases seen in computer vision. It shows that you can do extremely rapid scene matching for databases of this scale. No, that's not obvious no matter what you think. This image data is fairly high dimensional.
This research says something about the space of likely scenes and it might be a key enabling technology to a lot of the heavily data driven computer vision and computer graphics approaches popping up lately.
Parent
Re: (Score:2)
It was a skim to see what the hell the article was really about, didn't know these two were connected. I jumped the gun 'cause I got burned by a plagiarizer in the past, sorry.
Oh really? (Score:2)
I guess nobody there thought to do the math before making these claims. This story probably shouldn't have made it to the front page; it's less than useful.
Re: (Score:3, Insightful)
No one said they were going to identify individual people with this. The main gist of this research seems to be efficiency (in both space and time, if I read it correctly). For instance, if one wanted to identify every face in a picture of a crowd, they could apply this algorithm to a low-res version of the image to quickly find the locations of every "face," and then use a more advanced face recognition algorithm t
Re: (Score:1)
not so much an advance (Score:2)
I can see it now (Score:1)
Re: (Score:2)
Get off my lawn, you young whippersnapper!
But can it tell the difference between.... (Score:2)
How will spammers make use of this? Well just make that viagra pill be reflected in a coke bottle.
anyone for a random bit generator to see what random results gets labeled?
Wonder what fractals might produce?
moral of the question: we can always break what we make.
Search Jenna Jameson? (Score:5, Funny)
i'm a little concerned about the licensing. (Score:1)
for the record, i say this as a concerned/curious artist who isn't looking for a payout.
Re: (Score:1)
Link to the source code (Score:2)
As I understood it, it's not for sale, you can get it at his MIT website [mit.edu]
Scary. (Score:2)
Very cool stuff... (Score:2, Interesting)
That is, there are a number of image similarity algorithms, but the computed values of two similar images are not necessarily mathematically near to each other. This algorithm produces values that are, which can make searching for similar images among very many images, quite fast.
Re:Very cool stuff... (Score:4, Informative)
A reasonable descriptor which produces distances that seem somewhat correlated with human perception would indeed be Antonio Torralba and Aude Oliva's gist descriptor.
http://people.csail.mit.edu/torralba/code/spatialenvelope/ [mit.edu]
It's become quite popular in computer vision and computer graphics for scene matching.
Parent
Hmmm.... (Score:1)
While I don't care about most uses about my images (go a head and PS a penis in my mouth) but I would fight it if I found out it was used for this.
Think it through (Score:2)
Thats not meant to disparage the work - image recognition is important and difficult. This particular 'advance' just isn't that 'advancing'
Re: (Score:1)
Re: (Score:2)
It still isn't big enough to do even a half decent job.
Imagine that a bit has a coherent meaning, such as "image contains a kitten." If the bit is zero, there is no kitten in the image; if it is one, there is at least one kitten in the image.
Now imagine that system extended for all types of animals. Going to go past a requirement for 1024 bits pretty fast, isn't it?
Now imagine that a bit represents "image contains a keyboard" in the same fashion.
Now imagine a bit for every type of macro and mic
Re: (Score:2)
You should imagine it because you are conflating the number of bits required to count things with the number of bits required to discriminate among things. The two are entirely disjoint.
I forsee nefarious law enforcement uses (Score:2)
Re: (Score:3, Insightful)
Some of those scientists are actually pretty smrt.
Re: (Score:2)
Re:I forsee nefarious law enforcement uses (Score:5, Informative)
Another way is to somehow identify the orientation. An simple way to do that is to find the axis along which there's maximum variation and rotate until those axes match in both images.
Pixel by pixel co-registration basically does look at a similarity measure for a lot of variations on the affine transform. You generally don't have to look at them all though: you use an iterative algorithm with a clever optimization strategy so your transform gets better and better instead of searching through the parameter space randomly.
Parent
Typical PC (Score:2)
Of course that typical PC is a dual quad-core machine running at 3GHz with 8GB of memory, GPU X3 running offloading co-processing software, and 1TB of hard drive space.
Opensource project to identify similar images (Score:2)
It's also designed to quickly find similar images, even out of millions of images. The documentation describes a possible indexation technique (as suggested in the original paper):
http://download.pureftpd.org/pub/pure-ftpd/misc/libpuzzle/doc/README [pureftpd.org]
Images are stored as 544-bits signatures by default.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:1)
Re: (Score:2)