An Advance In Image Recognition Software 81
Roland Piquepaille alerts us to work by US and Israeli researchers who have developed software that can identify the subject of an image characterized using only 256 to 1024 bits of data. The researchers said this "could lead to great advances in the automated identification of online images and, ultimately, provide a basis for computers to see like humans do." As an example, they've picked up about 13 million images from the Web and stored them in a searchable database of just 600 MB, making it possible to search for similar pictures through millions of images in less than a second on a typical PC. The lead researcher, MIT's Antonio Torralba, will be presenting the research next month at a conference on Computer Vision and Pattern Recognition.
There goes the neighborhood (Score:4, Funny)
Re: (Score:2)
Re:There goes the neighborhood (Score:5, Insightful)
Re:There goes the neighborhood (Score:5, Funny)
Re: (Score:1)
Re: (Score:1)
Re:There goes the neighborhood (Score:5, Informative)
Re: (Score:1)
Re: (Score:2)
Re: (Score:1)
Re: (Score:1, Offtopic)
Pick a bale of hay.
tests (Score:1, Funny)
Re: (Score:1)
Like every other "advance" in image recognition... (Score:3, Insightful)
Until then, it's snake oil, as far as I'm concerned.
Re:Like every other "advance" in image recognition (Score:5, Funny)
No wonder those snakes are not only so quiet, but I never even see 'em coming!
Geez. We don't stand a chance.
Re: (Score:1)
Re:Like every other "advance" in image recognition (Score:5, Funny)
Re:Like every other "advance" in image recognition (Score:3, Informative)
From what I can tell, it's basically, "blur the image down to only a few hundred pixels and then you have less data to comb through!"
Re: (Score:1)
So... the are using it to identify nipples?
Re:Like every other "advance" in image recognition (Score:3, Interesting)
If they are claiming to have a general image recognition algorithm that'd be
Re:Like every other "advance" in image recognition (Score:5, Informative)
http://people.csail.mit.edu/torralba/tinyimages/ [mit.edu]
Re: (Score:1)
http://people.csail.mit.edu/torralba/tinyimages/ [mit.edu]
Another Roland Piquepaille story on Slashdot (Score:2)
Re: (Score:1)
...I'll believe it when I see it.
Exactly. This is a routine rite of passage for all Pattern Recognition researchers: when they need to justify more funding, they claim to be "just around the corner", hence we have this type of hyperbole. The paper is ridiculously simplistic, and does not deserve /. attention.
Before we have this tech, we will have to solve Bongard problems [scribd.com].
Machines can't even distinguish the content between two sets of B&W (binary) images with simple figures. Solve Bongard problems, and you'll be just around t
Oh thats really simple to do... (Score:2)
Of course it helps if you read the papers... (Score:4, Informative)
I just finished reading "Small Codes and Large Image Databases for Recognition" written by the guy. All he did was implemented Geoff Hinton's idea of databasing images with a binarized coefficients produced by Restricted Boltzmann Machines.
Hinton himself gave a talk on it for Google here:
http://www.youtube.com/watch?v=AyzOUbkUf3M [youtube.com]
Actually I'm wondering, is he plagiarizing Hinton?
Re: (Score:3, Insightful)
Re:Of course it helps if you read the papers... (Score:4, Insightful)
This research involves the use of one of the largest image databases seen in computer vision. It shows that you can do extremely rapid scene matching for databases of this scale. No, that's not obvious no matter what you think. This image data is fairly high dimensional.
This research says something about the space of likely scenes and it might be a key enabling technology to a lot of the heavily data driven computer vision and computer graphics approaches popping up lately.
Re: (Score:2)
It was a skim to see what the hell the article was really about, didn't know these two were connected. I jumped the gun 'cause I got burned by a plagiarizer in the past, sorry.
Re: (Score:1)
Also the example shown in the article does not really make sense for me. I mean of course if we look at a blurred and rotated object in a series of images it is hard to discern. Also we might think the object is not the same in the different images, although it actually is. But the question is, does that mean that the algorithm can reliably determine the identity of an object even if a human viewer cannot?
Actually I doubt
Oh really? (Score:2)
I guess nobody there thought to do the math before making these claims. This story probably shouldn't have made it to the front page; it's less than useful.
Re: (Score:3, Insightful)
No one said they were going to identify individual people with this. The main gist of this research seems to be efficiency (in both space and time, if I read it correctly). For instance, if one wanted to identify every face in a picture of a crowd, they could apply this algorithm to a low-res version of the image to quickly find the locations of every "face," and then use a more advanced face recognition algorithm t
Re: (Score:1)
not so much an advance (Score:2)
I can see it now (Score:1)
Re: (Score:2)
Get off my lawn, you young whippersnapper!
But can it tell the difference between.... (Score:2)
How will spammers make use of this? Well just make that viagra pill be reflected in a coke bottle.
anyone for a random bit generator to see what random results gets labeled?
Wonder what fractals might produce?
moral of the question: we can always break what we make.
Search Jenna Jameson? (Score:5, Funny)
i'm a little concerned about the licensing. (Score:1)
for the record, i say this as a concerned/curious artist who isn't looking for a payout.
Re: (Score:1)
Link to the source code (Score:2)
As I understood it, it's not for sale, you can get it at his MIT website [mit.edu]
Re: (Score:2)
Re: (Score:2)
Since 8% of pix on flickr... (Score:1)
Re: (Score:2)
Scary. (Score:2)
Re: (Score:1)
Re: (Score:2)
Very cool stuff... (Score:2, Interesting)
That is, there are a number of image similarity algorithms, but the computed values of two similar images are not necessarily mathematically near to each other. This algorithm produces values that are, which can make searching for similar images among very many images, quite fast.
Re:Very cool stuff... (Score:4, Informative)
A reasonable descriptor which produces distances that seem somewhat correlated with human perception would indeed be Antonio Torralba and Aude Oliva's gist descriptor.
http://people.csail.mit.edu/torralba/code/spatialenvelope/ [mit.edu]
It's become quite popular in computer vision and computer graphics for scene matching.
Hmmm.... (Score:1)
While I don't care about most uses about my images (go a head and PS a penis in my mouth) but I would fight it if I found out it was used for this.
Think it through (Score:2)
Thats not meant to disparage the work - image recognition is important and difficult. This particular 'advance' just isn't that 'advancing'
Re: (Score:1)
Re: (Score:1)
Your second number is essentialy the number of hydrogen atoms in 10^231 universes, all similar to our own.
In english, 2^1024 is approximately equal to the number of hydrogen atoms in a googol googol universes. Imagine if you will, a replica universe for every hydrogen atom in our universe. Now imagine a replica universe for each of the hydrogen atoms in those replica universes.
We are still about 10^
Re: (Score:2)
It still isn't big enough to do even a half decent job.
Imagine that a bit has a coherent meaning, such as "image contains a kitten." If the bit is zero, there is no kitten in the image; if it is one, there is at least one kitten in the image.
Now imagine that system extended for all types of animals. Going to go past a requirement for 1024 bits pretty fast, isn't it?
Now imagine that a bit represents "image contains a keyboard" in the same fashion.
Now imagine a bit for every type of macro and mic
Re: (Score:1)
Imagine that a bit has a coherent meaning, such as "image contains a kitten." If the bit is zero, there is no kitten in the image; if it is one, there is at least one kitten in the image.
Why would I imagine that?
The most trivial example that busts your theory is the game "20 questions." Each answer is yes/no, a single bit of data mapping a 20 bit value to 1 million arbitrary things. There is no bit assigned to "this is a cat", each bit is taken in context with all of the other bits.
A more complex example is Arithmetic Compression, which assigns fixed-length input codes to variable-length output codes. The shortest output code length is much smaller than a single bit (any symbol with
Re: (Score:2)
You should imagine it because you are conflating the number of bits required to count things with the number of bits required to discriminate among things. The two are entirely disjoint.
Re: (Score:1)
I simply expressed 2^1024 in understandable terms, keeping on point with the person I replied to who was trying to compare his numbers with the number of atoms in the universe.
Also, you apparently didn't read the article because you attribute qualities to the technique which the article specifically denies. It does not say that it will descriminate against individual people (such as "one bit that says 'image contains pesson Rockoo
I forsee nefarious law enforcement uses (Score:2)
Re: (Score:3, Insightful)
Some of those scientists are actually pretty smrt.
Re: (Score:2)
Re:I forsee nefarious law enforcement uses (Score:5, Informative)
Another way is to somehow identify the orientation. An simple way to do that is to find the axis along which there's maximum variation and rotate until those axes match in both images.
Pixel by pixel co-registration basically does look at a similarity measure for a lot of variations on the affine transform. You generally don't have to look at them all though: you use an iterative algorithm with a clever optimization strategy so your transform gets better and better instead of searching through the parameter space randomly.
Typical PC (Score:2)
Of course that typical PC is a dual quad-core machine running at 3GHz with 8GB of memory, GPU X3 running offloading co-processing software, and 1TB of hard drive space.
Mpff (Score:1)
Opensource project to identify similar images (Score:2)
It's also designed to quickly find similar images, even out of millions of images. The documentation describes a possible indexation technique (as suggested in the original paper):
http://download.pureftpd.org/pub/pure-ftpd/misc/libpuzzle/doc/README [pureftpd.org]
Images are stored as 544-bits signatures by default.