Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Searching by Image Instead of Keywords

Posted by samzenpus on Wed May 04, 2005 08:00 PM
from the find-me-something-square-and-green dept.
Content based image retrieval (CBIR), the technique to search for images not by keywords, but by comparing features of the images themselves has been the focus of much research ever since the web emerged. Consider for instance adding CBIR to Google Images, where you would be able to search for images similar to a query image instead of using keywords. A research project at Penn State University has recently been applied to the biggest aviation photo database in the world with close to 800,000 images. You can search for images similar to a photo already in their database (click "View similar photos") or submit your own query image. Some queries generate better results than others but CBIR is certainly here to stay and will be standard in many image applications of the future.
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by qewl (671495) on Wednesday May 04 2005, @08:01PM (#12437393)
    I can't wait to put a nipple into it!
    • Ehm - not no be too much of a geek, but here are some airplanes that (don't) look like the Slashdot logo...

      http://www.airliners.net/similarity/index.php?imag e_url=http://images.slashdot.org/title.gif [airliners.net]

      BTW, if you want to post other searches, this URL format seems to work.
    • by Anonymous Coward
      Talking about greatness to society and a little bit of skin. At university one of my projects was a system that used CBIR to try and diagnose skin cancer. The doctor would take an image of the suspect area it then would be compared against a database of cancers. It would then return a suggested likelyhood of being cancer. It also allowed the doctor to build a history of images allowing easy comparision over time.

      I always felt good about working on projects like this, gives a warm fuzzy feeling.
    • Actually, it will be hillarious what will happen when grandma puts in a picture of her grandson taking a drink from the hose in the backyard.

      Its almost like telling someone to go to whitehouse.com
  • and set for goatse!
  • Location? (Score:5, Funny)

    by poopdeville (841677) on Wednesday May 04 2005, @08:04PM (#12437416)
    What an awful beach [airliners.net].
  • Wow (Score:5, Interesting)

    by themoodykid (261964) on Wednesday May 04 2005, @08:04PM (#12437418) Homepage Journal
    I was just thinking about this the other day. I think content-based image search is one of the Next Big Things. Cameras are so ubquitous now (for better or worse), but having to rely on metadata to give meaning to images requires lots of effort up front.

    It will be interesting if we ever get to a stage where we can just search for a random object (or person) in a database of photos. Then you could take pictures of everything with an always-on camera and if you need to find where you put your car keys, just do a search.
    • But shit, this REALLY works! It's amazing.
      • But shit, this REALLY works! It's amazing.

        Dunno about that. Here's what I get after clicking on a picture of an A-10 Warthog: A Tornado, a 767, a 747, A Fokker F-7 turboprop, a Dassault Falcon business jet, a Luftwaffe A310, a Harrier, an F-18 Hornet, another Tornado, a Lockheed P3 Orion sub hunter, a Sikorsky Super Stallion helicopter, a Concorde... and soforth. No other A-10s. Hard to think of a more diverse crop of aircraft.

        Most of these aircraft are airborne but a couple are on the ground. If I cli

    • Re:Wow (Score:4, Informative)

      by theguyfromsaturn (802938) on Wednesday May 04 2005, @09:24PM (#12437893)

      If you are only interested in searching for images on your own computer, have a look at imgSeek. http://imgseek.python-hosting.com/ [python-hosting.com]

      It's been around for some time now. You can not only use an existing image to search, but also do a rough sketch. Check the screenshots: [sourceforge.net]

      Nice complement to what has been presented in this article.

  • This is just asking for trouble. As most of you would probably imagine, some self-proclaim "comdeian" would post either porn pictures, or pictures that resembles porn body position.

    They would need a team of outsource Indian workers to go through each picture one by one!

    I am not Indian but...can I apply for the image filtering job?
    I said this first, I should get the job ;) .
  • Top Search (Score:3, Funny)

    by daishin (753851) on Wednesday May 04 2005, @08:08PM (#12437445) Homepage
    Something with two circles and dots in the middle of each circle.
  • Some Applications of Our Research
    1. Airliners.net
    A site with almost 1,000,000 aviation images.


    Wow !!! I tested their Sample search [airliners.net] and all the results were aeroplane photos !!! Ok, ok the site only has airplanes but still ..:)

    On a more serious note the alogorithms seem to look for similatity in the colors and lighting rather than the subjects (for example it shows the interior of a cabin in photos similar to a whole plane in the sky. To really see its effectiveness we need to test in in
  • by mikael (484) on Wednesday May 04 2005, @08:17PM (#12437504)
    ... the search engine will support ASCII art image searches.

  • by FleaPlus (6935) on Wednesday May 04 2005, @08:28PM (#12437575) Homepage Journal
    There's a bunch of interesting papers out there on content-based image analysis and retrieval. Below is a sampling from my bibtex file. Does anyone else have others they'd like to share?

    * Finding Naked People [hmc.edu] (Fleck et al, 1996)

    * Video google: A text retrieval approach to object matching in videos [ieee.org] (Sivic & Zisserman, 2003): web page demo here [ox.ac.uk]

    * Names and Faces in the News [columbia.edu] (Berg et al, 2004)

    * FACERET: An Interactive Face Retrieval System Based on Self-Organizing Maps [springerlink.com] (Ruiz-del-Solar et al, 2002)

    * Costume: A New Feature for Automatic Video Content Indexing [www.irit.fr] (Jaffre 2005)
    • by FleaPlus (6935) on Wednesday May 04 2005, @08:39PM (#12437636) Homepage Journal
      I forgot one more, where specific faces were automatically retrieved from feature-length movies and Fawlty Towers:

      Automatic Face Recognition for Film Character Retrieval in Feature-Length Films [cam.ac.uk] (Arandjelovic & Zisserman, 2005)

      The objective of this work is to recognize all the frontal faces of a character in the closed world of a movie or situation comedy, given a small number of query faces. This is challenging because faces in a feature-length film are relatively uncontrolled with a wide variability of scale, pose, illumination, and expressions, and also may be partially occluded. We develop a recognition method based on a cascade of processing steps that normalize for the effects of the changing imaging environment. In particular there are three areas of novelty: (i) we suppress the background surrounding the face, enabling the maximum area of the face to be retained for recognition rather than a subset; (ii) we include a pose refinement step to optimize the registration between the test image and face exemplar; and (iii) we use robust distance to a sub-space to allow for partial occlusion and expression change. The method is applied and evaluated on several feature length films. It is demonstrated that high recall rates (over 92%) can be achieved whilst maintaining good precision (over 93%).
    • QBIC is part of IBM's DB2 content manager. It has been available for at least 5 years now, and is now part of a DB2 extender. You can check it out here:

      http://wwwqbic.almaden.ibm.com/ [ibm.com]
    • why bother making an algorithm that can recognise which images are porn and which are not when you can just set up a web site where people will do it for free? It reminds me of those "enter the characters in this image" tests that places like Yahoo do to ensure you can't sign up for a million email accounts a day. They're so easy to get around cause all you have to do is present the image to a man who wants porn and he'll happily provide his character recognition skills without charge.
  • by dotpavan (829804) on Wednesday May 04 2005, @08:45PM (#12437669) Homepage
    Here is a google game which is reverse of google's image search:

    One has to guess the search word which generated a given set of 20 images in google's image search [robinson.name]

    When things are moving forward, we have soomthing to talk about "those good ole days" but frankly the game is interesting initially but later gets boring due to the frequent repetitions..

  • Is it just colour? (Score:5, Interesting)

    by Bifurcati (699683) on Wednesday May 04 2005, @08:47PM (#12437676) Homepage
    I just did a quick search based on this [designer.am] image of a Qantas logo (that's the main Australian airline, in case you're wondering...) It's red, with a white kangaroo in the middle. My theoretical aim was to find photos of Qantas planes.

    What I got was an awful lot of red planes - some of which were actually Qantas planes, but I think more by coincidence (i.e., they're red) than design. Many images had nothing to do with Qantas, or even a red plane - they simply had a lot of red in the image.

    This is impressive in some ways, but in others it seems like it's simply looking for similar patches of colour. I haven't done enough testing to see what happens if,say, I gave it a half red half green image.

    Interesting, but not ready for public consumption just yet. A bit like A.L.I.C.E. the artifial intelligence system actually - neat, but not practical. Yet!

  • Great! (Score:5, Funny)

    by SetupWeasel (54062) on Wednesday May 04 2005, @08:50PM (#12437700) Homepage
    Now I can find all the other naked pictures of Bea Arthur on the web!
  • The Mona Lisa (famous and out of copyright) is often plagarized in whole or in part as part of commercial or satiric artistic works. These types of visual database engines have frequently been explained to me as being able to input the Mona Lisa and get a list of images that used the entirety of the image or just a part (such as the highly-praised subtle smile).

    The big problem to me is specifying input. I know the "look" of the Mona Lisa's smile, but even with the best pen input methods I'd never be able

  • Pattern Rcognition [williamgibsonbooks.com] is a novel by William Gibson, basically set in the present day or very near future. Image based search plays a central role in the plot. It's a very good read.
  • by exp(pi*sqrt(163)) (613870) on Wednesday May 04 2005, @09:07PM (#12437792) Journal
    I was looking at a picture of a plane on that web site and there was a link that said "Click for similar images". And what do you know? It brought up more pictures of planes. This is amazing stuff. How did it understand that I was looking at a picture of a plane?
  • by capedgirardeau (531367) on Wednesday May 04 2005, @09:43PM (#12438007)
    From gnu.org:

    The GIFT (the GNU Image-Finding Tool) is a Content Based Image Retrieval System (CBIRS). It enables you to do Query By Example on images, giving you the opportunity to improve query results by relevance feedback. For processing your queries the program relies entirely on the content of the images, freeing you from the need to annotate all images before querying the collection.

    GIFT [gnu.org] It worked pretty well for me in the demos they linked too. I have been waiting for this type of application to gain momentum.

  • 'Coz I'm looking for more information on this image. [pacific.net.au]

    It says "multi lock on" and a date, but all Google reports is other forum posts looking for the creator of the image. Apparently, there's a high-res version of it too.
  • A.net is very stringent on the photos they accept. You can submit hundreds of photos, and get rejected for such things as 'badmotive' (a runway sign blocking a single tire), very mildly soft focus, and lots of other pretty anal things (IMHO). So while the image count they are dealing with is high, the obvious resulting similarity among images will result in a high number of matches.

    Now, do this for something like Google Images or PBase or collections spanning infinite numbers of subjects and image sizes,

    • by Anonymous Coward
      It seems that a favorite use of the image similarity search over there at airliners.net is for the spotters to run pix on airline and flightsim sites through the search, to see who on anet has been infringed upon copyright-wise.

      Look up Bombardier in the forums on airliners.net, they have frequently asked a photog for permission to use their photos (for pay), then later say they elected not to use them (and therefore no payment to photog). But then they use the photos anyways without payment or acknowledge

  • Is this a joke? (Score:3, Interesting)

    by Daikiki (227620) <daikiki@NOSPAM.wanadoo.nl> on Wednesday May 04 2005, @10:38PM (#12438320) Homepage Journal
    I've tried two different images of airplanes; one of a bright red flying car on bright green grass and one of SpaceShip One against a deep blue sky. Both times, the results looked surprisingly like my query images in color composition only. Red planes on grass and white planes against a blue sky. Inauspicious start.

    Next experiment: I took a picture of a highly distinctive plane, a harrier, climbing at a steep angle and viewed in profile. I got, in return, a list of passenger jets, and even a helicopter. Hardly surprisingly, all of the result pictures had the same bluish white sky as my original image. That was literally the only similarity.

    According to the introduction on the search page the heuristics used compares colors, contrast and shapes in the images themselves. I saw no correlation whatsoever between shapes, and any correlation in contrast seems to be to be the result of the search engine simply looking for images that contain the same colors in a similar ratio to the original. In short, nothing to see here, move along.

    On the other hand, one of the projects listed under the Penn State University link looks fairly fascinating. The Riemann a-LIP project [psu.edu] (automatic linguistic indexing of pictures) doesn't allow user input of images, unfortunately, but it does show some fairly fascinating attempts at verbally qualifying image data. For example, it describes a blue and orange mandelbrot as pattern agate shimer abstract scene, and a sunset over a lake as Berlin Devon Namibia landscape lake scene. Okay, it may still need some work, but it sure beats the hell out of the "find the same color airplane engine".
  • by mr_zorg (259994) on Wednesday May 04 2005, @10:47PM (#12438377) Homepage
    Oh, you mean like imgseek [python-hosting.com]?
  • by Sirch (82595) on Thursday May 05 2005, @04:28AM (#12439533) Homepage
    About a year or so ago, I and three other Masters students worked on a similar project at the University of Southampton.

    I've not RTFA (not had the time), but our approach was to split the images into segments (based on colour and texture) which were assumed to be objects. The segments would then be analyzed for various feature vectors, such as shape, texture, colour etc. These vectors would then be added into a database of numbers, and finally the segments grouped, giving a collection of classified sections which (hopefully) represent similar objects.

    From related metadata such as keywords, you could then hope to build up an idea of what keyword matches which section. You could also come up with a relevance between two images, and thus search for similar images.

    We didn't have enough time to make it bulletproof by any means, but our limited results were very promising.

    Sorry I can't find the paper, but we've got some screenshots of the application here [soton.ac.uk] and here [soton.ac.uk] (you can see false colouring applied to the original image to display the segments)
    • Re:wtf? (Score:5, Interesting)

      by Rei (128717) on Wednesday May 04 2005, @08:07PM (#12437442) Homepage
      Because it still has problems - you'll note that the pictures seem to be compared simply based on color similarity. That's the same thing imgSeek [python-hosting.com] does (a great program for sorting and searching your photos) on photo searches. It works wonderfully if you're searching a very limited picture subset (say, airplanes), but if you search a wide variety of pictures, the results can be quite amusing.
      • You also have the opposite problem with color histograms: two very similar images, even two images taken of the same static scene, seconds apart, can have substantially different color histograms.

        I used to do research in CBIR, and in my image library I had 5 photos of a christmas tree. By any efficient metric, one of them was always way far away from the others.

        Xcott

    • Re:wtf? (Score:3, Interesting)

      Google actually did take this technology and try it. The first version of their image search had a "find similar" link next to every image. These tended to work okay at first (they weren't great, but you usually got enough photos back that you could visually scan them and find something of interest that was related to the original image). After a few months, for some reason, the "find similar" links started returning increasingly nonsensical results. After it degenerated to the point of near uselessness, th
    • Maybe trainspotting has died down because all you get on Google now are results for that wretched movie.

    • We always tell people not to mod down improper observations, so I'm going to try to practice what I preach so to speak.

      On what grounds could the TSA squelch photographers and their right to share their creative works (which is their livelihood)?
    • The TSA would probably have some difficulty shutting down Airliners.net, seeing as it's in Sweden. Furthermore, airliner photography is perfectly legal in the US, and even TSA reps have said so. Usually the only people with a problem with it are the rent-a-cops.

      Trainspotting seems to still be around as well, see http://www.railpictures.net/ [railpictures.net].