Recognizing Scenes Like the Brain Does 115
Roland Piquepaille writes "Researchers at the MIT McGovern Institute for Brain Research have used a biological model to train a computer model to recognize objects, such as cars or people, in busy street scenes. Their innovative approach, which combines neuroscience and artificial intelligence with computer science, mimics how the brain functions to recognize objects in the real world. This versatile model could one day be used for automobile driver's assistance, visual search engines, biomedical imaging analysis, or robots with realistic vision. Here is the researchers' paper in PDF format."
Does anybody know where to find the actual paper? (Score:1, Informative)
that's a generous view of it (Score:3, Informative)
1. The code is in a horrible hacked-together state and so not really fit for release, and nobody wants to put in the effort that would be needed to clean it up; or
2. The researchers don't want to release their code because keeping it secret creates a "research moat" that guarantees that they'll get to publish all the follow-up papers themselves, since anyone else who wanted to extend the work would have to first invest the time to reimplement it from scratch (this is more common in implementation-intensive areas like graphics)
Earlier work 1989-1997 on street scene analysis (Score:5, Informative)
1. WPJ Mackeown (1994), A Labelled Image Database, unpublished PhD Thesis, Bristol University.
2. WPJ Mackeown, P Greenway, BT Thomas, WA Wright (1994).Road recognition with a neural network, Engineering Applications of Artificial Intelligence, 7(2):169-176.
3. NW Campbell, WPJ Mackeown, BT Thomas, T Troscianko (1997).
Interpreting image databases by region classification. Pattern Recognition, 30(4):555-563.
There has been various follow up research since then [google.com]
Re:More importantly, where is the source code? (Score:2, Informative)
Re:not like the brain does. (Score:3, Informative)
Re:nothing new (Score:4, Informative)
That said, they do present a simple and biologically-motivated preprocessing layer that appears to be useful, which reflects back on the brain. In summary, I would say that this paper helps more to understand brain functioning than to develop machines that can achieve human-like vision capabilities. So, very nice, but let's not over-hype it.
Fine paper, but why not quote all of PAMI ? (Score:5, Informative)
Interested readers can browse the content of PAMI current and back issues [ieee.org] and either go to their local scientific library (PAMI is recognisable from afar by its bright yellow cover) or search on the web for interesting articles. Often researchers put their own paper on their home page. For example, here is the publication page of one of the authors [mit.edu] (I'm not him).
For the record, I think justifying various ad-hoc vision/image analysis techniques using approximations of biological underpining is of limited interest. When asked if computer would think one day, Edsgerd Dijkstra famously answered by "can submarine swim?". In the same manner, it has been observed that (for example) most neural network architectures make worse classifiers than standard logistic regression [usf.edu], not to mention Support Vector Machines [kernel-machines.org], which what this article uses BTW.
The summary by our friend Roland P. is not very good
I could go on with lists and links but the future is already here, generally inconspicuously. Read about it.