Microsoft Demos "Deep Zoom" Technology 272
Barence writes "Yesterday, during a presentation for this year's Imagine Cup, Microsoft's Mark Taylor demonstrated the company's Deep Zoom technology to appreciative gasps of admiration from the computing students present. It's pretty impressive stuff, and you can try 'deep zooming' for yourself at the Hard Rock Memorabilia Site." Unfortunately the demo requires the Silverlight plugin and the story is pretty thin on technical details. I would be interested to see how they captured the image data to that level without massive pixelation.
Imagine Cup (Score:3, Insightful)
So is this digital zoom stuff like the software that they "download off the internet in CSI: Miami" *Snicker*
Re: (Score:3, Interesting)
Re:Imagine Cup (Score:4, Informative)
I was much more impressed with PicLens [piclens.com].
Re:Imagine Cup (Score:4, Interesting)
It's not a new interpolation algorithm.
It's a live version of the The shift-and-add method or image-stacking technique used by astronomers for decades. It's just that now computer hardware is fast enough do it seamlessly.
Basically, the zoom is made from hundreds of still photographs taken from different vantage points. There was something similar being done with tourist destinations, if I remember correctly.
It's an interesting toy, but the practical applications are limited by the lengthy production process.
Re: (Score:2)
Haven't you ever.. (Score:5, Funny)
Re: (Score:3, Informative)
Re: (Score:3, Interesting)
It's the actual software they use on CSI. Read more here [msdn.com].
I think he refers to the software in which they miraculously rotate a single two dimensional image to see stuff from other angles, or enhance gritty 320x200 CCTV images into uber-high resolution with no artifacts or fuzziness.
(Might have been in some other forensics/cop show they did that, though.)
Re: (Score:2)
Re:Haven't you ever.. (Score:4, Funny)
Load photo image.
Enhance.
(zooms in)
Enhance.
(pans around some obstacle)
Enhance.
(pans to the back door, opens the door?, reads license plate from some car a half-block away)
Enhance.
(finds intelligent life in Arkansas)
Re: (Score:3, Informative)
Depending on the footage, this is semi-possible. There's software out there that can watch the motion of an object and determine what the sub pixels were. It's not ideal in every scenario (even less likely slow in the case of a blurry face on a security cam...), and it won't be as snazzy as CSI, but it is possible in a general sense. It only works, though if it can get actual motion vectors from the footag
No, but I've seen GigaPan (Score:4, Interesting)
DeepZoom (Score:4, Informative)
My understanding is that you use different resolutions of the photo. The original photo is obviously the highest res you can have, but you can make successively lower res copies. More or less just bring up a a higher res version when the user clicks.
I saw this demoed at the Atlanta Code Camp [atlantacodecamp.com] back in March. Very cool to watch.
Multiple resolutions (Score:3, Interesting)
db
Re: (Score:3, Informative)
Well, at least that's what it looks like when you use their silverlight app.
You can actually spot the "seams" when zooming. Some of them aren't even superimposed correctly, leading me to believe that they are using a series a pictures taken with different cameras, instead of just storing lower resolution copies of the master image.
Re: (Score:2)
Last year, Microsoft Research and their collaborators from University of Konstanz had a technical paper at Siggraph on this topic:
http://johanneskopf.de/publications/gigapixel/index.html [johanneskopf.de]
All a demo: smoke and mirrors (Score:2)
Setting up a single canned demo is pretty easy to do - relative to applying this "technology" in a wider automated scale.
Sure it is cool to watch, but so is any special effect.
Old news (Score:2)
Re: (Score:2)
oh lordy... (Score:5, Insightful)
You don't
*slaps forehead*
Re: (Score:2)
Re:oh lordy... (Score:4, Informative)
If you can find them, zoom in on those Beatles bobble heads that the article describes. They're very highly defined. Then zoom out a bit and scroll around to (for example) the surrounding Hard Rock Cafe frame. Wonderfully blurry with respect to the bobble heads.
As you zoom out further, you'll notice how the "container" holding those bobble heads antialiases itself differently from the surrounding different-res artwork.
If you move amongst the different images of guitars and clothes (etc) you'll notice in the lower right that it identifies who the centered item belongs to.
So it appears to me that this is a number of different graphical objects that can be zoomed at relatively different distances at the same time. And it looks like they can be embedded within each other.
Unfortunately? (Score:4, Insightful)
Unfortunately the demo requires the Silverlight plugin...
A Microsoft tech demo requires the installation of new Microsoft software to view? Who would have though?
While Silverlight might never be as widely-supported as Flash, I hope that perhaps the competition might force Adobe to do something about the CPU hog that is Flash.
Re: (Score:2, Insightful)
It's Intel/AMD that are expected to fix this issue, by making huge leaps in processing grunt.
I mean, you can not expect the current crop of programmers to actually write (or even just optimise) fast code.
We now have many touted languages that are actually interpreters, not compilers. The argument from programmers is that today's CPU's are fast enough, and that these "new" languages are much easier to debug. They may as well be written in Visual Basic in my opinion!
Re: (Score:2, Interesting)
Silverlight is a browser plugin. It takes all of about 10 seconds to install.
Flash needs to just go away.
Re: (Score:2, Informative)
(the Linux stuff is called Moonlight, and isn't functional yet)
Re:Unfortunately? (Score:4, Informative)
Re: (Score:2)
Well, no, they opened the VM and a partial JavaScript compiler [mozilla.org], which is part of Flash, but not the same thing as Flash.
Well, they are still working on it, just letting other people work on (and use) some components of it as well. Its not like Adobe stopped being involved in Tamarin [mozilla.org] once they gave it to Mozilla.
Maybe not CSI (Score:5, Insightful)
Re:Maybe not CSI (Score:5, Informative)
Aside from that, it is another form of a "tile server" application... Just one that happens to be rather easy to use from a development perspective, and one that has been done really well (Zoomify/AJAX-based solutions don't hold a candle to the tile stitching and easing effects built into the MultiScaleImage control, IMHO).
Re: (Score:2)
Re: (Score:2, Insightful)
Re: (Score:2)
Just like this over-hyped software does.
The only 'newness' is not using a fixed high-level map. That probably took a few lines of SQL (or Transact-SQL depending on the source database.)
Yawn
SeaDragon (Score:5, Informative)
Silverlight: silverlight.net
SeaDragon: http://labs.live.com/seadragon.aspx [live.com]
Re: (Score:2)
Re: (Score:3, Insightful)
So they're essentially recreating Apple's Quartz + OpenGL + standard image formats with Photosynth + DirectX + WMPF.
Re: (Score:3, Interesting)
So they're essentially recreating Apple's Quartz + OpenGL + standard image formats with Photosynth + DirectX + WMPF.
Simply put, apple does an incredible job visually representing itself, it's technology and providing a user experience that is very hard to match.
That said, I disagree that microsoft is recreating any preexisting technology. You could argue that DirectX is just like OpenGL, but that's likely grossly oversimplified.
Photosynth and Seadragon are demoed here: http://www.youtube.com/watch?v=vHsYnkLnepk [youtube.com]
Neither of those are similar to things that apple has done.
layered bitmaps (Score:4, Informative)
Re: (Score:3, Informative)
Yet another Deep Zoom (Score:4, Informative)
Re: (Score:3, Informative)
Sounds like GigaPan (Score:2, Informative)
http://gigapan.org/
It uses a (cheap) commodity digital camera, combined with a smart tripod, good photo stitching software, and a nice Flash UI to give you highly zoomable panoramas. The CMU thing has been around for a while --- over a year at least, plus I'm pretty sure you can get one of the tripod mounts if you participate in the beta and create your own.
Re: (Score:2)
...don't ask from me what you can find in /proc.
Deep Ream (Score:3, Funny)
Deep Zoom? More like Quick Load. (Score:3, Insightful)
Re: (Score:3, Insightful)
Kind of like what happens when you use Google earth very close (i.e., in-situ) to where the servers with the data are stored.
Comment removed (Score:4, Informative)
WTF? (Score:5, Insightful)
Re:WTF? (Score:4, Interesting)
MS Devs have done some amazing things within their allotted size quotas.
Re: (Score:2)
Uses gigapixel imagery as source (Score:5, Informative)
Most people go ooh and aah because they (wrongly) assume that it zooms into normal resolution photos .
It doesnt (because as you and I know, it physically can't).
Deep Zoom does NOT perform CSI/CIA-style photo enhancement. If you dig deeper, you will find that what Deep Zoom is intended for is to enable one to focus on a smaller portion of a giga-pixel photograph so you do not have to download the whole photograph.
Think of it like a hierarchical smooth slicing of a large high resolution photograph and only downloading those "planes" and "sections within a plane" that the user is interested in seeing.
Interesting technology but not magic.
Re:Uses gigapixel imagery as source (Score:4, Informative)
Re: (Score:3, Interesting)
So what exactly is new here, except for the use of Silverblight?
No free lunch (Score:5, Informative)
1) have the pixels in the first place
2) having more pixels in the first place.
Anything else is a fundamental violation of the laws of physics and math. You simply can not fake what you don't have without it being exactly that: a fake. There is no storage printing technology which could accomplish this level of zooming, and they carefully do not say that this is actually a continuous zoom of a picture on a stamp.
Deep Zoom works by letting you meld several images in such a way as pretend its one image.
Basically, its a con-job of transitioning several different images, where one is a re-photograph of sub portion of the original.
The implication of the article is that this is all one image containing a nearly infinite level of detail, which it most emphatically is NOT.
The author is probably equally impressed by street corner magic tricks.
Re:No free lunch (Score:5, Insightful)
That's still very useful.
Basically, its a con job of transitioning several different images, where one is a re-photograph of sub portion of the original.
'con job' has needless connotations of an intent to deceive.
The implication of the article is that this is all one image containing a nearly infinite level of detail, which it most emphatically is NOT.
No. The implication of the article is that you can provide this as a user interface, which is very cool. Google Earth isn't interesting because its a 'con job' to let us think we can zoom in and out of a single monster image of the planet. Its interesting because its a natural and convenient UI to use.
And we don't have to download every single pixel of every single higher res image of a tree in Nigeria to have a closeup look at a parking lot in London. Detail is loaded on the fly, as needed, while the user gets a 'seamless' and comparatively low bandwidth experience.
Its not particularly new as an idea. Or even as an implementation. But maybe Microsoft's tools make setting it up substantially easier, and that alone would be a nice bit of progress.
The author is probably equally impressed by street corner magic tricks.
I am impressed by street corner magicians too. Not because I think they're magical, but because I am impressed at their showmanship, sleight of hand, dexterity, and general ability to appear magical.
Re: (Score:2)
They analyse the data properties of the camera and see what data is rejected and where data is averaged, this is very intensive and esoteric research and not something that's obvious.
I don't get it (Score:5, Insightful)
It would be impressive if the photo they demonstrated on was anything but a photoshop, but given that the 428x134 signature is 52x11 in the 350x237 statuette picture which is 29x26 in the 428x350 hard rock picture which is 87x87 in the 428x399 stamp picture, for the stamp to be real would require a 33 gigapixel stamp (which, at 1 inch square, would be printed at 33,000,000,000 DPI).
To me zooming in and displaying a different image isn't really as exciting at the article author makes it sound? Maybe I'm missing something because the journalist sounds pretty damn excited about it.
Re: (Score:2)
Re: (Score:3, Interesting)
Basically I can run around taking random photos some zoomed in, some not and seadragon will automatically stitch it all together.
The image is only 21K (Score:5, Funny)
Crashed FF 3.0 on my Mac (Score:4, Informative)
Re:Crashed FF 3.0 on my Mac (Score:4, Interesting)
Of course the Silverlight and the zooming works as advertised in IE 7.0.6
Is it just me, or...... (Score:2)
I realize that it's new, and takes effort, but I can't break out the oohs and aaahhhs just yet. No matter how good it is, is it worth upgrading for?
Car analogy: Isn't this like demonstrating a concept car that they intend to put into production, but production will be a little bit different?
I'll wait for SP2 (or equivalent), thanks very much.
Re: (Score:2)
Deep Ripoff (Score:3, Interesting)
Go to Pouet [pouet.net] and you'll find many demonstrations of this effect.
jdb2
This has been done. Better. (Score:3, Interesting)
See Charles and Ray Eames' Powers of Ten [youtube.com]. Now that's a zoom.
As for doing it in real time, Keyhole (bought by Google and renamed Google Earth) was doing this on PCs five years ago. Any decent GPU can do this today, and you can download Google Earth to see it.
I saw one of the first systems able to do this in real time about 25 years ago. It was inside a classified tank at a major aerospace firm, and required a rack of special-purpose hardware. The user interface was beautifully simple - a big trackball (for pan), a lever (for zoom), and a knob (for rotation).
Even Microsoft's little film isn't original. That technique has been used a few times in commercials.
So Silverlight doing this isn't exactly a big "wow" development.
Prior art (Score:3, Informative)
And I finally remembered; Jef Raskin's [wikipedia.org] "Humane Interface".
Zooming demo from several years ago that runs in Flash here. [raskincenter.org]
Quite similar, IMHO. Hmm?
Had this for years (Score:2)
The real trick is finding an image with boring enough edges that you can pretend it came from the other. You'll notice on the Microsoft image demos they mostly have solid color edges except for the Planet Hollywood picture which is followed by something so busy you can't really tell where it fits in.
I got the program at least 2 years ago with one of those $50 software bundle
Old, old, OLD news. (Score:2)
This is hardly new technology. Isn't 'deep zooming' what I've been brought up to know as... well... 'zooming'?
I believe there was a demo, around a year ago, that Steve Jobs did at WWDC to demonstrate how Mac OS X had new 64-bit exploitation abilities in Leopard. If I remember correctly, he brought up two copies of the same image, to do a 'race' (32-bit vs. 64-bit). It was a wide shot of a chamber in the Library of Congress, and it was sufficiently detailed that one could zoom in and read the labels on the
Slightly on-topic: Hard-rock demo site (Score:2)
Doesn't seem to work at all. (Score:2)
Deep Zoom? *barf* (Score:2)
Rage? (Score:2)
Pre-cursor video and related GUI designs (Score:3, Informative)
focus-plus context screens are similar http://www.patrickbaudisch.com/projects/focuspluscontextscreens/index.html [patrickbaudisch.com]
Re: (Score:2)
Re: (Score:3, Insightful)
What becomes of silverlight content, whether it's all eye candy or not, is anyones guess. What I can say is, developing for Silverlight 2 kicks ass.
Re: (Score:3, Interesting)
Re:Installing Silverlight (Score:5, Interesting)
It's technically possible to index silverlight 1 content, because it's content is "loose Xaml files", which means the site has xml files alongside html/js/etc, that is rendered by the silverlight 1 engine.
Silverlight 2 has the same capabilities, but noone will use them, because using C# for application/interaction logic is way more productive than using Javascript. Silverlight 2 sites using C# have the following structure
SomeSite.XAP (zip file containing all code and assets)
- AppManifest.xml
- ApplicationCode.dll (.NET Assembly containing Entrypoint and embedded assets)
- SomeResources/ (compressed folder)
- SomeResources/SomeImage.jpg (...)
AdditionalContent.XAP (supplemental resources and code)
- AppManifest.xml
- SupplementalCode.dll
This makes silverlight 2 apps and content updates really easy to, but are a barrier to extract information.
In both cases the information gained isn't nearly as useful as textual html content, and completely different heuristics would be necessary to analyze the importance of one unit of textual content vs another. Indeed, nearly all the visual cues (The relative position, color, highlights, animations, and reactions to the user) would likely be lost in the process. Perhaps the search engine that can index flash and silverlight content is one that analyzes both visual and textual content.
Re: (Score:3, Informative)
It's technically possible to index silverlight 1 content, because it's content is "loose Xaml files", which means the site has xml files alongside html/js/etc, that is rendered by the silverlight 1 engine.
Silverlight 2 has the same capabilities, but noone will use them, because using C# for application/interaction logic is way more productive than using Javascript. Silverlight 2 sites using C# have the following structure
Well, the XAML (markup GUI, and what's probably interesting to index) and code are still in different files. A developer can choose to put the XAML outside the .xap and the code in it. And if it's an unencrypted .xap (as most would be), it'd be easy enough for a search engine to look inside the .xap to find the XAML.
Searchability of XAML is definitely something we're working on, and have guidelines for how to develop apps that are easily searched and index.
Silverlight 2 Beta 2 released today (Score:3, Informative)
Runtime and SDK downloads and lots of other info about it here http://silverlight.net/GetStarted/ [silverlight.net]
Re: (Score:2)
It's either music notation software [sibelius.com], or the Governor of Kansas [wikipedia.org].
I'm guessing the first one.
Re: (Score:2)
Re: (Score:2)
It's a plugin required by some sites to download sheet music or to view the first page before you buy it.
It's a major pain in the ass, it worked the first time I used it then upgrades didn't work and so on and so forth. I'm at the point where I don't care.
It's easier to find sheet music for free than it is to purchase it.
Re:Installing Silverlight (Score:4, Informative)
It has become pretty popular in the past 5ish years since its learning curve isn't nearly as steep as its main competitor Finale.
People criticize Sibelius since, typically (at least for the versions I've used), its output isn't exactly professional quality.
It is, however, a great tool for music students.
Back in the day, Finale was the only option for amateur composers to produce professional looking manuscripts.
I'm not sure how far Sibelius has come in the last few years, so things might have changed.
Re:Installing Silverlight (Score:4, Informative)
Re: (Score:3, Informative)
Yes, I know, Lilypond works in Windows.
You try teaching 120 computer illiterate musicians how to use it
Sibelius is popular because it's relatively easy, and it runs on Windows (so it's relatively easy to install/manage for its user base).
Re: (Score:2, Informative)
You can still take screenshots and stitch them back together, but that's obviously a pain in the ass.
The plugin itself tends to be unreliable, it often bombs without delivering the goods, while still counting as a print/view and thus often locking
Re: (Score:2)
Re: (Score:2, Funny)
Re:This is not new... (Score:5, Informative)
Photosynth is not Deep Zoomm. Photosynth reconstructs 3D models from collections of 2D photos of the scene acquired from different positions and angles. And as far as I know, Photosynth wasn't an acquisition - it was produced by Microsoft Research.
Deep Zoom was an acquisition, but it was the technology formerly known as Seadragon. It's completely unrelated - Deep Zoom/Seadragon is a 2D thing.
And it's an acquisition, but so what? Ooh, naughty Microsoft - how dare they take exciting technology developed by a startup and put it in the hands of millions of users? Shocking! Clearly it they should have left it to sink in obscurity.
Re: (Score:2)
Either way, this is old news with new branding. How they expect to capitalize on this is my question, perhaps I should RTFA. This seems like a prime candidate for open source, though: not a killer app in itself, but the technology could probably become part of a number of interesting applications.
Re:This is not new... (Score:5, Informative)
Re: (Score:2)
Re:they stole it from blade runner and csi (Score:4, Funny)
Re: (Score:2)
An image sensor in a camera is made up of Light dependent transiters which biasness changes depending on light exposure (magnified by a lens of course). Now add a laser distance measurer [dimetix.com] along with each pixel (crazy idea, I know) which would enable you not only to capture the color of each pixel but also its distance. This will record a 3d image which would be, using two of more sensors, walked around like in bladerun
Re: (Score:2)
embrace, extend, extinguish.
You're absolutely right - especially when you contrast with the way Apple supports [slashdot.org] their own pre-Intel computers.
Re: (Score:2)
Re: (Score:2)
"Leopard drops support for slower G4 and all G3 processors".
It's just a matter of time.
Re: (Score:2)
Re: (Score:2)
If Microsoft and some websites don't want viewers, fine. Their loss.
I think it's reasonable to only support the current and previous version of an OS. Microsoft, Red Hat and others all have that model (usually).
Oh well, c'est la vie.