Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Microsoft Technology

Microsoft Demos "Deep Zoom" Technology 272

Barence writes "Yesterday, during a presentation for this year's Imagine Cup, Microsoft's Mark Taylor demonstrated the company's Deep Zoom technology to appreciative gasps of admiration from the computing students present. It's pretty impressive stuff, and you can try 'deep zooming' for yourself at the Hard Rock Memorabilia Site." Unfortunately the demo requires the Silverlight plugin and the story is pretty thin on technical details. I would be interested to see how they captured the image data to that level without massive pixelation.
This discussion has been archived. No new comments can be posted.

Microsoft Demos "Deep Zoom" Technology

Comments Filter:
  • Imagine Cup (Score:3, Insightful)

    by suso ( 153703 ) * on Friday June 06, 2008 @04:47PM (#23687241) Journal
    When I read Imagine Cup, I did a double take. Back in the 90s, Impulse, the company that made the popular 3D software Imagine, had a program called "Imagine CUP", which stood for Imagine Constant Upgrade Program. It allowed users to pay for the upgrade to Imagine up front and they could receive all the minor versions inbetween the major versions.

    So is this digital zoom stuff like the software that they "download off the internet in CSI: Miami" *Snicker*
    • Re: (Score:3, Interesting)

      by dave420 ( 699308 )
      No, this is the software they use on CSI (NY, at least). You can read a few articles about it starting here [msdn.com].
    • Re:Imagine Cup (Score:4, Informative)

      by electromaggot ( 597134 ) on Friday June 06, 2008 @06:55PM (#23688847)

      ...how they captured the image data to that level without massive pixelation.
      It's not that impressive. You zoom in extensively and it just gets fuzzy. So big deal: they just interpolate the color values between each pixel "point" instead of drawing huge square pixels.

      I was much more impressed with PicLens [piclens.com].
      • Re:Imagine Cup (Score:4, Interesting)

        by ozmanjusri ( 601766 ) <aussie_bob@hoMOSCOWtmail.com minus city> on Friday June 06, 2008 @10:43PM (#23690421) Journal
        they just interpolate the color values between each pixel "point" instead of drawing huge square pixels.

        It's not a new interpolation algorithm.

        It's a live version of the The shift-and-add method or image-stacking technique used by astronomers for decades. It's just that now computer hardware is fast enough do it seamlessly.

        Basically, the zoom is made from hundreds of still photographs taken from different vantage points. There was something similar being done with tourist destinations, if I remember correctly.

        It's an interesting toy, but the practical applications are limited by the lengthy production process.

    • Comment removed based on user account deletion
  • by katterjohn ( 726348 ) on Friday June 06, 2008 @04:50PM (#23687287)
    seen CSI? This technology is so passe.
    • Re: (Score:3, Informative)

      by dave420 ( 699308 )
      It's the actual software they use on CSI. Read more here [msdn.com].
      • Re: (Score:3, Interesting)

        by Eudial ( 590661 )

        It's the actual software they use on CSI. Read more here [msdn.com].

        I think he refers to the software in which they miraculously rotate a single two dimensional image to see stuff from other angles, or enhance gritty 320x200 CCTV images into uber-high resolution with no artifacts or fuzziness.

        (Might have been in some other forensics/cop show they did that, though.)

        • by dave420 ( 699308 )
          Oh I know - I always get a geeky laugh out of those shows. I just wanted to point out that they use this stuff on the show's set to get the desired effect of zooming into a picture, that's all. :)
        • by billcopc ( 196330 ) <vrillco@yahoo.com> on Friday June 06, 2008 @06:08PM (#23688307) Homepage
          (Blade Runner)

          Load photo image.

          Enhance.

          (zooms in)

          Enhance.

          (pans around some obstacle)

          Enhance.

          (pans to the back door, opens the door?, reads license plate from some car a half-block away)

          Enhance.

          (finds intelligent life in Arkansas)
        • Re: (Score:3, Informative)

          "...or enhance gritty 320x200 CCTV images into uber-high resolution with no artifacts or fuzziness."

          Depending on the footage, this is semi-possible. There's software out there that can watch the motion of an object and determine what the sub pixels were. It's not ideal in every scenario (even less likely slow in the case of a blurry face on a security cam...), and it won't be as snazzy as CSI, but it is possible in a general sense. It only works, though if it can get actual motion vectors from the footag
    • by awtbfb ( 586638 ) on Friday June 06, 2008 @05:47PM (#23688049)
      I don't watch much TV, but the functionality is awfully similar to GigaPan [gigapan.org].
  • DeepZoom (Score:4, Informative)

    by digitalgiblet ( 530309 ) on Friday June 06, 2008 @04:50PM (#23687291) Homepage Journal

    My understanding is that you use different resolutions of the photo. The original photo is obviously the highest res you can have, but you can make successively lower res copies. More or less just bring up a a higher res version when the user clicks.

    I saw this demoed at the Atlanta Code Camp [atlantacodecamp.com] back in March. Very cool to watch.

    • Multiple resolutions (Score:3, Interesting)

      by clarkn0va ( 807617 )

      My understanding is that you use different resolutions of the photo.
      Just speculating here (I don't anticipate installing Silverlight for another 24 years or so), but I think you're on the money. It should work something like Google Earth, where the resolution is improved progressively as you zoom in.

      db

    • Re: (Score:3, Informative)

      by Bozzio ( 183974 )
      You sir, are right.

      Well, at least that's what it looks like when you use their silverlight app.
      You can actually spot the "seams" when zooming. Some of them aren't even superimposed correctly, leading me to believe that they are using a series a pictures taken with different cameras, instead of just storing lower resolution copies of the master image.
    • Close, but not really. The original highres image is stitched together from a bunch of normal magapixel photographs. Then they layered a nice UI on top to zoom to various levels of detail.

      Last year, Microsoft Research and their collaborators from University of Konstanz had a technical paper at Siggraph on this topic:

      http://johanneskopf.de/publications/gigapixel/index.html [johanneskopf.de]
    • It's a set of different res photos that are all aligned in a special mock up for a canned demo. It isn't actual zooming per se since the postage stamp printing resolution would destroy the detail.

      Setting up a single canned demo is pretty easy to do - relative to applying this "technology" in a wider automated scale.

      Sure it is cool to watch, but so is any special effect.

  • They do this all the time on CSI.
  • oh lordy... (Score:5, Insightful)

    by nuzak ( 959558 ) on Friday June 06, 2008 @04:51PM (#23687301) Journal
    I would be interested to see how they captured the image data to that level without massive pixelation.

    You don't ... you don't actually think that the image data came from one photo ... do you?

    *slaps forehead*
    • Plus they were zooming out. I'm sure they only had the detail in that area. It was probably impossible to zoom into any other portion of the image.
    • Re:oh lordy... (Score:4, Informative)

      by Tarlus ( 1000874 ) on Friday June 06, 2008 @07:30PM (#23689129)

      You don't ... you don't actually think that the image data came from one photo ... do you?
      Nope.
      If you can find them, zoom in on those Beatles bobble heads that the article describes. They're very highly defined. Then zoom out a bit and scroll around to (for example) the surrounding Hard Rock Cafe frame. Wonderfully blurry with respect to the bobble heads.

      As you zoom out further, you'll notice how the "container" holding those bobble heads antialiases itself differently from the surrounding different-res artwork.

      If you move amongst the different images of guitars and clothes (etc) you'll notice in the lower right that it identifies who the centered item belongs to.

      So it appears to me that this is a number of different graphical objects that can be zoomed at relatively different distances at the same time. And it looks like they can be embedded within each other.
  • Unfortunately? (Score:4, Insightful)

    by bigdanmoody ( 599431 ) on Friday June 06, 2008 @04:53PM (#23687321)

    Unfortunately the demo requires the Silverlight plugin...

    A Microsoft tech demo requires the installation of new Microsoft software to view? Who would have though?

    While Silverlight might never be as widely-supported as Flash, I hope that perhaps the competition might force Adobe to do something about the CPU hog that is Flash.

    • Re: (Score:2, Insightful)

      Your joking, right?

      It's Intel/AMD that are expected to fix this issue, by making huge leaps in processing grunt.

      I mean, you can not expect the current crop of programmers to actually write (or even just optimise) fast code.

      We now have many touted languages that are actually interpreters, not compilers. The argument from programmers is that today's CPU's are fast enough, and that these "new" languages are much easier to debug. They may as well be written in Visual Basic in my opinion!
    • Re: (Score:2, Interesting)

      by dodgedodge ( 166122 )
      The beauty of Silverlight is the number of languages you can use. Even Ruby is coming to it.

      Silverlight is a browser plugin. It takes all of about 10 seconds to install.

      Flash needs to just go away.
  • Maybe not CSI (Score:5, Insightful)

    by decowboy ( 1083777 ) on Friday June 06, 2008 @04:53PM (#23687327)
    But how is this different different from google maps (or live maps, or WHATEVER allows you to zoom out a lot)..
    • Re:Maybe not CSI (Score:5, Informative)

      by Anonymous Coward on Friday June 06, 2008 @05:14PM (#23687609)
      A major difference is the inclusion high resolution collections, which are not fixed at runtime and can be rearranged programmatically. I know this because that is what we did on the Hard Rock Memorabilia project.

      Aside from that, it is another form of a "tile server" application... Just one that happens to be rather easy to use from a development perspective, and one that has been done really well (Zoomify/AJAX-based solutions don't hold a candle to the tile stitching and easing effects built into the MultiScaleImage control, IMHO).
    • How about...I dunno...resolution? When I use Google maps it doesn't allow me to zoom down to the level where I can count the blades of grass in my lawn. The Hard Rock memorabilia site has resolution so high that you can see a fingerprint on Bo Diddley's guitar. On the Internet...that's pretty amazing.
      • Re: (Score:2, Insightful)

        by decowboy ( 1083777 )
        How about...that depends...on the source material? That Hard Rock site does have some pieces with amazing detail, but the stamp (that the article refers to) has clearly been added to the image, because even normal print doesn't have that detail.
      • Google provides the same 'zoom down to the highest res image they have and then back out again' technique. Seams, tile loading, and all.

        Just like this over-hyped software does.

        The only 'newness' is not using a fixed high-level map. That probably took a few lines of SQL (or Transact-SQL depending on the source database.)

        Yawn .... nothing new to see here. Just a lame excuse to get me to install Silverlight.
  • SeaDragon (Score:5, Informative)

    by Dragonshed ( 206590 ) on Friday June 06, 2008 @04:56PM (#23687379)
    Silverlight's MultiScaleImage control (aka deep zoom) is a version of the SeaDragon renderer. The image format it uses is a custom tree structure that contains pixel details relevant to both it's position in the tree and relative to it's peers. Essentially, it's a hierarchical image with very smooth transitions.

    Silverlight: silverlight.net
    SeaDragon: http://labs.live.com/seadragon.aspx [live.com]
    • by fm6 ( 162816 )
      Does it matter whether you're black or white [youtube.com]?
    • Re: (Score:3, Insightful)

      by CODiNE ( 27417 )
      Thanks for the link, with it I'm starting to see MS' new strategy to compete with Apple stealing their "cool".

      The Seadragon team is currently tuning its DirectX implementation, making the most of the new Windows Media Photo format, and cranking on the Photosynth Technology Preview.


      So they're essentially recreating Apple's Quartz + OpenGL + standard image formats with Photosynth + DirectX + WMPF.
      • Re: (Score:3, Interesting)

        by Dragonshed ( 206590 )

        So they're essentially recreating Apple's Quartz + OpenGL + standard image formats with Photosynth + DirectX + WMPF.

        Simply put, apple does an incredible job visually representing itself, it's technology and providing a user experience that is very hard to match.

        That said, I disagree that microsoft is recreating any preexisting technology. You could argue that DirectX is just like OpenGL, but that's likely grossly oversimplified.

        Photosynth and Seadragon are demoed here: http://www.youtube.com/watch?v=vHsYnkLnepk [youtube.com]

        Neither of those are similar to things that apple has done.

  • layered bitmaps (Score:4, Informative)

    by Brit_in_the_USA ( 936704 ) on Friday June 06, 2008 @04:56PM (#23687385)
    The Beatles models and signatures pear to be the highest level of detail unless there are other "Easter eggs". That level of zoom on any surrounding areas is pixelated. They have stacked multiple high res photos at various scales in this particular area.
  • by MythMoth ( 73648 ) on Friday June 06, 2008 @04:59PM (#23687421) Homepage
    Ian Griffiths [interact-sw.co.uk] implemented a deep zoom for the BBC [bbc.co.uk] in their Big Weekend festival. Rather pleasingly they chose to call it the "Big Zoomy Thing" in a nice bit of anti-jargon.
    • Re: (Score:3, Informative)

      That one is also using Silverlight. Except they botched the version check, and so it won't work if you have a later version than the one they coded for. Oops.
  • Sounds like GigaPan (Score:2, Informative)

    by higgins ( 100638 )
    The folks at CMU have a similar thing:

    http://gigapan.org/

    It uses a (cheap) commodity digital camera, combined with a smart tripod, good photo stitching software, and a nice Flash UI to give you highly zoomable panoramas. The CMU thing has been around for a while --- over a year at least, plus I'm pretty sure you can get one of the tripod mounts if you participate in the beta and create your own.
  • Deep Ream (Score:3, Funny)

    by Foofoobar ( 318279 ) on Friday June 06, 2008 @05:02PM (#23687459)
    There next product for stealing your checkbook while Windows does a colonoscopy
  • by Itninja ( 937614 ) on Friday June 06, 2008 @05:04PM (#23687481) Homepage
    I admit the demo is neat and all, but they are not really zooming into the same image. They have just developed a way to quickly load the high resolution image on the fly. Kind of like how Google Maps will deliver a higher res map when you zoom in; but this is happening much faster.
    • Re: (Score:3, Insightful)

      by xtracto ( 837672 ) *
      Kind of like how Google Maps will deliver a higher res map when you zoom in; but this is happening much faster.

      Kind of like what happens when you use Google earth very close (i.e., in-situ) to where the servers with the data are stored.
  • Comment removed (Score:4, Informative)

    by account_deleted ( 4530225 ) * on Friday June 06, 2008 @05:04PM (#23687485)
    Comment removed based on user account deletion
  • WTF? (Score:5, Insightful)

    by cptdondo ( 59460 ) on Friday June 06, 2008 @05:07PM (#23687529) Journal

    If you're like me and a bunch of very smart students, you can't fail to be impressed.
    I must be dumb.... Stiching together an image of higher-res photos might be a technical wow, but sorry, I'm not really impressed. This sort of thing I might expect from a college lab, but for a multi-billion dollar company to present this as some sort of earth-shaking innovation?

    • Re:WTF? (Score:4, Interesting)

      by Dragonshed ( 206590 ) on Friday June 06, 2008 @05:57PM (#23688155)
      The earth-shaking innovation is in the form including deep zoom as part of a plugin featuring a fast 2d compositor with video decoding and animation support, common RIA application components and controls using a small .NET Runtime, packaged in a 4.3mb download, "installed in 20 seconds or less", and all of it designed to run on multiple platforms.

      MS Devs have done some amazing things within their allotted size quotas. /perspective-and-koolaid
  • by prakslash ( 681585 ) on Friday June 06, 2008 @05:07PM (#23687537)
    There is a bit of a misdirection in articles and other material about Deep Zoom.

    Most people go ooh and aah because they (wrongly) assume that it zooms into normal resolution photos .

    It doesnt (because as you and I know, it physically can't).

    Deep Zoom does NOT perform CSI/CIA-style photo enhancement. If you dig deeper, you will find that what Deep Zoom is intended for is to enable one to focus on a smaller portion of a giga-pixel photograph so you do not have to download the whole photograph.

    Think of it like a hierarchical smooth slicing of a large high resolution photograph and only downloading those "planes" and "sections within a plane" that the user is interested in seeing.

    Interesting technology but not magic.

    • by dave420 ( 699308 ) on Friday June 06, 2008 @05:34PM (#23687865)
      They actually use SeaDragon (the name of the technology) on CSI, for those sections you're talking about. Obviously they lie about what it's doing, but that's the software you see.
    • Re: (Score:3, Interesting)

      by markana ( 152984 )
      And exactly how is this different from the wavelet-compressed MrSid format? LizardTech was doing this sort of "download-what-your're-focused-on" multi-resolution zoom *years* ago. Six years ago I could zoom in smoothly and deeply to an area of a multi-GB image, and the plugin would grab only those pixels needed to show that area at that resolution.

      So what exactly is new here, except for the use of Silverblight?
  • No free lunch (Score:5, Informative)

    by icebike ( 68054 ) on Friday June 06, 2008 @05:09PM (#23687549)
    There is two ways to get this level of zoom to work:

    1) have the pixels in the first place
    2) having more pixels in the first place.

    Anything else is a fundamental violation of the laws of physics and math. You simply can not fake what you don't have without it being exactly that: a fake. There is no storage printing technology which could accomplish this level of zooming, and they carefully do not say that this is actually a continuous zoom of a picture on a stamp.

    Deep Zoom works by letting you meld several images in such a way as pretend its one image.

    Basically, its a con-job of transitioning several different images, where one is a re-photograph of sub portion of the original.

    The implication of the article is that this is all one image containing a nearly infinite level of detail, which it most emphatically is NOT.

    The author is probably equally impressed by street corner magic tricks.

    • Re:No free lunch (Score:5, Insightful)

      by vux984 ( 928602 ) on Friday June 06, 2008 @05:25PM (#23687749)
      Deep Zoom works by letting you meld several images in such a way as pretend its one image.

      That's still very useful.

      Basically, its a con job of transitioning several different images, where one is a re-photograph of sub portion of the original.

      'con job' has needless connotations of an intent to deceive.

      The implication of the article is that this is all one image containing a nearly infinite level of detail, which it most emphatically is NOT.

      No. The implication of the article is that you can provide this as a user interface, which is very cool. Google Earth isn't interesting because its a 'con job' to let us think we can zoom in and out of a single monster image of the planet. Its interesting because its a natural and convenient UI to use.

      And we don't have to download every single pixel of every single higher res image of a tree in Nigeria to have a closeup look at a parking lot in London. Detail is loaded on the fly, as needed, while the user gets a 'seamless' and comparatively low bandwidth experience.

      Its not particularly new as an idea. Or even as an implementation. But maybe Microsoft's tools make setting it up substantially easier, and that alone would be a nice bit of progress.

      The author is probably equally impressed by street corner magic tricks.

      I am impressed by street corner magicians too. Not because I think they're magical, but because I am impressed at their showmanship, sleight of hand, dexterity, and general ability to appear magical.

    • Sure you ... if you know something about what you're ....... at.

      They analyse the data properties of the camera and see what data is rejected and where data is averaged, this is very intensive and esoteric research and not something that's obvious.

  • I don't get it (Score:5, Insightful)

    by Mike1024 ( 184871 ) on Friday June 06, 2008 @05:14PM (#23687607)
    Don't we already have the ability to process multi-resolution images in, for example, Google Maps? You know, zooming in and out images with large total resolution?

    It would be impressive if the photo they demonstrated on was anything but a photoshop, but given that the 428x134 signature is 52x11 in the 350x237 statuette picture which is 29x26 in the 428x350 hard rock picture which is 87x87 in the 428x399 stamp picture, for the stamp to be real would require a 33 gigapixel stamp (which, at 1 inch square, would be printed at 33,000,000,000 DPI).

    To me zooming in and displaying a different image isn't really as exciting at the article author makes it sound? Maybe I'm missing something because the journalist sounds pretty damn excited about it.
    • Comment removed based on user account deletion
      • Re: (Score:3, Interesting)

        by prockcore ( 543967 )
        what you're missing is that seadragon constructs these things analytically from a collection of photos.

        Basically I can run around taking random photos some zoomed in, some not and seadragon will automatically stitch it all together.
  • by statemachine ( 840641 ) on Friday June 06, 2008 @05:14PM (#23687611)
    But the viewer is 126G.
  • by oborseth ( 636455 ) on Friday June 06, 2008 @05:28PM (#23687785)
    It crashed Firefox 3.0 on my Mac Book after installing the plug in and viewing the demo.
  • are there others reading /. that are thinking.. Yawn, wake me when it's working like Google maps only much better?

    I realize that it's new, and takes effort, but I can't break out the oohs and aaahhhs just yet. No matter how good it is, is it worth upgrading for?

    Car analogy: Isn't this like demonstrating a concept car that they intend to put into production, but production will be a little bit different?

    I'll wait for SP2 (or equivalent), thanks very much.
    • It took less time for me to install it and play with it to find out for myself than it did for you to write out that comment. Why not just try it instead of immediately complaining?
  • Deep Ripoff (Score:3, Interesting)

    by jdb2 ( 800046 ) * on Friday June 06, 2008 @05:33PM (#23687845) Journal
    This is a lame "embraced" and "extended" version of an old demo effect that was first demonstrated in the early 90's, if not earlier. Obviously the entire "zoomed-out" image is not stored. The "zoomed-in" images are stored, but to make the effect work a series of intermediate images has to be stored between each "zoom stage". For example, in one implementation, when "zooming" through the intermediate stage between a "larger" and "smaller" image, for each series of frames an "outside" or "boundary" image is stored in full resolution and that image is zoomed ( and clipped against the view port boundaries ) until it is outside the view port while at the same time the "internal" image is enlarged until it fills the viewport and then the process is repeated again with the "internal image" now consisting of the next "boundary" image surrounding another "internal image".

    Go to Pouet [pouet.net] and you'll find many demonstrations of this effect.

    jdb2

  • by Animats ( 122034 ) on Friday June 06, 2008 @05:35PM (#23687875) Homepage

    See Charles and Ray Eames' Powers of Ten [youtube.com]. Now that's a zoom.

    As for doing it in real time, Keyhole (bought by Google and renamed Google Earth) was doing this on PCs five years ago. Any decent GPU can do this today, and you can download Google Earth to see it.

    I saw one of the first systems able to do this in real time about 25 years ago. It was inside a classified tank at a major aerospace firm, and required a rack of special-purpose hardware. The user interface was beautifully simple - a big trackball (for pan), a lever (for zoom), and a knob (for rotation).

    Even Microsoft's little film isn't original. That technique has been used a few times in commercials.

    So Silverlight doing this isn't exactly a big "wow" development.

  • Prior art (Score:3, Informative)

    by joeslugg ( 8092 ) on Friday June 06, 2008 @05:53PM (#23688125)
    As I'm reading the descriptions and seeing it on YouTube, I'm thinking I've SEEN something like this before.
    And I finally remembered; Jef Raskin's [wikipedia.org] "Humane Interface".
    Zooming demo from several years ago that runs in Flash here. [raskincenter.org]

    Quite similar, IMHO. Hmm?

  • Fotomagico photo zoom transition, notice the video demo at the top of the page: http://www.boinx.com/fotomagico/overview/storytelling/ [boinx.com]

    The real trick is finding an image with boring enough edges that you can pretend it came from the other. You'll notice on the Microsoft image demos they mostly have solid color edges except for the Planet Hollywood picture which is followed by something so busy you can't really tell where it fits in.

    I got the program at least 2 years ago with one of those $50 software bundle
  • This is hardly new technology. Isn't 'deep zooming' what I've been brought up to know as... well... 'zooming'?

    I believe there was a demo, around a year ago, that Steve Jobs did at WWDC to demonstrate how Mac OS X had new 64-bit exploitation abilities in Leopard. If I remember correctly, he brought up two copies of the same image, to do a 'race' (32-bit vs. 64-bit). It was a wide shot of a chamber in the Library of Congress, and it was sufficiently detailed that one could zoom in and read the labels on the

  • Anybody else catch the thank-you letter from Paul McCartney to a cop who served as his bodyguard in Miami (it's a little more than halfway down on the left side)? Four pages! How cool is that?
  • I have FF 2.0.0.14 and all I get is a black screen at http://memorabilia.hardrock.com/ [hardrock.com]
  • I saw a video of this demo from TED2007 when it was called Photosynth, formerly Seadragon. Would it have killed Microsoft to have kept either of these much cooler names? I don't know which clueless marketing droid came up with this incredibly lame moniker, but he should be strapped into one of Ballmer's chairs when Uncle Fester is having his daily rant against Google/Apple/Linux.
  • Doesn't the next id technology used for the game Rage do this on any surface?
  • You can watch a Seadragon presentation from TED at http://www.ted.com/index.php/talks/view/id/129 [ted.com]
    focus-plus context screens are similar http://www.patrickbaudisch.com/projects/focuspluscontextscreens/index.html [patrickbaudisch.com]

THEGODDESSOFTHENETHASTWISTINGFINGERSANDHERVOICEISLIKEAJAVELININTHENIGHTDUDE

Working...