Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Photosynth Demo

Posted by ScuttleMonkey on Wed Jun 06, 2007 03:45 PM
from the wow-just-wow dept.
A couple of days ago Microsoft labs released a demo of their new Photosynth software on the web. Photosynth allows the aggregation of social picture networks (a la Flickr) into a completed image in addition to allowing a level of depth to image browsing previously unavailable. There is also a very impressive video of the demo available.
+ -
story

Related Stories

[+] Science: Microsoft, NASA Allow For 3D Shuttle View 84 comments
C|Net reports that a 3D software version of the space shuttle Endeavor is in the works, thanks to a collaboration project between Microsoft and NASA. The Photosynth viewer will allow fans of the space program an unprecedented level of detail in examining the shuttle and its surrounds at the Kennedy Space Center. ""It's much like a 3D video game--people can explore, walk around or fly around the shuttle," said Adam Sheppard, group product manager for Microsoft Live Labs, which developed the viewer. NASA said that the project could lead to more initiatives with the software giant. Chris Kemp, director of strategic business development at NASA's Ames Research Center, said that, for example, NASA could use the Photosynth technology on future space missions for activities such as inspecting the International Space Station and viewing landing sites on the moon."
[+] Microsoft Moves in on the Graphics Market 237 comments
Ian Lamont writes "Microsoft has quietly been building up graphics-related R&D, reports Computerworld, noting that Microsoft employees will be presenting one out of every eight papers at SIGGRAPH 2007. And it's not a fluke — other recent Microsoft graphics-related developments include Photosynth, which has been discussed on Slashdot several times, as well as the Silverlight/Expression Studio graphics suite, which will compete with Adobe's Flash/Illustrator/Lightroom/Dreamweaver offerings. At SIGGRAPH, Microsoft will supposedly have demos of some new software including image deblurring tools and Soft Scissors, which 'solves the vexing problem of how to cut and paste an image from one background to another if the image's edges — hair blowing in the wind, blades of grass — are very complex.' Microsoft's competitors aren't sitting down. Adobe's CEO, calling Microsoft a '$50 billion monopolist,' has questioned whether Silverlight will be compatible with non-Windows operating systems, and Google has also been building up its own graphics-related software products, such as the 3D modeling tool SketchUp, and Google Earth."
[+] Photosynth Team Does It Again 144 comments
STFS found an update to the Photosynth stories that we already ran. You might remember the amazing photo tourism demos. Well, this new version kicks things up several notches with paths and color correction to more smoothly transition between photos taken in different lighting conditions. As before, this stuff is worth your time. Check it out.
[+] Microsoft Releases Photosynth 247 comments
Spy Hunter writes "Photosynth has graduated from a 'tech preview' to a complete service. Now you can upload your own photos and have them automatically transformed into a 'synth': a 3D fly-through reconstruction of your home, your vacation, or anything else you can take pictures of. Learn more about Photosynth at the official blog, see what Walt Mossberg has to say about it, or just go try it out right now." According to Mossberg, Photosynth works on PCs using IE or Firefox, but not yet on Macs. We've been discussing Photosynth since its introduction.
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • but I couldn't... 30 seconds of ads at the beginning, then the phrase "through an aquisition".

    typical microsoft "innovation"
    • Enh, so it wasn't Microsoft that did the innovative work. It's still a damn impressive demo (although you know what they say about demos...); you're missing out. (Ad was annoying, though.)
        • Re: (Score:3, Informative)

          You must not have seen the whole thing. The cathedral was assembled from images available from the internet taken by hundreds of different people and cameras.
          • Re: (Score:3, Interesting)

            Which were then manually screened to weed out the crap ones.
          • Re: (Score:3, Interesting)

            A friend of mine asked, "Doesn't that violate about a billion copyrights?"

            I shrugged. Can someone take my photos on Flikr and use them to create new content without my approval?

    • Re: (Score:2, Informative)

      by Anonymous Coward
      Better Link to the video demo.
      http://www.ted.com/index.php/talks/view/id/129 [ted.com] Click Here
          • ...watched the whole thing now... couldn't find the aformentioned "good part" anywhere.
          • Also, the video on the TED site can be enlarged and I believe is higher resolution.


            TED [ted.com] is definitely a site worth visiting away, as this presentation is probably among the less interesting ones you can watch there. More people should check it out.

    • by Bananatree3 (872975) on Wednesday June 06 2007, @04:14PM (#19416677)
      I decided wade through the hype/ads/blah, and came across a really cool piece of software. It takes thousands of flickr images stitches them into a 3-dimensional mosaic, all just through software. No special on-site 3d imaging hardware, just a program compiling everyday images of something. It does this through some very advanced image recognition. If you can brave the ads, it IS worth it.
      • Sounds like an application of autostitch [cs.ubc.ca]. The downloadable demo version is pretty neat and fun to play with, if you have overlapping scenery photos, for example.
        • Re: (Score:3, Informative)

          No, it's far more advanced than that, as its recognition is able to match objects that are not directly from the same set of photos, or even all photos, some can be diagrams or drawings for example.

          The part that blew me away is the SeaDragon technology behind the image/information scaling portion of things... now that is just incredible... check out a talk/demo at TED on March of 2007 by Blaise Aguera y Arcas of Microsoft [videosift.com], just amazing stuff.
      • Re: (Score:3, Insightful)

        "Windows XP SP2 and Vista Only

        The Photosynth technology preview runs only on Windows XP SP2 and Windows Vista.

        If you feel you've reached this message in error, you can try anyway."

        Wow, another innovative product from Microsoft.
      • Re: (Score:3, Informative)

        Microsoft also worked with the BBC [bbc.co.uk] to produce this collection [live.com] of Photosynths of several well known places in Britain.
          • Re: (Score:3, Interesting)

            Yep. I run Windows 2003 Server at work and it doesn't work on that either. I am pretty sure the Photosynth team wants it to run on more platforms. This is still a new product that is barely out of the research stage.

    • Yes, they funded this innovation by buying equity.

      C'mon, learn how it works. It's the system you live in.
          • Re: (Score:3, Interesting)

            I did WTFA, and I obviously paid more attention than you did, because the most interesting part of the demo is preceded by a comment along the lines of "I'd like to jump straight to one of Miller's original datasets and this is from an early prototype of Photosynth that we first got working this summer" ... I repeat myself again, they developed the cool technology AND THEN AFTERWARDS it was bought into by Microsoft.
  • Huh? (Score:3, Funny)

    by DragonWriter (970822) on Wednesday June 06 2007, @03:50PM (#19416367)

    Photosynth allows the aggregation of social picture networks (a la Flickr) into a completed image in addition to allowing a level of depth to image browsing previously unavailable.


    That appears to be syntactically tolerable English. Semantically, though, WTF?
    • Let me translate.

      Pretty pictures.
    • Re:Huh? (Score:5, Informative)

      by RealGrouchy (943109) on Wednesday June 06 2007, @04:03PM (#19416529)

      Photosynth allows the aggregation of social picture networks (a la Flickr) into a completed image in addition to allowing a level of depth to image browsing previously unavailable.
      That appears to be syntactically tolerable English. Semantically, though, WTF?

      This lets you take all sorts of pictures of your room, and will automatically assemble them into a 3D environment. It will assemble your photos to look like an RPG, instead of a slideshow.

      Using the example in the video...there are hundreds of online collections of people's photos of Notre Dame cathedral. Each photo is of a different part of it, from a slightly different angle.

      This software takes all those different photos and assembles them into a 3D representation of Notre Dame cathedral, where you can look at any of the individual photos.

      In addition, if someone identifies one of the saints in a statue on the cathedral, when you take a photo of it and your photo is added to the collection with the software, your photo will also have that saint identified--thereby enhancing the data contained in your photo.

      - RG>
      • Re:Huh? (Score:4, Interesting)

        by timeOday (582209) on Wednesday June 06 2007, @04:18PM (#19416729)
        I don't think this technology has that much to do with social picture networks in particular, I'm not sure using it to index images is all that compelling. What would be more useful is inputting some images from different angles (or a video) and getting back a .3ds texture-mapped geometric model. Reconstruction of gometry from imagery has been a big research topic for ages but I'm not aware of any effective, user-friendly software to do it.
    • You know what's strange is that I thought the same thing before I watched the video demo, and now having watched it, the sentence makes decent sense.

      WTFV.
      • Re: (Score:3, Insightful)

        Yeah, it's pretty decently cool, too. Personally, I thought the magazine and the car ad with highly detailed information "printed" really small was as interesting a concept at anything-- it looked like it might provide a reading experience that would make sense for an online magazine, and the small print bends the concept of your printable space in an interesting way. So long as there are sufficient hints that the tiny text was there, it would allow you to put a lot of information into a small "space".

        Th

  • by L. VeGas (580015) on Wednesday June 06 2007, @03:59PM (#19416491) Homepage Journal

    Photosynth allows the aggregation of social picture networks (a la Flickr) into a completed image in addition to allowing a level of depth to image browsing previously unavailable.
    Slashdot summary entices the accumulated aggravation of social comment communities (a la Digg) into a aggregated juxtaposition while interspersing levels of irritation heretofore unimaginable
  • by kiwicmc (93934) on Wednesday June 06 2007, @04:10PM (#19416619)
    Unlike the first set of posters I managed to get over my self importance and watched a couple of seconds of BMW ads to see the actual video.

    I liked the initial viewing of large quantity of hi-res images and the smooth zoom. The aggregation of many thousand flickr images of the Notre Dame (including one of a poster on a wall) into a 3-D image was fantastic.

    C
  • One step forward! (Score:4, Interesting)

    by Sectrish (949413) on Wednesday June 06 2007, @04:13PM (#19416665) Homepage
    At least now someone at Microsoft seems to know _what_ to buy, this is some pretty amazing technology. I just hope that someday it will be available to other OS'es too.
    • Re:One step forward! (Score:4, Interesting)

      by evohe80 (737760) on Wednesday June 06 2007, @05:19PM (#19417471)
      One thing that amazes me of Microsoft is how, having so many bright people at MS reasearch, most of their stuff is so bad, and/or lacks innovation. (I know part of this came from some other company they bought, but some of it is original from MS, I've read a paper related to this technology).

      Every single paper I've seen from MS research is great. Well done!

      (from someone developing computer vision on linux)
    • Re:One step forward! (Score:5, Informative)

      by TheTranceFan (444476) on Wednesday June 06 2007, @08:53PM (#19419309) Homepage
      Microsoft didn't buy Photosynth. It bought Seadragon. The Photosynth client is indeed built on Seadragon's client, but the idea behind Photosynth (which was a joint University of Washington/Microsoft Research project called PhotoTourism) significantly predated the Seadragon acquisition, and there was a working client. When Microsoft decided to reimplement the client as a technology preview, that's when the Seadragon team and client came into the picture.

      That said, Seadragon's technology is great. It's a fantastically smooth way to browse arbitrarily large images or collections of images, and it was a good acquisition indeed.

      (I was on engineer on the Photosynth team.)
  • by toQDuj (806112) on Wednesday June 06 2007, @04:16PM (#19416697) Homepage Journal
    This zoom-ability of the first part has a lot in common with the ideas behind Jef Raskin's The Humane Environment http://en.wikipedia.org/wiki/Archy/ [wikipedia.org].

    The second part, however, shows marvellous stuff. Especially if what I think he did, was search for patterns in images, and compare those for unique objects to collect a library of images of a single object.

    This guy and supposedly his group shouldn't work for Microsoft in my opinion, but would perhaps feel more at home in a fundamental science laboratory. But I think my opinion on this is slightly partial.

    B.
    • Re: (Score:2, Informative)

      The project was demonstrated on the Research Channel at the beginning of the year.

      Microsoft bought out a company that had written the non 3D part of Photosynth and student(s) at the University of Washington wrote the rest if I remember correctly. At the time they didn't work for Microsoft.
  • by tygerstripes (832644) on Wednesday June 06 2007, @04:39PM (#19417003)
    I can honestly say, without hyperbole, that this is the first time all those promises of what the web can really do - interconnectivity, automatic synaptic contextual linking, user generated content, and god-damned cleverness - have finally come together into something which is un-fucking-believable!!

    All those next-stage, new-wave, super-hyped ideas that generated enough excitement to get a survivable user-base just kind of passed me by, because they only ever seemed to be minor amplifications of what we already had. But this... this is something totally new. And utterly, utterly incredible!

    I'm so excited by this it's making me feel sick! TECHNOLOGY! INTERWEB! I take it all back - forgive me for my lack of faith! I LOVE YOU!

    And by the way, that "content only limited by how many pixels are on the screen" idea has been a long time coming, and I'm deeply happy that someone's solved it. I could never understand why we use raster-imaging for computer games because it's a squillion times quicker than ray-tracing, but nobody had applied the same idea to other applications. Now I feel justified in wondering, and I'm so pleased with the result!

    • by adisakp (705706) on Wednesday June 06 2007, @07:10PM (#19418475) Journal
      I could never understand why we use raster-imaging for computer games because it's a squillion times quicker than ray-tracing, but nobody had applied the same idea to other applications.

      I don't think that basic rasterizing engines are the limit. The limit is that the source data for all these pictures are tens or hundreds of gigabytes (and in the future, conceivably terabytes). Somewhere in the assembly and cross-correlation of all this data, they have to be generating LOD's (levels of detail) and dynamically loading / managing MIP-maps to keep the loaded dataset to a reasonable level. This is the hard part since "reasonable level" for loaded imageset size is probably currently a couple hundred megabytes or much less. You can probably load more data into RAM but try maintaining a 60FPS refresh with a gigabyte of textures - especially on a laptop or basic computer.

      Once you've done this you can use a variety of display techniques... the main reason to use basic texture-mapping / flat rasterization is that sources are photos which are basically a pre-lit "flat" textures.

      However, if you can generate a 3-D model and can separate lighting / color information (perhaps using combinations of day and night pictures or varying lighting from different photographs), it would be then possible to perform simple ray-tracing or other hybrid renderers -- think how cool it would look to have a dynamic artist's sketchpad with these images "penciled" in realtime. There are already high-frame-rate (near-realtime) ray tracing demos already out there for CELL and X86 that render moving images at a lower-res for higher-interactive frame rates and then when not-moving, render high-quality image stills that are quite impressive.
  • by Ided (978291) on Wednesday June 06 2007, @04:45PM (#19417101)
    This software is absolutely amazing, especially when you consider the programmatic side of this. People bashing this without actually watching the video AND playing with the operating demo are really missing out. You don't have to like it but at least have a reason that shows some form of intelligence. Not just "the intro was poorly done".
  • by Lord Satri (609291) <alexandre&leroux,net> on Wednesday June 06 2007, @05:06PM (#19417317) Homepage Journal
    Right here [slashdot.org].
  • Just looking at that (Score:3, Interesting)

    by goldcd (587052) on Wednesday June 06 2007, @05:15PM (#19417413) Homepage
    rather fabulous demo, I realize that that would tie in beautifully with the surface computing MS showed last week (which was lovely as a tech demo with little immediate use).
    Vista is 'nice' but it's just a progression of what we already know - these tech demos give me a big warm fuzzy futuristic feeling inside :)
    If nothing else it shows that MS is innovating again (at last) - Ball's back with Apple and Google now - "Make me more impressed!"
  • Data aggregation (Score:4, Interesting)

    by jemenake (595948) on Wednesday June 06 2007, @05:25PM (#19417539)
    Near the end of his presentation, the guy sums up the technology as taking all of these separate images from various sources on the net and figuring out how they all interlink to present a larger, more coherent picture. He got applause.

    My first thought was about the U.S. government's "total information awareness" project, where they're trying to take lots of separate pieces of info (which are already available to law enforcement) and interlinking them all together to provide a more coherent picture... but most people consider that to be evil.

    Granted, the government isn't doing it with vacation photos, but the idea, of finding pieces of data that are related and finding out *how* they're related, is the same. The difference in people's reaction to it, I can only attribute to the fact that people see the photosynth guy as good, and the government as evil. But I don't agree that the goodness or evilness of an action is solely determined by the goodness or evilness of who's doing it. The U.S. gov't tries this and fails. It expects that it can invade foreign countries and install friendly governments and torture people because it's "the good guys", yet the soviet union did those same things during the cold war and we admonished them for it because they were "the bad guys".

    So, where am I going with this rant? My point is this: You can't blame somebody for connecting the dots. In fact, that seems to be one of the things that we, as humans, are particularly good at. So, if you think that this photosynth thing is fine, then I think you've got to grant that the TIA project is fine. Now, you could argue that some particular bits of information shouldn't be available, but the piecing it together to form a more coherent picture... I can't come up with an argument against it that I consider defensible. Sure, it makes me uncomfortable, but that's not an "argument".
    • Re: (Score:3, Insightful)

      So, if you think that this photosynth thing is fine, then I think you've got to grant that the TIA project is fine.

      Technology is a tool. It is great to use hammers to build houses. It is not great to use hammers to bludgeon people's skulls. In no way does thinking photosynth is fine imply that TIA is fine - the fact that they (may) require the same technology to be possible does not in any way make them morally equivalent.
  • Vast Desktop... (Score:5, Interesting)

    by Slur (61510) on Wednesday June 06 2007, @05:43PM (#19417711) Homepage Journal
    Actually, as I looked at the demo, I couldn't help feeling like all that virtual space was looking like a damn nice desktop environment. Nevermind the part of the demo with a flat-on scrolly-zoomy desktop, as nice as that would be (Seems obvious in a way too... And wouldn't it be nice if Leopard had that instead of "Spaces" ?). But imagine the notion of opening up an application and instead of just popping up a new window it creates a new space - within the desktop virtual space - and brings you into it. You can always pull back and move around to another window or workspace, but while in it you'd be totally immersed.

    I dunno, I just like the notion of immersive environments, especially for conceptual learning. I think we're going to see a prevalence of this kind of interface in the near future.
    • by koreth (409849) * on Wednesday June 06 2007, @03:56PM (#19416465)
      Then you closed the window about 10 seconds before the demo started. Keep watching.
        • Re: (Score:3, Informative)

          Well, I figured there was content after the stupid intro, but seeing how much production value was in the intro, I can't trust the content, there's probably as much production value in it's presentation as there was in it's introduction.

          The people responsible for creating the intro (TED) are just the people responsible for giving the presenter a forum to share their ideas/technologies, don't let it color your impression of the rest presentation or the technology itself too much. The same brief advertisemen

    • by EERac (873862) on Wednesday June 06 2007, @04:06PM (#19416573) Homepage
      This system was demoed a while ago, I think at siggraph. There are some videos on the original university of washington PhotoTourism page. [washington.edu]. Also here's a repost of the video on youTube. [youtube.com]

      Also there's microsoft's page [live.com], which has the demo (I don't think that's new either). It seems to have some longer videos

      Non-newness and marketing hype aside, this software is frickin' awesome. It lets you view and tag photos organized in a 3D environment that reflects where the photos were taken. It should be particularly useful once cameras have GPS built in.

      I imagine the reason the software is still in the demo phase is because it's very difficult to take a large number of photos and reliably figure out where they were all taken from. For the demo purposes, Microsoft probably hand corrected a lot of the placements. Even so, everyone I've shown this too thinks its often (even non-slashdot readers!)
      • This system was demoed a while ago, I think at siggraph. There are some videos on the original university of washington PhotoTourism page. [washington.edu]. Also here's a repost of the video on youTube. [youtube.com]

        Thank you, that youTube link is exactly what I was looking for: A clear video demo of what this is, how it's used, with a nerd voiceover explaining what's going on.
        No frills, no fuss, no slick intro telling me how I should feel about the damn thing, just info.

        • Re: (Score:3, Interesting)

          hmm... I notice that in that demo, it's running on Gnome. But now, Microsoft Photosynth "runs only on Windows XP SP2 and Windows Vista".
      • by TheTranceFan (444476) on Wednesday June 06 2007, @08:46PM (#19419263) Homepage
        I was one of thee engineers that worked on the first release Photosynth. It's a great team, and it was a super fun project.

        I can tell you that we did not tweak any camera positions by hand. The only real "editing" we did was to eliminate pictures that just didn't correlate well, generally because they didn't have enough feature points in common with the rest of the photos. We didn't tweak any camera positions, but the camera positions (i.e. the locations of the orange camera frusta when you have frusta turned on) are a best estimate, which is subject to some error. Same goes for the projection planes.

        What's great about Photosynth is that from the perspective of anyone outside the computer vision community, it appears to be magic. Enough so that lots of the blogosphere was convinced that we somehow "authored" the 3D point clouds. Nope. It's more or less an automatic (albeit somewhat prolonged) process. The hard work is done as a big preproceess, then the client consumes largely precomputed data.

        It'll be cool to see Photosynth in action in BBC's upcoming How We Built Britain piece that was announced on Live Labs [live.com] today.

        I did a video interview [onten.net] about Photosynth a while back which is targeted at a non-technical audience but still might be of interest. (And I wrote the music for the original video [live.com] at Live Labs.)
    • Oh that explains it. I was wondering why I watched 10 seconds of a person looking at a large amount of pictures, like it solved something important.
      • Re:press release (Score:4, Insightful)

        by koreth (409849) * on Wednesday June 06 2007, @04:10PM (#19416617)
        I don't get the point of that part either, but keep watching. A couple minutes into it he moves on to the real meat of the demo, and it's pretty astonishing. I won't spoil it except to say that if I'd seen it in a sci-fi movie I'd probably have dismissed it as very cool-looking but totally unrealistic.
    • Re: (Score:3, Insightful)

      In the interests of openness, would you mind publishing these calculations of yours? I'm sure we'd like to see your quantification of the open-source development process, particularly for software as complex as this evidently is. Thanks.
      • by Otter (3800) on Wednesday June 06 2007, @05:03PM (#19417289) Journal
        In the interests of openness, would you mind publishing these calculations of yours? I'm sure we'd like to see your quantification of the open-source development process, particularly for software as complex as this evidently is.

        • Day 0: Someone registers a project on Sourceforge, commits main.c to CVS.
        • Day 3: It's noted on Digg, Reddit
        • Day 30: Slashdot links to project, hails it as "the Photosynth killer", misspells project leader's name. Commenters gloat about M$'s lack of innovation, speculate on the throwing of chairs in Redmond, argue about atheism and gun control.
        • Day 33: Slashdot dupes story, misspells "Photosynth", "killer" and "the".
      • Fuck Microsoft. Fuck them and their can't-beat-em-then-buy-em business model. Fuck their complete lack of innovation, and fuck the douchebags who sell out to them.
        So what you are really trying to say is that you thought the demo was cool. Hey man don't hold back the love.
      • Re: (Score:3, Informative)

        Dude, did you watch the video? The acquisition the guy mentioned was the first part - the zoom in and out and pan around lots of images. That was the "meh" part.

        The cool part... the part where they constructed a 3D model of Notre Dame by using only photos from Flickr, well the Photosynth page says where that came from: "Photosynth is a collaboration between Microsoft and the University of Washington based on the groundbreaking research of Noah Snavely (UW), Steve Seitz (UW), and Richard Szeliski (Micros