Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Google Experiments With Local Filesystem Search

Posted by timothy on Wed May 19, 2004 01:08 PM
from the better-than-what's-built-in dept.
Teoti writes "No, Puffin is not the next name of your favorite email client, but, according to the New York Times (NSA reg. req.), the project codename for a new Google search application coming directly into your desktop, that will let you search your local filesystem efficiently. This is different from, but complementary of, the Google DeskBar that already lets you search the Web. The article also gives a few words on the end of the stand alone browser in Longhorn."
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by Sartak (589317) on Wednesday May 19 2004, @01:10PM (#9196965) Homepage
    Will Google's search application functions feature Clippy? Or that damned animated XP Dog?
          • I wouldn't underestimate small dogs. My parents have a little poodle (about 5 pounds), that we swear is able to create mass. He gets fed a bowl of dogfood and then will poop seemingly twice the amount. And when he hs had an accident, it is like he generates twice his body mass. Nothing like listening to your mother complain over the phone about how much the dog is able to shit.
  • by Anonymous Coward on Wednesday May 19 2004, @01:10PM (#9196967)
    ...exactly what "local filesystem image search" will return.

    Finally, a way to effectively search through my gigabytes of pr0n!
  • by Mz6 (741941) * on Wednesday May 19 2004, @01:11PM (#9196979) Journal
    • The Reuters version [reuters.com] you linked is shorter than the NYtimes one. Here is the full version:

      SAN FRANCISCO, May 18 - Edging closer to a direct confrontation with Microsoft [slashdot.org], Google, the Web search engine, is preparing to introduce a powerful file and text software search tool for locating information stored on personal computers.

      Google's software, which is expected to be introduced soon, according to several people with knowledge of the company's plans, is the clearest indication to date that the company, based in Mountain View, Calif., hopes to extend its search business to compete directly with Microsoft's control of desktop computing.

      Improved technology for searching information stored on a PC will also be a crucial feature of Microsoft's long-delayed version of its Windows operating system called Longhorn. That version, which is not expected before 2006 at the earliest, will have a redesigned file system, making it possible to track and retrieve information in ways not currently possible with Windows software.

      Google's move is in part a defensive one, because the company is concerned about Microsoft's ability to make searching on the Web as well as on a PC a central part of its operating system. By integrating more search functions into Windows, Microsoft could conceivably challenge Google the way it threatened, and destroyed, an earlier rival, Netscape, by incorporating Web browsing into the Windows 98 operating system.

      A Google spokesman declined to comment about the new search tool.

      Although Google's core business rests on huge farms of server computers that permit fast searching on the Internet, the company has already taken several steps to move beyond that business.

      Last year, Google began testing a free program called the Google Deskbar that makes it possible to search the Web by entering words and phrases in a small dialog box placed in the Windows desktop taskbar at the bottom of the computer screen.

      Google also sells a computer search system designed to index and retrieve information created and stored by a single organization.

      There is a rich history of less-than-successful attempts to create information search tools for personal computers. In the 1980's, for example, Mitchell Kapor's On Technology developed On Location for retrieving information on Macintosh computers and Bill Gross, a prominent software developer, led a group of programmers to create Lotus Magellan for the PC.

      Digital Equipment's Alta Vista search engine group also developed a search tool for data stored on desktop PC's. Today there are a number of commercial products for desktop searches like X1 and dtSearch. Moreover, both the Macintosh and Windows operating systems have file and text retrieval capabilities.

      The Google software project, which is code-named Puffin and which will be available as a free download from Google's Web site, has been running internally at the company for about a year.

      The project was started, in part, to prepare Google for competing with Windows Longhorn, which according to industry analysts will dispense with the need for a stand-alone browser.

      The disappearance of the Web browser and the integration of both Web search and PC search into the Windows operating system could potentially marginalize Google's search engine. Google, well aware of this threat, hired a Microsoft product manager last year to oversee the Puffin project as part of its strategy to compete with Microsoft's incursion into its territory.

      Microsoft has shown demonstrations of its new search technology, which emphasizes the use of natural language in queries like "Where are my vacation photos?" or "What is a firewall?" Microsoft believes that Longhorn users will no longer think about where information is stored; they will ins

      • by Dun Malg (230075) on Wednesday May 19 2004, @02:00PM (#9197444) Homepage
        Microsoft believes that Longhorn users will no longer think about where information is stored; they will instead see a unified view of documents stored on both the Internet and on the desktop.

        This is one of the silliest notions I've ever heard. If they make no distinction between local files (in user's control) and files "on the internet" (beyond user's control), what kind of crap are we going to have to put up with when people start saying "hey, where's that document I was looking at yesterday?" because they never knew it was on someone else's hard drive and got erased.

  • Advertisements (Score:5, Insightful)

    by Anonymous Coward on Wednesday May 19 2004, @01:11PM (#9196992)
    Wonder whether they'll start serving me ads based on my hard drive contents...
  • privacy (Score:5, Interesting)

    by Councilor Hart (673770) on Wednesday May 19 2004, @01:11PM (#9196993)
    So, will I get ads based on my data?
    • Re:privacy (Score:5, Interesting)

      by Deitheres (98368) <brutalentropy@@@gmail...com> on Wednesday May 19 2004, @01:24PM (#9197149)
      I don't foresee Google adding ads to a local search function... there are no ads on the Google toolbar, nor are there any ads on the Google Deskbar (save the ones that appear in the mini browser, but those are merely Google.com ads).

      Google seems to be as anti-ad as most people on Slashdot. I personally hate ads, but I feel that most of Google's ads are non-invasive and in good taste.
  • by lukewarmfusion (726141) on Wednesday May 19 2004, @01:12PM (#9197009) Homepage Journal
    I recently searched several hundred thousand files on my work machine. It took nearly 90 minutes to complete the search. I expect Google will be able to significantly improve upon that. They're one of the few companies that I really trust to do the right thing.
    • by Petronius (515525) on Wednesday May 19 2004, @01:53PM (#9197397)
      yeah, once they cluster your box with theirs (i.e. copy your files), the searches will be fast.
      • by Waffle Iron (339739) on Wednesday May 19 2004, @02:30PM (#9197684)
        That is to say, Google's utility won't cut your search time to 20 minutes just because they have better code.

        I don't know about that... it used to take me several months to find a document on the Internet when I had to download and grep the entire World Wide Web. My bandwidth bills were astronomical. Since I started using Google, I can now find the same files in a few milliseconds. I say they have much better code than my old "wget -r http://*.*|grep foo".

  • by prostoalex (308614) * on Wednesday May 19 2004, @01:13PM (#9197015) Homepage Journal
    NYT claims the Google PC search competes with Microsoft's. Although Microsoft has never been particularly strong in the area with either Search window in 2000 or that doggie in XP. For me in 1 cases out of 10 the text search (inside the documents, search for specific text) just do not work. There are other vendors that Google will be competing against, not necessarily Microsoft.

    X1 [x1.com] seems to be the most popular one out there.

    DiskMeta [diskmeta.com], they had this project in beta for a while, the Windows product went into relese just last week, the site says

    DT Search [dtsearch.com], I remember their ads in bunch of computer magazines, although have never used them myself.

    EFS [com.com], found it on download.com, supports MS Office and PDF as well as other formats.

    • Actually yes (Score:5, Informative)

      by Pranjal (624521) on Wednesday May 19 2004, @01:18PM (#9197073)

      If you have followed Microsoft developments around Longhorn you might have noticed that search is one of the top priority features that microsoft is going to integrate directly into the operating system. So once Longhorn is released Microsoft would become the biggest competitor to Google's search applications on the web as well the desktop(with this application)

      Search is the next big thing on which a lot of players are concentrating and Microsoft entering the field has skewed the competition towards the desktop and everyone including Google is preparing for the battle.
  • NYT Article (Score:5, Informative)

    by OverlordQ (264228) * on Wednesday May 19 2004, @01:14PM (#9197033) Journal
    No-Reg Link [nytimes.com]
  • by nbvb (32836) on Wednesday May 19 2004, @01:19PM (#9197086) Journal
    Seems to be like a rehash of the AltaVista Desktop search ...

    I keep looking at Google and thinking "wow, this is just like AltaVista, without the death spiral!" :-)

  • Similar ideas (Score:5, Informative)

    by Jugalator (259273) on Wednesday May 19 2004, @01:23PM (#9197135) Journal
    Well, first this idea is part of Microsoft's WinFS plans. The idea with WinFS was partially born when Microsoft developers realized that major parts of the web can be searched faster than a user's hard drive. It will be interesting to see how this application will collide with Microsoft's plans, that's for sure. It's basically fast searches and enhanced metadata support that are the key parts of WinFS, which is in turn a key part of Longhorn.

    Second, an indexing software that does the same thing is already available today and worked very well when I tried it out. It's actually almost perfect, except for the fact that it causes occasional hard drive thrashing as it tries to keep the index up-to-date. This is unfortunately a rather major downside, but if you can bear with this, you'll get literally instant file searches on your entire hard drive -- it narrows down the possible matches as you type each letter. It even indexes file contents for small files. I'm talking about X1 [x1.com].
  • by Karamchand (607798) on Wednesday May 19 2004, @01:24PM (#9197145)
    Google should ask Microsoft for information it has to provide according to the antitrust settlement so that Google's own program can interoperate with Windows as good as Microsoft's!
  • by JasonMaggini (190142) on Wednesday May 19 2004, @01:26PM (#9197171)
    Now, when Google can tell me where I put my keys [fark.com], then I'll be impressed.
  • by The Lynxpro (657990) <lynxpro@gmai l . c om> on Wednesday May 19 2004, @01:29PM (#9197217)
    Since Microsoft considers Google a major competitor and has its target set on Google with Longhorn's capabilities, I think it would be a great idea if Google started distributing their own version of the Mozilla web browser. With Google's reputation, there would definitely be more people making the switch to Mozilla based browsers if Google were to do this. After all, Netscape is considered a failure now by the public and Mozilla to a casual observer lacks credibility no matter how great the product is.

  • by Snork Asaurus (595692) on Wednesday May 19 2004, @01:33PM (#9197247) Journal
    Altavista put out a Windows search app based on their engine technology around 1998 (during their part-of-DEC, better-than-most-search-engines of the time phase). It indexed all documents and provided keyword searches that included Word docs, PDF's and more. It was free and a little buggy but showed promise. Then it just kind of disappeared.

    Perhaps Google can fill this void in the pathetic Windows power tool-set ("Windows power tool-set" being close to an oxymoron).

    But, despite my love for Google, in these more Orwellian times, I'm glad that I have the tools (not from MS) to monitor port activity.

  • by blueZ3 (744446) on Wednesday May 19 2004, @01:45PM (#9197341) Homepage
    Call me crazy, but I actually just keep logically structured directories and make sure to save items into the appropriate location... It's much simpler to take 10 seconds to place a file in the appropriate directory at the start than to hunt for it later.

    Even when a file crosses multiple logical groups, (picture, jpg, family, nephews, 2004) if my information categories are sensible, and I use a heirarchy that makes sense to me, I don't need search that often. In fact, I can't recall the last time I had to do a search of my drive to find a file. (I should probably mention that my work requires a lot of information mapping, so creating and maintaining such a structure is trivial for me)

    Of course, since Windows search is so inefficient and (sometimes) problematic, I learned long ago not to rely on it.

    bluez3
  • by phoxix (161744) on Wednesday May 19 2004, @01:45PM (#9197343)
    grep -r $dir -I -H -n -e "foobar"
    a) it really works
    b) have fun!

    Sunny Dubey
  • by Kaa (21510) on Wednesday May 19 2004, @01:47PM (#9197355) Homepage
    From the article:

    Microsoft believes that Longhorn users will no longer think about where information is stored; they will instead see a unified view of documents stored on both the Internet and on the desktop.

    I don't like this idea. At all.

    The main problem from my point of view has to do with ownership and control. Generally speaking, what's physically on my machine(s) is *mine*, that is subject to my total control (we'll leave aside intellectual property issues). I can add, change, delete, etc.

    Still generally speaking, what's on some machine I access over the net is *not mine* in the sense that my control is reduced. Usually other people can do something with that information (again, add, change, delete) and if the machnine is taken offline, I have no access and no control at all.

    As a simple example, consider a web page. In one case I make a local copy of it on my machine. In the other case I just have a bookmark. The difference in control is fairly obvious...

    Now, what happens if we make users believe there's no difference between their local hard drive and Internet? That we drill into their heads that they are the same?

    Well, you still have no control over information stored on the 'net. Thus, if you were trained to think that the local drive and the 'net are basically the same, then you would expect to have no control over information stored on your hard drive.

    Note that by an amazing coincidence, that's also the goal of DRM -- that you have no control over information (that they call content) stored on your hard drive.

    Also note that the flip side of the coin -- making your hard drive irrelevant by switching to a subscription service for everything, from OS to applications to content, is also a highly popular idea in Redmond and elsewhere.

    So color me highly suspicious with regard to that idea...
    • by YrWrstNtmr (564987) on Wednesday May 19 2004, @03:27PM (#9198303)
      Now, what happens if we make users believe there's no difference between their local hard drive and Internet? That we drill into their heads that they are the same?
      Well, you still have no control over information stored on the 'net. Thus, if you were trained to think that the local drive and the 'net are basically the same, then you would expect to have no control over information stored on your hard drive.


      People are already looking to do this voluntarily. Even among the pseudoenlightened here. Look through some of the comments regarding Gmail. And especially the (false) 1TB announcement.
      Scripting your wordprocessor to autosave to your 1TB (or 1GB) Gmail acct.
      Online hard drive.
      No more backup worries! I can store all my stuff on Gmail!
      Will they release the API so I can automate this?

      People won't need to be forced into this...they will come running.

      But I do agree with you. I don't like it either.
  • by buzzoff (744687) on Wednesday May 19 2004, @02:00PM (#9197445)
    Google will win this battle.

    1. Microsoft doesn't understand that people LOVE Google. Nobody particularly LOVES Microsoft anymore. Product activation, high prices, and security flaws are causing too many headaches.

    2. Google is more innovative. What has Microsoft innovated in the past few years? Their products keep changing their look, but what about user behavior? AD changed admin behavior, but how has IE or Word gotten easier to use? Google has all kinds of creative stuff in the pipe. The Google toolbar has not only changed the way many of my users search, but it prevents a lot of popup related spyware installations as well.

    3. Google is clean. If I see that damn dog show up one more time I'll kill myself. When I search my file system I don't want to hide the stupid mutt, change my options so that subfolders are searched, then click through three screens to say I want to search my file system. Google will cut through this nonsense because they believe in simple/clean interfaces.

    4. The technology Microsoft seeks doesn't exist. Nobody can create a search engine based on current technology that takes plain speech user input and magically transforms it into accurate search results. Everyone I've seen that's tried this has failed to an extent. You can't just try your best to fuzzy match and pass it off as good results.

    • by mathd (656476) on Wednesday May 19 2004, @02:59PM (#9197947)
      3. Google is clean. If I see that damn dog show up one more time I'll kill myself. When I search my file system I don't want to hide the stupid mutt, change my options so that subfolders are searched, then click through three screens to say I want to search my file system. Google will cut through this nonsense because they believe in simple/clean interfaces.
      The dog problem is easy to fix.
      Create HKEY_CURRENT_USER\Software\Microsoft\Windows\Curre ntVersion\Explorer\CabinetState\Use Search Asst as a new String Value and use the value "no".

      You'll have the old windows 2000 search dialogue.
    • by Elwood P Dowd (16933) <judgmentalist@gmail.com> on Wednesday May 19 2004, @03:30PM (#9198343) Journal
      1. Microsoft doesn't understand that people LOVE Google. Nobody particularly LOVES Microsoft anymore.
      People loved Netscape.
      2. Google is more innovative. What has Microsoft innovated in the past few years?
      Netscape was more innovative at first.
      3. Google is clean. If I see that damn dog show up one more time I'll kill myself.
      One of my officemates near to started crying after I used her computer for a minute and disabled Clippy without thinking.
      4. The technology Microsoft seeks doesn't exist. Nobody can create a search engine based on current technology that takes plain speech user input and magically transforms it into accurate search results.
      Didn't. Didn't exist. My college had an excellent linguistics department. Microsoft interviewed every decent computational linguistics student that sent them a resume, and hired several. Yes, all natural language search products that I've seen have sucked. Not all such research projects that I've seen have sucked. I wouldn't be surprised at all if Microsoft innovates a little in this regard. Shocker, I know.

      So... hate Microsoft all you want. I've used and loved Google since 1998 (ie forever), and I'm not betting on this race.
  • by evil-osm (203438) on Wednesday May 19 2004, @02:14PM (#9197565)
    ...10,000 Linux systems connected to your local system and it will all run snappy ;)
  • by farzadb82 (735100) on Wednesday May 19 2004, @02:43PM (#9197794)
    How long before Google pushes their ad-words technology onto your desktop ?

    Would people be willing to live with ads sprinkled throughout their search items ?

    • by TRS80NT (695421) on Wednesday May 19 2004, @01:18PM (#9197071)
      Maybe that's why it's not "Find" anymore. "Find" was evidently too positive a term. Now you only have the ability to "Search".
    • by Verteiron (224042) * on Wednesday May 19 2004, @01:22PM (#9197124) Homepage
      It works a lot better when you enable indexing.

      Or so I'm told. My personal experiences with allowing the Windows Indexing service to run in the background have been that it's more trouble than its worth. Yes, on the rare occasion that it's actually -not- indexing when I search, the search is blazingly fast (compared to a non-indexed search).

      But if the index is currently being modified, then the Windows search feature can't use it. Period. So when you search, you get the text "Windows is currently building an index of the files on drive C:" and it falls back to the regular, non-indexed search. In addition, the indexer consumes massive amounts of RAM while indexing, so a search run when the index is being modified ends up being about two times slower than usual.

      It also doesn't seem to be able to tell when the user is idle. No amount of tweaking seems to fix this, without leaving you with a days-old index. If the index is complete, but you've saved a file since it was completed, that file will not show up in the search at all. I've had it kick on while in the middle of working on something else so often that I finally just turned it off entirely and have resigned myself to slow(er) searches in Windows.

      In the interest of fairness I will say that the search seems to work quite well when searching a remote server that is running the indexing service. But running it locally is just a pain.
    • wingrep (Score:5, Informative)

      by (54)T-Dub (642521) * <tpaine@g m a i l .com> on Wednesday May 19 2004, @01:27PM (#9197179) Journal
      As a developer trapped in windows I find this little tool [wingrep.com] incredibly usefull.
      • Don't underestimate Google. Did you know that Gmail works with (officially) IE, Netscape, Mozilla, and Firefox? They could have just done a bunch of nasty IE-only stuff and forgotten about a whopping 1-5% of the market, and it might have been less work---but they didn't. It also seems to work with Safari (minus the keyboard shortcuts), and I bet Konqueror isn't far behind.

        They might be windows only, but there is a chance they'll decide to please the rest of us, too.

        • by jkabbe (631234) on Wednesday May 19 2004, @02:13PM (#9197559)
          I don't think that's a good comparison. It's a lot easier to write a cross-platform website than it is to write cross-platform applications. Sure, some of the underlying code can be reused. But a lot of the code (particularly for interacting with the file system and the GUI bits) will be platform-specific.
          • It's a lot easier to write a cross-platform website than it is to write cross-platform applications.

            Having done quite a bit of both in the past several years, I'd highly disagree. There are plenty of off the shelf products or methods to create cross-platform applications and very very few (and generally poor in quality) tools or even documentation to write cross-platform websites (modern ones, with dhtml and heavy usage of DOM).

            But a lot of the code (particularly for interacting with the file system and the GUI bits) will be platform-specific.

            Nope, that's pretty much been standardized, assuming you're writing from scratch. Now porting an application written platform specific is a completely different story. But this example is an application written from scratch.

            And as for filesystems, well... nowadays filesystems are much more consistant than, say, SysV versus VMS versus the dozen variants of CP/M. Subdirectories and pretty consistant meta information (date created, date modified, date accessed, etc) on every file is the accepted standard. They may do things different under the hood, but (at this time) they are all pretty much POSIX.

            --
            Evan

      • by Waffle Iron (339739) on Wednesday May 19 2004, @02:10PM (#9197532)
        They're not going to go out of their way and spend resources on an Os that captures a whopping 1-5% of the desktop market.

        Google has a vested interest in trying to help diminish Microsoft's desktop market share. Doing so increases the relative market value of Google's products relative to Microsoft's products.

        To help drive a wedge between Microsoft and their current desktop customers, Google will almost certainly port this kind of tool to other OSes. They would then get into various "enterprise" partnerships with IT solution providers to push pre-canned non-Windows desktops into corporate accounts. This product in particular would help to sell alternative desktops against Longhorn's alleged new filesystem features.

        If this strategy were successful, Google would stand to pick up a good bit of revenue and mindshare at Microsoft's expense. My guess is definitely: Cross platform.

    • Re:interesting (Score:5, Interesting)

      by Kircle (564389) on Wednesday May 19 2004, @01:21PM (#9197107)
      [Google] going to reach a point where they stretch their resources too thin?

      Google researchers are allotted 20% of their working time to do outside projects or to follow personal interests. Google News and Gmail were both results of work done during this "20%" time. So in short, no, I don't think Google has really stretched their resources any more so than before.
    • Re:DO NO EVIL? (Score:5, Insightful)

      by American AC in Paris (230456) * on Wednesday May 19 2004, @01:29PM (#9197212) Homepage
      but what's the benefit to google for this one?

      (Warning: lack of cynicism ahead)

      Seeing as they've built an empire on goodwill, a high-quality free search service, and word-of-mouth name recognition, I'm tempted to guess that their big benefit is continued goodwill and good karma from their userbase.

      Yes, this is a novel concept in a business world where most companies look at customers and see numbers. Thing is, it's goodwill and a user-centric business plan have made Google the great company it is.

      It could be that the 'catch' you're looking for is that Puffin will further solidify their already strong user relationship.

    • by irix (22687) on Wednesday May 19 2004, @02:49PM (#9197845) Journal

      I wish a could beat the creator of google-watch.org and every person who ever linked to it with a gigantic clue stick.

      First of all, the creator of google-watch.org has a really big axe to grind [google-watch-watch.org] with Google.

      Second, HTTP is a stateless protocol. If you want a user's preferences to to persist within a session you need to use cookies or attach a lot of state information to each GET/POST request. If you want the preferences to persist after you close and re-open your browser you have to have the user log in every time and store the prefs on the server or store the prefs on the client side in a cookie like Google does. This simple fact seems to fly right over the head of google-watch.org and their ridiculous cookie conspiracy theories.

      But hey, we've been over this in every Google story since the anti-Google FUD crowd started coming out of the woodwork. Here's a thought: if you really need a tinfoil hat then disable cookies, don't use Orkut and sleep better at night. But please stop subjecting people to google-watch.org FUD.