Google Index Doubles

Posted by samzenpus on Thursday November 11, 2004 @07:00AM from the even-more dept.

geekfiend writes "Today Google updated their website to indicate over eight billion pages crawled, cached and indexed. They've also added an entry to their blog explaining that they still have tons of work to do."

This discussion has been archived. No new comments can be posted.

no update on the images (Score:3, Informative)

by bvdbos ( 724595 ) writes: on Thursday November 11, 2004 @07:07AM (#10785980)

Unfortunately they didn't update [slashdot.org] the image-search [google.com] yet.

Google needs your cookie badly (Score:2, Informative)

by Anonymous Coward writes: on Thursday November 11, 2004 @07:27AM (#10786053)

Until today you could save your google settings [google-watch.org] without losing your privacy [google-watch.org]. You can still save those settings but google refuses to use them when you block their cookie. In my case I get 10 search results although I like to receive 100. Seems that they are making many dollars on a user's cookie, and now that they are a public company my privacy is less important than "stock holders' interests".

Google domination. (Score:2, Informative)

by Anonymous Coward writes: on Thursday November 11, 2004 @07:30AM (#10786069)

Local tabloid Aftonbladet is running a poll on search engine use:

Google (81.4 %)
Yahoo (2.2 %)
MSN (3.8 %)
Other (11.4 %)
Don't know (1.2 %)

61730 votes so far.

I'm a little surprised: either the masses who use the "default" (MSN?) aren't bothering to answer, or google is simply very, very dominant and those "default-using masses" do not exist [in this country].

Re:Google thieves my bandwidth (Score:5, Informative)

by Anonymous Coward writes: on Thursday November 11, 2004 @07:35AM (#10786089)

Google respects the robots.txt file. Use it.

Re:I'm all alone (Score:3, Informative)

by tadmas ( 770287 ) writes: <david AT tadmas DOT com> on Thursday November 11, 2004 @07:43AM (#10786114) Homepage

8 billion pages and not a single link to my blog.

Perhaps you should just tell them where it is [google.com].

Mine is bigger than yours!!! (Score:5, Informative)

by ayjay29 ( 144994 ) writes: on Thursday November 11, 2004 @07:46AM (#10786124)

From BBC News here [bbc.co.uk].

In a statement Microsoft said its search engine returned results from five billion web pages - more than any other search engine.

But this quickly won a response from Google which announced that its index has now grown to more than 8 billion pages.

Prior to the Microsoft announcement, Google was only indexing 4,285,199,774 web pages.

Steve Ballmer is soon to announce that his daddy is one hundrad years old, and kan kick your daddy's ass...

Re:Quality - not quantity (Score:4, Informative)

by dabadab ( 126782 ) writes: on Thursday November 11, 2004 @07:49AM (#10786134)

"Since pagerank was switched off"

Since when is Pagerank switched off?

Searching LiveJournal.com (Score:5, Informative)

by hackrobat ( 467625 ) writes: <manish.jethani@gma i l .com> on Thursday November 11, 2004 @07:49AM (#10786135) Homepage

Looks like they've added a gazillion LiveJournal [livejournal.com] pages to their index. I used to have a Google search box on my LJ that didn't throw up relevant results until last week or so. Now it works perfectly, just like builtin search (like what you see in MT and WordPress).

Competing with Microsoft's 5bn? (Score:5, Informative)

by Richard W.M. Jones ( 591125 ) writes: <{rich} {at} {annexia.org}> on Thursday November 11, 2004 @07:51AM (#10786143) Homepage

On the same day that this story hits the BBC [bbc.co.uk]. In that story Microsoft claim that they have 5 billion pages indexed, more than the 4.2 billion pages indexed (at that point) by Google. The BBC have just updated the story with the 8bn figure.
I smell competition!
Rich.

robots.txt (Score:4, Informative)

by ReKleSS ( 749007 ) writes: <rekless AT fastmail DOT fm> on Thursday November 11, 2004 @07:51AM (#10786148)

Yes, this is probably a troll, but anyway... I take it you've never heard of the robots.txt file? You sound like you might want to read up on it. It's designed to help control the spidering of your pages for whatever reason, particularly cases like yours or situations where a spider would get confused and end up doing something stupid (recursive stuff, etc).
-ReK
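
For anyone who wants to see what such rules do before deploying them, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt contents, the example.com URLs and the "OtherBot" name are made up for illustration; only the file format and the parser module are real.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep Googlebot out of /private/ only,
# and keep every other crawler out of the whole site.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /private/

User-agent: *
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse in-memory rules, no network access


def allowed(agent: str, url: str) -> bool:
    """Return True if the rules above let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("Googlebot", "http://example.com/index.html"),
        ("Googlebot", "http://example.com/private/notes.html"),
        ("OtherBot", "http://example.com/index.html"),
    ]:
        print(agent, url, allowed(agent, url))
```

Keep in mind that robots.txt is advisory: it only helps with crawlers that choose to honour it.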

Re:Google needs your cookie badly (Score:3, Informative)

by Anonymous Coward writes: on Thursday November 11, 2004 @07:52AM (#10786151)

You can still save those settings but google refuses to use them when you block their cookie. In my case I get 10 search results although I like to receive 100.

Create a keyword bookmark [mozilla.org] with the URL

http://www.google.com/search?q=%s&num=100 [google.com]

Give it the keyword 100, then type 100 search_term in the address bar to use it.
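
Roughly what the browser does with that bookmark can be sketched in a few lines of Python. The expand() helper is hypothetical, and a real browser may encode spaces differently (for example as %20 instead of +), so treat this as an illustration of the %s substitution only.

```python
from urllib.parse import quote_plus

# URL template from the bookmark above; %s marks where the search terms go.
TEMPLATE = "http://www.google.com/search?q=%s&num=100"


def expand(template: str, terms: str) -> str:
    """Substitute URL-encoded search terms for the %s placeholder."""
    return template.replace("%s", quote_plus(terms))


if __name__ == "__main__":
    # Typing "100 slashdot google index" in the address bar would expand to:
    print(expand(TEMPLATE, "slashdot google index"))
```

The num=100 parameter travels in the URL itself, which is why this works without the preferences cookie.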

great but where are the .txt and directories? (Score:3, Informative)

by js7a ( 579872 ) writes: <`gro.kivob' `ta' `semaj'> on Thursday November 11, 2004 @08:06AM (#10786189) Homepage Journal

Google won't be within reach of the pinnacle until they index .txt files, directory listings, and anonymous ftp sites.

Re:Google thieves my bandwidth (Score:5, Informative)

by jvj24601 ( 178471 ) writes: on Thursday November 11, 2004 @08:09AM (#10786196)

Well, if you know that Google is indexing your site and "stealing" your bandwidth, then you must have looked at the server logs, right? You'd see the name of the search bot is googlebot. Search for it [google.com], and you'll find that the first relevant link [google.com] explains how to prevent googlebot from accessing your site.

The logs would probably also show failed attempts to find the file /robots.txt. Similar info is gained from searching on that term [google.com] as well.
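
As a rough sketch of that log check, the Python below counts requests whose line mentions googlebot and how many of those were failed (404) requests for /robots.txt. It assumes an Apache-style combined access log; the sample lines, IP addresses and the summarize() helper are invented for illustration, so adjust the pattern to whatever your server actually writes.

```python
import re

# Matches the request and status fields of a combined-format log line,
# e.g. "GET /robots.txt HTTP/1.0" 404
REQUEST = re.compile(r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" (?P<status>\d{3})')


def summarize(log_lines):
    """Return (googlebot_hits, failed_robots_txt_requests)."""
    bot_hits = 0
    robots_404 = 0
    for line in log_lines:
        if "googlebot" not in line.lower():
            continue  # not a Googlebot request
        bot_hits += 1
        m = REQUEST.search(line)
        if m and m.group("path") == "/robots.txt" and m.group("status") == "404":
            robots_404 += 1
    return bot_hits, robots_404


# Made-up sample lines, for illustration only.
SAMPLE = [
    '66.249.0.1 - - [11/Nov/2004:08:00:00 +0000] "GET /robots.txt HTTP/1.0" 404 208 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '66.249.0.1 - - [11/Nov/2004:08:00:01 +0000] "GET /index.html HTTP/1.0" 200 5120 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '10.0.0.7 - - [11/Nov/2004:08:00:02 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" "Mozilla/5.0"',
]

if __name__ == "__main__":
    print(summarize(SAMPLE))
```

A non-zero second number means the bot looked for a robots.txt and did not find one.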

Re:More pages v.s more relevant pages (Score:3, Informative)

by __aahlyu4518 ( 74832 ) writes: on Thursday November 11, 2004 @08:20AM (#10786241)

Personally I find that the lack of relevant pages is the biggest problem with search engines, not the lack of pages with information.

Actually.... information IS relevant data. If it's not relevant to what you want, then it is just data...

Re:great but where are the .txt and directories? (Score:3, Informative)

by geminidomino ( 614729 ) * writes: on Thursday November 11, 2004 @08:21AM (#10786244) Journal

One out of 3 [google.com] ain't a bad start. Add a few more keywords to narrow down the google-crawling.

Re:What? (Score:5, Informative)

by jez9999 ( 618189 ) writes: on Thursday November 11, 2004 @08:25AM (#10786261) Homepage Journal

Erm, that's only because of the bizarre plus signs the grandparent poster put in - try this [google.com]. Note to grandparent: Just about any modern search engine assumes words not prefixed by anything are to be included in the Boolean search query. No need for +.

try +the (Score:2, Informative)

by leuk_he ( 194174 ) writes: on Thursday November 11, 2004 @09:20AM (#10786516) Homepage Journal

Yes there is, try to search for

The Doctor

vs

+the doctor

Re:meta-no-archive (Score:3, Informative)

by justMichael ( 606509 ) writes: on Thursday November 11, 2004 @11:34AM (#10787868) Homepage

I'm sure I've seen some way of doing a sort of backwards search on a page, that will show all the pages in Google that link to it.

The search you are looking for looks like this: link:slashdot.org [google.com]

Re:Proximity search will help (Score:2, Informative)

by mazarin5 ( 309432 ) writes: on Thursday November 11, 2004 @01:32PM (#10789341) Journal

Google has a near operator: *
Only useful in a quoted string.
Example:
Thomas * Edison [google.com]

Re:Makes you wonder... (Score:3, Informative)

by RedWizzard ( 192002 ) writes: on Thursday November 11, 2004 @05:13PM (#10791955)

Maybe the steep increase is due to all the new file formats they are indexing now.

The steep increase is probably due to an architecture change. Google has, for a long time, been indexing around 4 billion pages. That implies that they have been giving each page a 32-bit unique identifier, and had exhausted that id space. It would be a lot of work for them to seamlessly upgrade all their software to support a larger id, and it has taken them a long time to do so. Now that they have, the large jump in pages is simply due to the fact that they can index much more of the web.
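
The arithmetic behind that guess is easy to check. The sketch below only compares the page counts mentioned in this thread with the size of a 32-bit id space; it says nothing about how Google actually stores page ids, which is speculation here.

```python
ID_BITS = 32
id_space = 2 ** ID_BITS             # 4,294,967,296 distinct ids

previous_index = 4_285_199_774      # page count quoted earlier in this thread
new_index = 8_000_000_000           # "over eight billion", rounded down

fits_before = previous_index < id_space   # old count fits in 32 bits
fits_after = new_index < id_space         # new count does not
headroom = id_space - previous_index      # ids left before the 32-bit ceiling
bits_needed = new_index.bit_length()      # smallest width that holds the new count

if __name__ == "__main__":
    print(f"32-bit id space: {id_space:,}")
    print(f"headroom at the old count: {headroom:,}")
    print(f"old count fits: {fits_before}, new count fits: {fits_after}")
    print(f"bits needed for the new count: {bits_needed}")
```

The old count sits under ten million short of 2^32, which is what makes the 32-bit theory plausible, though it remains a guess.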

