Do you develop on GitHub? You can keep using GitHub but automatically sync your GitHub releases to SourceForge quickly and easily with this tool so your projects have a backup location, and get your project in front of SourceForge's nearly 20 million monthly users. It takes less than a minute. Get new users downloading your project releases today!
geekfiend writes "Today Google updated their website to indicate over eight billion pages crawled, cached and indexed. They've also added an entry to their blog explaining that they still have tons of work to do."
This discussion has been archived.
No new comments can be posted.
I agree search engines are so 1990. I rely exlusively on word of mouth to find websites. If Firefox would add a button to the toolbar that said 'Cool Sites', maybe with an icon of a pair of glasses, and have the button link to a webpage with links to the latest cool sites on the net, that would certainly be the end of Google and their 8 billion pages. Pah!
At the same time, can Slashdot create a "Curmudgeon" section for those who like to gripe about the less than monumental significance of some story topics?
by Anonymous Coward writes:
on Thursday November 11, 2004 @08:38AM (#10786314)
I don't want to start an old discussion again... but hereI don't want to start an old discussion again... but here is where rdf,... can play a role. At my school they are starting to deposit articles,... in a repository that has metadata based on the dublin core. Hope this will help searching for that kind of info info: papers,... ?
Anyway, I believe google also has a personalised search:
by Anonymous Coward writes:
on Thursday November 11, 2004 @09:10AM (#10786448)
It is an interesting problem, extact string matching. If you think at how it would be done it is relatively simple for a short piece of text. just call strstr on a chunk of text. The problem, is google does not likely index large bodies of text. Instead, google indexes bags of terms. Each term is likely a stemmed word, that no longer resembles the orignal word. In this way, google compresses the document, saving space, while making it faster to look up key words in a document. The only way I think google could provide exact string matching, is to search their google cache. The problem or limitation with the google cache, is if you didn't notice, google does not cache every page, hence the word cache. While disk space is cheap it is also slow to access, so, even while it is visible google could store all 8 billion pages on disk it is only likely you would want to wait that long to search for your extact match. There are some tricks that could be used to speed narrow in on which documents to do exact string checking in. First they use the string you passed in and do the normal tokenization of the string breaking it down into parts. Then they come up with a result set. Now they can start doing exact string matching within that returned result set. The issue with that is it is undeterministic as to how long that process will take as each document is of arbitrary size. The best they could do would be to do an exact string match in the summary text and return the documents in that set first followed by the other documents, which is very close to what they actually do.
Re:This is news ? (Score:2, Funny)
Google is a constant source of information and a geeks friend - if the index has doubled so has our supply of information. Information rules!
Comment removed (Score:4, Funny)
I'm all alone (Score:4, Funny)
Can't figure of I should just shoot my self or maybe just open a subscription to
And I for one welcome... (Score:2, Funny)
Mhm to anonymous coward or not to anonymous coward?
Will moderators smack my karma below zero?
Re:I'm all alone (Score:4, Funny)
Re:Google Schmoogle (Score:2, Funny)
slashdotting (Score:4, Funny)
Re:This is news ? (Score:1, Funny)
Re:Quality - not quantity (Score:3, Funny)
Nonsense. (Score:2, Funny)
You don't just go from 4 billion to 8 billion overnight.
They are probably just crawling the same 4 billion twice.
Re:Google makes minor change to website - news at (Score:2, Funny)
At the same time, can Slashdot create a "Curmudgeon" section for those who like to gripe about the less than monumental significance of some story topics?
If I kept eating so much spam... (Score:2, Funny)
8 billions.... (Score:2, Funny)
and 19% is pr0n.
There's debate if the remaining 1% contains pirated music and movie or plans for DIY nukes.
Re:slashdotting (Score:3, Funny)
Grrrrr (Score:4, Funny)
Now it's going to be even harder to get my name in the top spot. Why was I cursed with the surname Smith!
Doubled? Wait a minute... (Score:5, Funny)
Re:More pages v.s more relevant pages (Score:1, Funny)
Just tried the beta of the new MSN Search (Score:4, Funny)
It is not clear to me how I can help them improve. Suggest they switch their servers to Linux?
Re:More pages v.s more relevant pages (Score:1, Funny)
Anyway, I believe google also has a personalised search:
http://labs.google.com/personalized
Maybe this can help.
Re:More pages v.s more relevant pages (Score:1, Funny)
Re:More pages v.s more relevant pages (Score:1, Funny)
Re:slashdotting (Score:1, Funny)
Re:slashdotting (Score:2, Funny)