Microsoft To Launch Homegrown Search Engine
Mr. Christmas Lights writes "While Google is currently the king of the hill in search engines, Microsoft continues to lag in market share and uses Yahoo's technology and results. But CNET reports that on Thursday they'll launch their own homegrown search engine, although it appears this is mostly a face-lift (despite a year of development and a $100 million investment). According to Bill Gates, they 'will introduce a homegrown web crawler and algorithmic search engine ... later this year,' which is almost certainly their tech preview (you can look at this now) -- but will that be ready for prime time in less than two months?"
this article is oooooold (Score:5, Informative)
Doubtful this will take any ground. (Score:3, Informative)
Almost there... (Score:2, Informative)
THE bot? (Score:5, Informative)
"msnbot/0.11 (+http://search.msn.com/msnbot.htm)"
The only way to stop it was to block the IP (robots.txt was read only once, before it started crawling). Great, smart bot, really.
Re:Netcraft says the hosting servers run on Linux (Score:2, Informative)
Mostly wrong. [netcraft.com]
Re:About time (Score:5, Informative)
Re:About time (Score:3, Informative)
Re:3 bad results. (Score:4, Informative)
Linux. No pointers to linux.org.
Google. Returns the Dutch/Belgian version of the page. Why?
These are no longer true. I know it used to do this but now ...
'Orange' returns Orange.co.uk
'Linux' returns linux.org
'google' returns google.com
'microsoft sucks' returns fuckmicrosoft.com
'abu graib' returns photographs from inside the prison.
'lindows' returns lindows.com
This is from Firefox 0.8 on Red Hat Linux.
BB
Wow (Score:1, Informative)
MSN Search is really similar as well, from the way text ads are done to the font colours. A complete rip-off of Google.
Re:About time (Score:2, Informative)
Unconvinced... here are some stats from my logs:
MSNBot hits: 10217+77 bandwidth: 441.67 MB
Googlebot hits: 116+90 bandwidth: 16.13 MB
This is after I modified the robots.txt file, and it covers only a two-week period in October. MSNBot was drawing nearly 1 gigaBYTE of upstream per month, just from my lowly site! No thank you... I promptly did this:
iptables -A INPUT -p all -s 65.54.0.0/16 -j DROP
I encourage all other webmasters to do the same.
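Per-bot figures like the ones quoted above can be checked against a raw access log before deciding to block a whole address range. What follows is only a rough sketch, assuming the server writes Apache-style "combined" log lines; the function name tally_bots and the default bot keywords are made up for illustration and are not taken from any stats package.

```python
import re

# Assumed layout: Apache "combined" log format, where the response size
# follows the status code and the user agent is the last quoted field.
LOG_RE = re.compile(
    r'^\S+ \S+ \S+ \[[^\]]+\] "[^"]*" \d{3} (?P<size>\S+) '
    r'"[^"]*" "(?P<agent>[^"]*)"$'
)


def tally_bots(lines, bots=("msnbot", "googlebot")):
    """Count hits and response bytes per crawler (hypothetical helper).

    A line is attributed to a bot when the bot keyword appears
    (case-insensitively) in its user-agent field.
    """
    totals = {bot: {"hits": 0, "bytes": 0} for bot in bots}
    for line in lines:
        match = LOG_RE.match(line.strip())
        if not match:
            continue  # skip lines that are not in the assumed format
        agent = match.group("agent").lower()
        size = match.group("size")
        # The size field is "-" when no body was sent (e.g. a 304 reply).
        nbytes = int(size) if size.isdigit() else 0
        for bot in bots:
            if bot in agent:
                totals[bot]["hits"] += 1
                totals[bot]["bytes"] += nbytes
                break
    return totals
```

To use it, pass an open log file (or any iterable of lines) to tally_bots and divide the byte totals by 1024 * 1024 to get megabytes. Matching on the user-agent string trusts whatever the client claims to be, so the numbers are only as reliable as that header.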
Re:About time (Score:5, Informative)
User-agent: msnbot
Disallow: /
iptables -A INPUT -p all -s 65.54.0.0/16 -j DROP
Or even better, if you have the TARPIT module:
iptables -A INPUT -p tcp -s 65.54.0.0/16 -j TARPIT
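One caveat with robots.txt rules for msnbot: an empty "Disallow:" value permits everything, while "Disallow: /" blocks the whole site. Below is a minimal sketch, assuming Python's standard urllib.robotparser module, for checking what a rule set actually does before relying on it. The helper name allowed and the example paths are made up for illustration, and the check only says what a well-behaved crawler should do, not what a given bot really does.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_lines, user_agent, path):
    """Return True if robots_lines permit user_agent to fetch path.

    Hypothetical helper: parses the rules in memory rather than
    fetching a live robots.txt from a server.
    """
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, path)


# An empty Disallow value means "nothing is disallowed".
open_rules = ["User-agent: msnbot", "Disallow:"]

# A single slash disallows every path on the site.
closed_rules = ["User-agent: msnbot", "Disallow: /"]

agent = "msnbot/0.11 (+http://search.msn.com/msnbot.htm)"
print(allowed(open_rules, agent, "/index.html"))
print(allowed(closed_rules, agent, "/index.html"))
```

The first call reports the page as fetchable and the second does not. The iptables rules remain the only option if a crawler ignores robots.txt altogether.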