Microsoft Goes In For Hadoop 67
Frankie70 writes that after more than three years, Microsoft has "finally learned to stop worrying and love Hadoop." Frankie70 excerpts from the linked Wired article: "Any aversion to Hadoop disappeared on Wednesday, when the company announced that it will integrate the platform with future versions of its relational database, SQL Server, and its platform cloud, Windows Azure, an online service for hosting and readily scaling applications. The company is now working to port the Hadoop platform to Windows."
Hadoop is written in Java (Score:5, Funny)
So what they mean is, they're going to do a search and replace to make it compile as a C# application.
Re: (Score:2)
Actually, there is an ever increasing amount of JNI (read: C) code in Hadoop that is in the critical path for security and performance features. Most of that code is not very portable. So either MS is going to pay for some major overhauling of that code, completely new code/branch to replicate that functionality or MS Hadoop is going to be severely lacking in features/performance.
Re: (Score:2)
If that C code is well-written, it shouldn't be hard to port it over - the porting would have to be done at the Java/C boundary, and .NET actually has it much simpler thanks to P/Invoke.
Re: (Score:2)
It isn't. There is an incredible overuse of glibc/Linux-isms to the point that even porting it to another UNIX is difficult.
Re: (Score:2)
Well then, perhaps those guys will clean it up architecturally while they're porting it, and submit the changes upstream.
Re: (Score:1)
Re: (Score:2)
How about Spiteful? .net mean you no longer have to observe the license?
Does taking the source from an Apache licensed product and converting it from java to
I'd say no, I wonder what M$ will say.
Re: (Score:2)
I have to wonder where the first line of the article came from. If it's from the author, nothing to see here, but if it's a near direct quote from the source it sounds more like a shoutout to Dr. Strangelove, which could indicate that they are still terrified of it but are embracing it because they don't feel they have any choice in the matter.
Not wanting to put a dampener on things... (Score:2)
I think MS getting involved with Open Source is great, but....
We've seen the way that they work before, embrace and extend... This hasn't worked out that well for them before, but you have to ask if there is an alterior motive in there...
Re: (Score:2)
Re: (Score:2)
Sell more Windows Server & SQL Server Enterprise/Data Center licenses?
Re: (Score:1)
What major data center (working with large volumes of critical data) in its right mind would ever even consider using MS SQL Server as its database? Who'd consider running their critical server for any database on a windows box?!?!
Not in any major production player I've ever seen or worked at....
Re: (Score:1)
NASQAQ uses sql server http://www.computerworld.com/s/article/106050/Microsoft_unwraps_flagship_database_SQL_Server_2005
DirectEdge - 4th largest stock exchange uses sql server
http://blogs.technet.com/b/dataplatforminsider/archive/2011/06/03/fourth-largest-us-stock-exchange-direct-edge-looks-to-sql-server-parallel-data-warehouse-for-big-data-needs.aspx
Re: (Score:3)
I think they are telling the truth about their goals:
1) Get Hadoop to work on Windows servers
2) Create a Windows server management interface for Hadoop
3) Create SQL Server extension to manage Hadoop.
And the motive is:
a) Sell server licenses
b) Sell SQL Server licenses
Re: (Score:2)
You would be amazed how many people go nuts over the latest F/OSS platform du jour... and then complain that it runs first and foremost under Linux.
Even if they're never going to go anywhere near the underlying OS anyway, still that gets brought up.
Windows Server licensing is quite lucrative for Microsoft. So if they can now announce "Hadoop: Now certified for Windows (TM) Server" they can sell more licenses for Windows Server.
Re: (Score:2)
Re: (Score:2)
I am not sure what an alterior motive is, but I am quite sure that MS has an ulterior motive for this.
My ex-girlfriend was an ulterior decorator.
Re: (Score:2)
> alterior
bad spelling, ulterior. Sorry, my bad.
Re: (Score:2)
They are quite publicly indicating their intention is to embrace and extend:
1) Get Hadoop to work on Windows servers
2) Create a Windows server management interface for Hadoop
3) Create SQL Server extension to manage Hadoop.
So we don't have to speculate, that's what they say they are doing. That being the case all that stuff might be useful for Hadoop.
Re: (Score:2)
Denver Open Source Group meets at MicroSoft (Score:2)
Re: (Score:2)
This isn't about Microsfot getting involved with open source. This is about Microsoft not getting left out. Beyond the countless startups, Apache Hadoop already has major players like Amazon, Dell, EMC, HP, IBM, NetApp, Oracle, VMware, ... trying to make a dent in the community in some form or another. Hell, I have a SuperMicro catalog on my desk emblazoned with the Apache Hadoop logo all over it. Like Oracle, they are coming in very late to the party and now need to play catch-up. Buying off Hortonwo
First... (Score:1)
Then they laugh at you.
Then they port you on their platform.
Then you win.
Original: http://bit.ly/o3V3cA [bit.ly] [Google Books]
Re: (Score:2)
First they ignore you.
Then they laugh at you.
Then they port you on their platform.
Then you win.
First they ignore you.
Then they laugh at you.
Then they port you on their platform.
Then they add some convenient feature that they only make available on their platform
Then they win
FTFY
Re: (Score:1)
Think I will go into a corner and cry for a minute now.
Heh (Score:5, Funny)
Someone should trick Timothy into reposting this article. Then he'd be duped into posting a dupe about hadoop.
Re: (Score:2)
Re: (Score:1)
Congrats, you've demonstrated enough brain-power for kindergarten. If a kid stacks a blue lego block on top of another, you are able to imitate and stack a 3rd blue lego block.
Re: (Score:2)
*golf clap*
Re: (Score:2)
*golf clap*
Something you get from being in the showers with Tiger Woods?
Re: (Score:2)
And then I'd file a false complaint against it being a copyright violation in France.
It would be a Hadopi dupe dupe Hadoop.
But I digress...
Embrace. Extend. Extinguish. (Score:4, Interesting)
http://en.wikipedia.org/wiki/Embrace,_extend_and_extinguish [wikipedia.org]
Re: (Score:2)
"Microsoft allowed the pair" -- Here
"which was rolled into Redmond's Bing search engine" -- It
"But somewhere along the way, Mi
as usual, the summary here is lacking (Score:5, Insightful)
That's wonderful that the summary mentions what "SQL Server" and "Azure" are... but why no mention of wtf "Hadoop" is?
Why do I need to RTFA just to find out what we're talking about here?
Re: (Score:2)
The Wired article tells you what Hadoop is, because it's written by journalists, and aimed at a broad readership.
The /. summary tells you what SQL Server and Azure are, because the descriptions arbitrarily happen to occur in the Wired paragraph that's been quoted.
The /. summary does not tell you what Hadoop is, because (unlike Wired readers), /. readers are expected to have some basic knowledge of the software world. If you think Hadoop is obscure, maybe this isn't the site for you?
Re: (Score:1)
I'm with the OP. Quoting an article is fine, even with descriptions one should know... but if an article is primarily about something that isn't common, then a nice little blurb about it would save AT LEAST TWO PEOPLE a wiki call. Hadoop isn't as ubiquitous as you believe. Coming from a heavy consulting background in coding, I've never seen this.
Oddly, by your logic, /. readers don't know what SQL Server and Azure are. I'll put dollars to doughnuts that more people USE SQL Server than KNOW what Hadoop i
Re: (Score:2)
Oddly, by your logic, /. readers don't know what SQL Server and Azure are.
Nope. As I said, the definitions for those arbitrarily happen to be in the quote chosen for the summary. But those definitions aren't the pertinent part of the quote.
Re: (Score:2)
The /. summary tells you what SQL Server and Azure are, because the descriptions arbitrarily happen to occur in the Wired paragraph that's been quoted.
So what you're saying is .... the person who wrote the summary doesn't know how to write a good summary or pick a good quote. :) (no, I'm not new here...)
Re: (Score:2)
Sadly, this is happening more and more with Slashdot.
Dont look behind that curtian! (Score:2)
"the company plans to eventually release its work back to the open source community."
That is a bit too vague... because 100 years from now is "eventually"
Re: (Score:3)
If you look at their todo list they could release it constantly it won't matter. What they are doing is essentially creating extensions for their commercial products that work with Hadoop. I think they have every intention of trying to get the small parts that need to be in Hadoop back into the main tree.
Dryad (Score:2)
Microsoft has a equivalent to Hadoop known as Dryad.
They should have open-sourced Dryad a long time ago.
I wonder what is going to happen to Dryad with this focus on Hadoop.
Re:Dryad (Score:4, Informative)
Dryad is not quite Hadoop. From their whitepaper:
We can map the whole relational algebra on top of Dryad, however Dryad is not a database engine: it does not include a query planner or optimizer; the system has no concept of data schemas or indices; and Dryad does not support transactions or logs
I can see how Hadoop would supplement their own research in this field.
Microsoft Goes Hadooken (Score:2)
The cool parts (Score:2)
1) They are contributing the bits needed to make it work on windows back to open source (Hortonworks is helping that make sure that goes smoothly)
2) They are making JavaScript a first-tier language for writing map/reduce jobs, and contributing THAT work back to the community.
That is awesome.