Supercomputing: Raw Power vs. Massive Storage 346
securitas writes "The NY Times reports that a pair of Microsoft researchers are challenging the federal policy on funding supercomputers. Gordon Bell and Jim Gray argue that the money would be better spent on massive storage instead of ultra-fast computers because they believe today's supercomputing centers will be tomorrow's superdata centers. They advocate building cheap Linux-based Beowulf clusters (PCs in parallel) instead of supercomputers." NYTimes free reg blah blah.
Ny Times free reg?! (Score:5, Informative)
Just use the google link!
Re:Ny Times free reg?! (Score:5, Funny)
Heh, even with the Google hack.
Partner = Slashdot (Score:5, Informative)
Re:Partner = Slashdot (Score:5, Funny)
Re:Partner = Slashdot (Score:3, Funny)
We know you do, Anonymous Coward, we know you do.
Re:Ny Times free reg?! (Score:2, Insightful)
Im happy enough signing up once for a website, using a spam catching email address, and using those login detials to read whatever the editors post.
If you dont like registering for an article, then dont read it, but dont bitch about it.
Microsoft recommending Linux Beowolf cluster? (Score:5, Funny)
Re:Microsoft recommending Linux Beowolf cluster? (Score:4, Insightful)
Because getting rid of "big iron" will kill IBM and Sun, their competitors. They can't *really* think that a cluster of PCs is a one-size-fits-all solution.
Re:Microsoft recommending Linux Beowolf cluster? (Score:5, Interesting)
Re:Microsoft recommending Linux Beowolf cluster? (Score:5, Insightful)
I doubt it. You can only use a "cluster" like a Beowulf if your problem can neatly be divided into small, completely independent work units. If you want to render a movie, then so long as you have all the scene data, each frame can be rendered completely independently of any other, then stitched together at the end.
Try using a Beowulf-style cluster for a CFD problem, and watch as all computation grinds to a halt as your processors and interconnects devote all their capacity to inter-node coherency and synchronization. You need a traditional supercomputer like an SGI Origin for jobs like that, because of its massive internal bandwidth.
There is absolutely no danger of Beowulfs killing off the supercomputer in the near future. In fact, the supercomputer market is looking pretty healthy.
Re:Microsoft recommending Linux Beowolf cluster? (Score:4, Interesting)
That used to be true, but I don't think it is anymore. A high-end Beowulf compute node these days typically gives you 2 processors and 2-4 Gigabit Ethernet channels, going into a high-end switch. That seems like it's in the same ballpark as the SGI Origin, which gives you nodes with up to 16 processors, up to 12GB/sec aggregate memory bandwidth, and 8 channels going into the router. They aren't going to perform identically, but I think the differences are diminishing.
Furthermore, with distributed shared memory software, parallel linear algebra libraries, and SIMD-on-MIMD libraries, you can program it more or less like you would have a traditional supercomputer, without having to worry a lot about synchronization.
OpenMosix, in an upcoming release, even promises to give you address spaces that cross machines, giving you effectively a NUMA machine on a network of PCs.
Re:Microsoft recommending Linux Beowolf cluster? (Score:3, Insightful)
Where can I get Gigabit routing hardware that is comparable in price to 100BaseT equipment?
Re:Microsoft recommending Linux Beowolf cluster? (Score:5, Interesting)
B...S...; we use a small Beowulf (16 dual 1 GHz PIII boards with a fast ethernet backplane from PSSC [pssclabs.com]) for oceanic numerical modeling and the problem scaled almost perfectly with number of processors.
Our models are 3-dimensional, but sudivision and message passing takes place only in the horizontal two-D direction. And message passing only needs to account for the boundary nodes.
Ease of use is a bit of a larger issue, however. For convenience sake I usually end up running at home on the dual Athlon and then doing big runs and batch jobs on the Beowulf.
Re:Microsoft recommending Linux Beowolf cluster? (Score:5, Insightful)
These are not MS evangelists we're talking about here. Gordon Bell and Jim Gray know a 'thing or two' about high-performance computing.
If these guys weren't able to speak their minds on technical matters entirely without retribution from Bill and Steve, they wouldn't be at MS at all. They don't have to be. They CERTAINLY don't have to tow the party line and recommend the flavour of the week, because it messes with the latest Sun/IBM/HP/Linux/Mac threat.
Now if you actually look at the statement these guys are making and examine it based on their history, they've got a very good point. They're not talking even remotely about 'one-size-fits-all' systems--they're talking about the future of cutting-edge research.
Re:Microsoft recommending Linux Beowolf cluster? (Score:2, Informative)
Gordon bell prize != high performance computing (Score:3, Informative)
this is not what high performance computing is about. this is the class of problems that are embarassingly parallel and dont need good disk access. in short pointless benchmarks like computing pi rather than solving real tightly coupled physics probelms like say asteroid impa
Re:Microsoft recommending Linux Beowolf cluster? (Score:2, Insightful)
Will someone ever shut the hell up and not spew crap just because they feel like pissing someone else off?
Re:Microsoft recommending Linux Beowolf cluster? (Score:3, Funny)
Re:Microsoft recommending Linux Beowolf cluster? (Score:4, Informative)
That's pretty clearly mentioning Linux.
Re:Microsoft recommending Linux Beowolf cluster? (Score:3, Informative)
Re:Microsoft recommending Linux Beowolf cluster? (Score:3, Informative)
Of course they will suggest to use that instead of Linux because...[fill in the blank]
Holy Shit! (Score:2, Informative)
Re:Holy Shit! (Score:2)
You can be wrong six ways from sunday!
Re:Shocked! SHOCKED, I tell you! (Score:2)
Davak
world domination. (Score:2)
"Letting the researchers decide" is a clear means of pushing M$ crap. While we might imagine people spinging up to do the work, M$ is still up to it's tricks and not everyone knows how to set up a cluster. Between a shortage of trained people ready to move and Paladium, M$ stands to suck up sales. NASA and others have shown the way, but M$ has blocked better schemes before. Just look at the last article [slashdot.org] on running a reasearch lab with free soft
I'm Sorry... (Score:5, Funny)
My calendar says June 2nd. What does yours say?
GTRacer
- ? slooF lirpA
Re:I'm Sorry... (Score:2)
Calendar (OT) (Score:2)
so happy birthday to me.
(maybe not april fools, but commemorating the birth of a fool?
This is from Microsoft *research* (Score:3, Insightful)
I've heard presentations from them, and talked to them in private, and I can assure you they are far from following the party line. I'm sure that any pressure from above to do so would cause
Re:I'm Sorry... (Score:2)
Well Sort of, If you read the article carefully Gordon Bell and Jim Gray, scientists at Microsoft's Bay Area Research Center, are saying that scientific computeing is moving toward a data-centric and away from a processing-centric model. They did give lip-service to linux/beowolf technology, but also said
"But the Beowulf is a Volkswagen and these people are selling trucks."
So actualy they are saying that Sun
Bell and Gray not just MS researchers... (Score:5, Informative)
Which is what makes this so sad (Score:5, Insightful)
Research on building Mega beowulf clusters is a legit govt activity and so is building some. But the beauty of the beowulf cluster is that it is affordable to bussinesses, acadmeics and govt, plus its very adaptable to budgets and interconnection schema (fast, slow, grid, scavenger).
but beowulf clusters wont replace the need for super fast, super scalable, computers with well architected interconnects. there are lots of problems in this class, mostly physics simulation, that just cant be done well on beowulf clusters.
I should probably note that my own work involves large computer clusters. However my probelms (in biology) are in fact well suited for beowulf clsters. thus I'm happy to hear of more money for beowulf computing. but frankly I think that this should be in addition to the fast computers.
the flip side here is that it might be the case that money for fast computer resources is not being well spent as it could be at present. there seems to be too much emphasis on "landing the contract" for the computer center than on building a good design. congress via DOE tends to doll these things out in a political fashion making sure each big client gets funding for a center rather than letting the best center get the most contracts. as a result some of the so-called super computers may be just glofied too-expensive-per-cpu unscalable systems already that could be eclipsed by a comparable low cost beowulf system.
but that being said its still an area that the gov needs to fund since it wont drive itself commercially but its needed for lots of science and simulation.
I disagree respectfully. (Score:3, Interesting)
I remember reading in Wired magazine a few years ago about a biotech company here in the San Francisco Bay Area that clustered several hundred machines running Pentium III 600 MHz CPU's to do DNA mapping and analysis--and the results were just as fast as most supercomputers costing several times what that cluster cost.
Imagine what a cluster of 700 to 1,000 blade servers running the latest Intel Xeon CPU's
Re:I disagree respectfully. (Score:2, Informative)
Re:I disagree respectfully. (Score:5, Informative)
Actually, it would be a very crappily-performing cluster. Blade servers are designed with two major goals - CHEAP and SMALL. Blade servers are engineered for high availability applications (think webserver farm).
Just because you CAN do something doesn't mean it's the optimal solution. It amazes me when I see vendors selling blade server clusters.
(Disclaimer: I work as an engineer with a company with builds Linux based clusters for universities and labs)
Re:I disagree respectfully. (Score:3, Informative)
News for you: Linux clusters are [techextreme.com] the new supercomputers. Not just Blue Gene, but probably Ascii Purple as well, which is supposed to be the fastest supercomputer ever.
They are so FIRED!!!!!! (Score:5, Funny)
Re:They are so FIRED!!!!!! (Score:4, Funny)
Open to possibilities. (Score:4, Interesting)
And I think they have a good point about massive memory being a very important part of computing advancement right now.
Re: Open to possibilities. (Score:2)
> Its nice to see some MS researchers going against the perceived stereotype and being open in their suggestions like this.
Microsoft does hire real CS/IT researchers, and there's no reason to suppose that they're all mini-monopolists waiting to grow up and hold the world ransom for... one million dollars.
Moreover, even if they have "handlers" in Marketing, notice that switching from supercomputers to Beowulf clusters isn't going to hurt Microsoft any, since they aren't playing in the supercomputer ma
Re:Open to possibilities. (Score:2)
It would mean that open-source won, and proved its point. When the world of computing finally turns from infighting to cooperation, there is nothing that cannot be accomplished.
Peanut clusters... (Score:5, Funny)
obligatory Beowulf meme (Score:3, Funny)
Man, I'd like to see a... um... damn.
Emergency auxiliary meme (Score:3, Funny)
In Soviet Russia, clusters Beowulf you!
Hrm (Score:5, Funny)
New York Times?
MSFT'ers recommending Linux?
I thought they fired that reporter who was making things up
Re:Hrm (Score:2)
Those of you that remember the Kevin Mitnick deal will remember this guy making stuff up back then too.
Nice (Score:5, Insightful)
Plus, think of all the computers that go unused at night in places like school computer labs. All those free machines could, at night, join a cluster and do number crunching for researchers.
Re:Nice (Score:5, Insightful)
Re:Nice (Score:2)
Re:Nice (Score:5, Informative)
Mod this guy up. He's really telling the truth!
Loosely coupled clusters like PDSF [nersc.gov] are great for work like what the high energy physics people do, like SNO [slashdot.org].
However, somethings work better on vector architectures such as climate models and fusion work: there is a reason why the Spanish Met troops [cray.com] bought a Cray. Additionally, some chemistry, many fusion and several other codes work best on vector architectures.
There guys presented their global warming work where at my job. They've developed their climate code though as a parallel one. See here [ucar.edu]. One of the places that they have been running is on seaborg [nersc.gov], an IBM RS/6000 with over 6k and near 7k processors.
Interestingly, the PCM guys presented what they wanted for an uber'puter. While it had massive amounts of storage, it was also a 500 *PETAFLOP* SUSTAINED PERFORMANCE machine.
*clickety clack* That'd be something like 166,666,666 Athlons. IDK of any interconnects that handle that. Can you imagine being an admin? Better hope you're good on rollerblades zipping to and fro replacing those oh-so-reliable commodity disks and CPUs...even if you have a .05% failure rate, that's still too damn much. As an admin, that'd be a huge waste of time. It'd also wreck havoc on the guys running stuff.
Or is that what grad students are for? To attempt such a silly thing and then admin it? ;)
Seriously tho. To get from here to their, we're going to need some exotic techs...not just more 'attack of the killer micros'.
Re:Nice (Score:2)
*shrug* I only admin the bloody things
Re:Nice (Score:2)
I think they're 1.54 GHz Athlons, off the top of my head. I'll double check that later.
We have an Opteron 4 way system. We're under NDA tho, so I can't speaaa*mmmph*mpph*
Cray is actually taking a multipath approach to their next systems. They have the X1, their current development in the vector architecture, which they are going to follow up with an enhanced version. Then they also have the Red Storm, which you note, for scalar-cluster work. Then they have the MTA for threaded work too. Basically, t
Re:Nice (Score:3, Insightful)
The MTA is dead, sadly. It was an interesting architecture, but Cray never did any work on it after Tera bought cray and changed name.
Burton Smith is Cray's lead systems architect now and heading up Cray's entry into DARPA's High Productivity Computing effort. I wouldn't want to call the MTA dead just yet. Burton's prolly not gonna let his baby die...:D
BTW, I think Burton Smith would be an ideal candidate for a /. interview. He'd prolly be amianable to the whole idea too.
Re:Nice (Score:2)
Now, to break it down:
1. Earth Simulator: classive vector super computer.
2 & 3. ASCI Q: Alpha Cluster.
4. ASCI White: Power3 rig, not sure if it's a cluster or not...
5. MCR Linux Cluster: name says it all.
6 & 7: Both are Alpha clusters.
8. HPTi: Linux Beowulf.
9 & 10: IBM p690's.
So, out of the top 10, -6- are clusters, not all but 1. I don't know what list you're using, but it certainly isn't t
Re:Nice (Score:2, Informative)
What usually governs what the machine is good at is more towards laten
Re:Nice (Score:4, Informative)
Actually, when you say you did you take a look at the top 500 list, you should put actions behind your words. The top cluster is at #5 on the most recent list (LLNL's NetworX machine - http://www.top500.org/list/2002/11/), and is less than 20% behind the #2 spot. Guaranteed that within a year, linux clusters will indeed fill the #2 spot on down.
Second, hydrodynamic problems (which are a class of hyperbolic PDEs), deal with nothing but local communications, and scale quite well even on Linux clusters. The more challenging set of problems are non-local PDEs (elliptic and parabolic -- like Poisson's equation and heat transfer). Because these problems couple every point in space to every other point in space at ever time, they reamin tough to solve on a parallel machine no matter what platform you are on.
The Earth Simulator is a highly special case. The Japanese government made an enormous investment (well over $500 M) to purchase that machine. Even with the support of the DOE and private industry (increasingly biotech), the US just does not have the political willpower to spend that much on a single platform. It is often neglected that the current paradigms of high-performance computing are lacking in many respects -- some refer to the recent move towards very large parallel machines as "a great step backwards". We have to pursue technically innovative solutions which will be both cheaper to purchase than the Earth Simulator, and more efficient to use.
Re:Nice (Score:2)
Actually, I'll have to disagree with this, to a point.
I expect to see no -more- than half the top 10 be linux clusters. The reason for that is that most of the f
Interesting (Score:2)
This isn't all sweetness and light (Score:5, Insightful)
I wonder if they have a database recommendation. (Score:3, Insightful)
Just waiting for the other shoe to drop...
I think they are right (Score:5, Insightful)
I'm perfectly happy with my P3 800MHz, but I run out of hard drive space everyday.
Cheap, YET RELIABLE high density storage solutions are still not readily available. I know we are now down to a $1 per Gig, but the average size of a user's file has increased now. Media (legal or otherwise), games, and other programs are chewing up hard drive space.
There needs to be more research into trustworthy, lowcost high volume storage mediums.
Yes, but we need research into BOTH (Score:5, Interesting)
Industry will spend R&D money on clustering for storage and reliability, without major government subsidy, because there's a crying need for it. How much government money went into Google/eBay/Amazon?
Government research is supposed to complement industry R&D - to be aimed at fields where the results are still important, but maybe not as profitable. This is why government should not abandon raw speed as a research goal.
Re:I think they are right (Score:2)
Bullshit.
While I'm not going to comment -too- much on the "who really needs a 3+GHz machine" it needs to be said that gamers tend to push them. It -also- needs to be said that developers on large projects will also push them. Downtime
Smart move. (Score:4, Insightful)
Another name for P2P? (Score:2, Offtopic)
Sounds familar, and the RIAA and MPAA's worst fears.
more Beowulf clusters coming your way! (Score:2, Interesting)
I don't think that's what they said (Score:5, Insightful)
When they mention beowulfs, it's in the context that when researchers need the equivalent of a supercomputer, they can just build/use a beowulf cluster. What they can't do on their own is come up with petabyte storage facilities and the data in them.
So what they're really advocating is spending money on storage; it doesn't say in the article what form that storage should take.
The government may very well like this. They're going to need big data farms to support the TIA program. It takes a lot of space to remember what kind of toppings every person in the US likes on their pizza.
Re:I don't think that's what they said (Score:3, Interesting)
He says it himself. Stop deciding where to put the money and let science decide itself.
They work in a Microsoft research facility... (Score:5, Interesting)
From the MS site, the Bay Area Research Center is "... a small Microsoft Research group located in the San Francisco Bay Area. We've been working on two large projects with other universities, companies, other Microsoft Research groups, and with Microsoft product groups in Redmond and Cupertino. These projects are Scalable Servers and Media Presence. "
I can't see scalability involving commodity hardware with MS OSes. In spite of Microsoft's desktop domination strategies, and small business server dominance (arguably, at least for the moment) they know they won't be taken seriously about clustering Windows 2003 server, purely because there is no design AFAIK in the kernel for operating in clusters in the first place. This is supercomputing using commodity hardware, not supercrashing using commodity OSes. Linux is perfectly situated to be recommended by anyone because it is not a competitors product, per se.
The homepages of the two men can be seen here, if anyone is interested in some of the more interesting history of the two. Little of it has to do with Microsoft propaganda and the marketing machine:-
Gordon Bell [microsoft.com]
Jim Gray [microsoft.com]
They DO mention Linux (Score:5, Informative)
"The supercomputer vendors are adamant that I am wrong," Dr. Bell said. "But the Beowulf is a Volkswagen and these people are selling trucks."
All the people who are responding saying they don't mention Linux didn't read the second page.
Username/Password (Score:5, Informative)
slashdot124
slashdot
Be wary however, I registered as a North Korean military R&D official under high salary.
supercomputers and the Nuclear Test Ban (Score:5, Informative)
All this for the price of a few supercomputers every year. And the market for supercomputers pushes several technologies; for example, high speed interconnect and gallium arsenide, and sets the bar for high performance silicon. Pretty good deal, doncha think?
But now the Moron-in-Chief wants to bring back nuclear testing. [reuters.com] (pardon me, 'nookyuler.' Bush can't be wrong about something as simple as pronunciation, can he?). Farewell to deterrence. Farewell to common sense...
Advocate is not the right word (Score:2)
They simply spotted a trend, and suggested that BECAUSE of it (because of the use of Beowulf clusters of Linux machines), the focus of research should be on large data storage.
No one's said it yet? (Score:2)
Of course, I agree fully with this arguement.
How else are we going to store all that pr0n?
Oh, yeah and the other data too.
.
Better to discuss original article than NYT story (Score:2, Informative)
Cluster Farming is nothing new. (Score:2)
I'm surprised the government is still funding old-fashioned "Supercomputers" though. Well no, I guess I'm not. They're still subsidising helium [webleyweb.com] production, so why not supercomputers?
Seems like everyone who needs tons of power has been doing Beowulf clusters for years. Wish the government would catch up.
Data Rules! (Score:5, Insightful)
He also talked about CERN generating 10 PetaBytes a year when their new collider comes on line
Supercomputers are sexy, but are losing the technology war. If you start designing a new one today it will be years before it is ready. During those years Intel and AMD will crank up their clock speeds and negate much if not all of the CPU speed advantage you get from your fancy design. Why not go for parallelism from cheap machines?
rewrite existing codes? (Score:2)
I've seen code running on a supercomputer which was first written for a VAX. The authors were long gone but nobody could afford the rewrite.
I'm sure it's been said, but... (Score:2, Insightful)
Anyway, data storage is not a problem in MY field -- and I would think that government interests in supercomputing lie in pl
beowulf = Tissue | beowulf != Kleenex (Score:2, Insightful)
Re:beowulf = Tissue | beowulf != Kleenex (Score:3, Informative)
RTFA
MS Announces Win XP with 200 CPUs (Score:2)
"200 CPUs should be enough for anybody." Microsoft Chairman Bill Gates was quoted as saying.
Steven Ballmer, CEO of Microsoft, contributed the following: "Developers, Developers, Developers, Develo
Why Vs.? (Score:4, Interesting)
The national centers USE Linux Clusters already (Score:2, Informative)
The Titan Cluster [uiuc.edu]
The Platinum Cluster [uiuc.edu]
TeraGrid Clusters Successfully Installed at NCSA [teragrid.org]
These clusters run either RedHat or SuSE Linux and are available for researchers nationwide.
These clusters are not beowulf; they allow access through a general scheduler and have MPI [anl.gov] to run programs that use a group of nodes at once. This gives the greatest flexability to the users to create a computational system that can be optimz
they're all over the map (Score:2)
One fact not mentioned is that planning for storage is already an integral part of planning a supercomputing center. Also not mentioned is another predictable outcome that generating lots of data eventually requires someone, or some thing (e.g., a beowulf cluster), to analyse it. Thus, under the present trends, data mining itself as well as development of method of *how* to data-mi
Why not BOTH? (Score:2)
huh? (Score:3, Funny)
"They advocate building cheap Linux-based Beowulf clusters..."
come on guys...June 2nd, not April 1st.
Re:where? (Score:2)
Re:where? (Score:2)
They never mention linux, only Beowolf.
Re:where? (Score:2)
Re:Just imagine a beowulf cluster of (Score:2)
Hell, if we're going to talk all SERIOUS and shit (Score:5, Interesting)
Re:(-1, Asshat) (Score:2)
Re:(-1, Troll) (Score:3, Informative)
From the article
By rewriting existing scientific programs, they say, researchers will be able to get powerful computing from inexpensive clusters of personal computers that are running the free Linux software operating system. Many scientists are now adapting their work to these parallel computing systems, known as Beowulfs, which make it possible to cobble together tremendous computing power a
Re:(-1, Troll) (Score:2, Informative)
> And how much can a MS cluster scale anyway?
Windows 2000/2003 WLBS can scale theoretically scale to 32 nodes, but I have seen performance decreases after 16 or so.
Windows 2000 MCS can scale up to two nodes with Advanced Server, and four nodes with Datacenter.
Windows 2003 MCS can scale up to four nodes with the Server, and eight nodes with Enterprise.
jwg
Re:(-1, Troll) (Score:2)
Re:Possible explination? (Score:3, Insightful)
They dont work for Microsoft, Microsoft simply provides the grants that fund their research.
If anything their report would tell those who are on the MS payroll to get to work on a cluster offering.
Re:MS recommending Linux? (Score:2)
Re:Storage vs Power (Score:3, Insightful)
Microsoft Research. Big difference. (Score:3, Informative)
They employ people with the likes of Tony Hoare (invented quicksort and the 'hoare triple'). They also hired most of the core developers of the functional language Haskell. And many other brilliant minds.
Most universities could only dream of the funding that MS research has. And they're