Visions Of The Future Of Grid Computing 145
CaptianGrid writes "Computing grids, or software engines that pool together and manage resources from isolated systems to form a new type of low-cost supercomputer, have finally come of age. BetaNews sat down with some of the world's leading grid gurus to discuss the significance of such distributed technologies and separate grid hype from grid reality."
hm (Score:2, Funny)
Re:hm (Score:2, Funny)
Re:hm (Score:2)
Re:hm (Score:2, Offtopic)
Inspired by? (Score:5, Funny)
Univa
Univac - a successor of Multivac, the largest computer in Asimovs world.
Nerds - they get everywhere.
Re:Inspired by? (Score:1)
No kidding, the fact you're citing Asimov's Univac and Multivac speaks volumes.
Re:Inspired by? (Score:5, Informative)
Re:Inspired by? (Score:2)
Re:Inspired by? (Score:2)
Looks like you're right.
The UNIVAC was introduced by Remington Rand. Remington Rand merged with Sperry in 1955 to form Sperry Rand. Sperry and Burroughs merged in 1986 to form Unisys.
See http://www.unisys.com/about__unisys/history/index
Only way (Score:2, Interesting)
Not the only way. (Score:5, Interesting)
Re:Not the only way. (Score:4, Informative)
For example, let's say you're trying to determine the best FOO, and running a FOO is a highly serial process. Even though you can't split up running each FOO, you can pass the processing of each FOO test case that you want to run to a different machine.
True - sometimes, you need the results of your previous run in order to plan your next run, and sometimes your *only* need is a single run of a very simple algorithm for which there can be absolutely no parallelization. However, more often than not, even algorithms that need tightly coupled data have at least *one* stage which can benefit from parallelization - and you really need only one to get the benefits.
Re:Not the only way. (Score:3, Interesting)
Unless your task is signficantly computationally demanding, this overhead can significantly outweigh just doing the task directly, regardless of how parallel the task can be.
Yes, essentially the latency (Score:1)
Re:Not the only way. (Score:3, Insightful)
This would help [slashdot.org].
Unless your task is significantly computationally demanding
Isn't that what we're talking about here?
Of course, one can't forget the main benefit of grid networking: it's *cheap*. You get a *lot* more CPU horsepower for your dollar, so you want to use it on "computationally demanding" tasks if you can.
Re:Not the only way. (Score:3, Informative)
Here is a link to a cool Java applet that shows all jobs running on the European research grid:
LCG2 Real Time Grid Monitor [ic.ac.uk]
Re:Not the only way. (Score:1)
Re:Not the only way. (Score:2)
Re:Not the only way. (Score:1)
Re:Not the only way. (Score:3, Insightful)
Re:Not the only way. (Score:1)
For instance, configuration changes would likely rely on what went wrong in previous tests. It does not make sense to just randomly try different configurations if the exercise is to solve a particular problem.
Re:Not the only way. (Score:2)
Re:Not the only way. (Score:1)
Considering the vast amount of money the DoE spends on massively huge clusters, I think the clear answer here is 'yes' ;)
Re:Not the only way. (Score:1)
Well, if all the bombs went off at once that would be WW3 wouldn't it?
Beowulf cluster (Score:2, Funny)
Re:Beowulf cluster (Score:5, Funny)
Actually, this is the great promise of grid computing. Once it becomes ubiquitous, people will stop trying to imagine Beowulf clusters!
Of course, Soviet Russia broke up, and we still hear about that...
Re:Beowulf cluster (Score:1)
Re:Beowulf cluster (Score:1)
Re:Beowulf cluster (Score:2)
In Soviet Matrix, Beowulf cluster imagines YOU!
X-Grid (Score:5, Informative)
It runs on any OSX system, 10.2.8 and up. Put your spare cycles to work.
Xgrid: High Performance Computing for the Rest of Us
HPC for very specialized problems, maybe. (Score:5, Informative)
However, don't even begin to think you'll be solving anything that requires any sort of processor to processor communication. Rocket simulation (our local favorite example here at UIUC) for instance is heavily communication based.
The linpack benchmark that top500 uses also needs a low-latency interconnect to perform really well, so don't expect to see "the grid" sitting up at the #1 supercomputer slot on top500.org anytime soon (or really, ever, unless someone develops FTL networking). Latency on the internet in general (and specifically around the world thru all those switches and latest_slashdot_hot_chick_movie.torrent packets) is nothing near what a supercomputer needs.
Now, there are research groups looking at ways of making communiation delays less of a problem, including the one I was in while I was in grad school. There's a number of ways to do it, but none of them I've seen are going to take on worldwide-network-latency and survive with their performance intact.
Even something as "simple" as chess wants to have a fast interconnect - every node that's gotten stranded working on low-priority (bad move) work is a wasted node you may as well not have.
Re:HPC for very specialized problems, maybe. (Score:4, Insightful)
Re:HPC for very specialized problems, maybe. (Score:2, Insightful)
Depends on how you utilize it. (Score:4, Insightful)
The reason for such an arrangement is that high-speed interconnects are expensive. Building a single cluster that is uniformly very high performance would be horrible for anyone other than a very rich organization to consider.
On the other hand, grids alone are way too slow to handle the needs of time-critical communication, which is what you have a lot of the time in parallel computing.
A hybrid, able to place components of a problem according to that component's needs, would seem to be the logical solution. It is also the scalable solution. Clusters often have an upper limit in size. By having grids of clusters, you have a virtually infinite capacity. True, there simply aren't any clusters that have reached the upper limit. Yet. But it's getting tough at the size they are at right now.
Re:HPC for very specialized problems, maybe. (Score:2)
Grid is not just for parallelisable codes, but also for the ability to find a resource to run some code on, collecting and virtualising resources, and creating
Re:HPC for very specialized problems, maybe. (Score:1)
Grid computing is supposed to be usefull for exactly the type of problems you say they are useless for. At least the two i have worked with/on (NORDUgrid and LCG).
The idea is interconnecting supercomputers, so that your specific program,
Re:X-Grid (Score:5, Informative)
Re:X-Grid (Score:4, Insightful)
You have the in-house developed Condor [wisc.edu] that's amazing!
Re:X-Grid (Score:5, Informative)
(i) Curiosity. Xgrid was new, and looked interesting.
(ii) Potential. Xgrid (final version) is going to be bundled with the upcoming 10.4 release of the Mac OS. That's an awful lot of machines that will have Xgrid preinstalled, with the user basically just needing to click "start."
B
Re:X-Grid (Score:2, Funny)
Condor isn't a grid example! (Score:1)
Re:Condor isn't a grid example! (Score:2, Informative)
Re:Condor isn't a grid example! (Score:2)
I understand that Globus is the big API that everyone uses to do grid computing, but you can't just use it to run jobs on a grid - you'd have to write an application to do that. Why the bigotry? I'm s
Re:Condor isn't a grid example! (Score:2)
Commoditization of grid computing... (Score:5, Interesting)
I think that for full standards compliance, you'll need to look to companies which don't offer their own computing resources -- platform-agnostic companies. But then who do you buy the compute resources from? Unless you're buying your own systems for use (which makes "utility computing" less viable), it's a bit of a catch-22.
Re:Commoditization of grid computing... (Score:3, Interesting)
The advantages of IBM and Sun are that they can just pull racks of servers out of their factories at cost and pay their own people to set them up. This will always give them a price advantage over the platform-agnostic competition who would be using the same Opteron, Xeon, SPARC, or POWER CPUs, anyway.
Re:Commoditization of grid computing... (Score:2)
I recall that Intel was at least incubating a renderfarm company.
Re:Commoditization of grid computing... (Score:2)
Plan 9 & Inferno (Score:5, Informative)
Computing grids (Score:4, Funny)
Pure Bolshevism, that's what!
Another round of tech job deaths? (Score:1)
Echoes of Carly (Score:5, Funny)
What we provide is primarily an implementation of Web services standards to allow people to build services, and the primary goal is also for us to provide a set of pre-defined services that allow you to use Web services protocols to interact to request the allocation of compute resources, the creation of computational services and moving the data from one place to another and so forth.
Does this sound like Carly Fiorina attempting to explain HP's strategy to anyone else?
Grid: loaded word (Score:5, Informative)
Frankly, I don't like the word Grid being applied in this way. However, the latter technology is facinating (virtual OS) and will come to dominate computing in the next few years.
The basic idea is total abstraction of the application/service from hardware/location. The app gets the resources it needs, can be cloned/replicated to another location for distaster tolerance, and can scale and grow on demand based on needs by simply throwing more hardware modules at it. It's not just limited to computing but also applied to storage and network.
Re:Grid: loaded word (Score:1, Interesting)
Re:Grid: loaded word (Score:2)
Virtualization alone is just another layer to manage. There's no point in hiding a technology if you are replacing it with an equally complex and less efficient one.
You are correct that virutalization adds expensive overhead, and it can be complex and inflexible.
I believe the solution resides where the costs of downtime associated with direct associations with hardware outweigh the costs of virtualization. Virtual Memory
Nope, wrong. (Score:3, Informative)
Re:Grid: loaded word (Score:2)
The first is at the low level - containers in which to run an operating system. This allows the system to be provisioned with the required OS that the user requires, no matter what the hardware layer. This allows the user more options of where their jobs might run rather than scouring the world for the one server that has the right OS, is cheap enough, and can have the job done by next Tuesday.
At the higher level there is the virtualisation of collections
Re:Grid: loaded word (Score:3, Insightful)
Me neither, but for slightly different reasons.
The main definition of a grid is a pattern of intersecting lines. While sun or ibm may arrange their computers neatly in rows of vertical racks and build it in a grid pattern physically, nothing of this remains for the actual use or architecture of so-called grid computing. This leaves large swaths of parallel algorithms by the wayside. The only things you can efficiently compute on a grid are the "emba
Re:Grid: loaded word (Score:3, Insightful)
Oh boy... (Score:5, Interesting)
Globus is now "web services" and not "GRID". GRID is so last century. It's far more cool now that it's in Java too. Anyone still working on GRIDs should search/replace immediately!!!
And did they drop the name of every single business partner they have in that article, or did only I notice that?
Re:Oh boy... (Score:2)
Actually, Java would be perfect for managing a grid. If you have a network of Linux, Windows, Solaris, AIX, HPUX, etc. computers, you can use the same Java management application on all of them, with only extra code for nuances of issuing jobs, etc., on each platform. Networking in Java is really easy, too.
Re:Oh boy... (Score:3, Informative)
IBIS: java-based grid computing (Score:2)
If you think so, check it ibis [cs.vu.nl]. You can download the latest version from the link and play around with it. Or read this old slashdot story [slashdot.org]
disclaimer: I'm not directly involved with the project, but working in the same group as the developers. And I don't mind pushing a good idea
Blackout 2003 (Score:5, Insightful)
mod parent up (Score:1)
Quote TFA:
Sun Microsystems recently unveiled a new grid computing offering that promises to make purchasing computer time over a network as easy as buying electricity and water.
That sounds very much to me like Sun's another try to warp the world back to the "classical" server/dumb-terminal era.
Re:Blackout 2003 (Score:2)
Redundancy is key for obtaining reliability. Because of this, just relying on yourself is not going to improve things but make them worse. I'm not familiar with the actual numbers of power availability in your area in the U.S., but I suspect in the past 10 years you've had
Re:Blackout 2003 (Score:2)
For instance, if you had solar panels, or a small wind generator, you couldn't completely power your house, but you could keep essential things running like your fridge.
In this situation, unlike electricity and water, computers can be run completely on their own without a grid, as evident by current computer setups. So why mov
Web service and the cost to "confederate" (Score:2, Interesting)
Threads (Score:2)
Sure, it may be much harder then migrating a whole process, but too often spawning whole processes is simply not the answer to SMP programming.
Thus far, I have looked at Mosix/OpenMosix and OpenSSI, and both fail here. Can anyone give me some insight perhaps? Maybe I am missing somethi
Re:Threads (Score:1)
Re:Threads (Score:1, Insightful)
Re:Threads (Score:2)
Re:Threads (Score:1)
Moving threads is possible and desirable. Think of a single MySQL server spread over a cluster.
I did some work on that at Uni years ago. You can move threads from one machine to another transparently. You page in memory over the network on demand, mark "dirty" pages, and send page diffs back. It's neat to see it working (two threads running on different machines), but network latency is a problem.
Google for distributed shared memory for similar projects.
Solution to Intel heat problems (Score:3, Funny)
Grids eh! (Score:3, Interesting)
Make money at home? (Score:2, Interesting)
What I want in a grid (Score:4, Interesting)
Grid in the sense that if my datacenter needs more resources, I just plug in a blamk PC with extra CPU/MEM/Disk and not worry about it. Or if one goes bad, I just rip it out without worring about what it will destroy.
Virtualisation in sanse that if I need an email server - I just create a virtual one on this grid and let it go, if I need a DNS server - I just create one on this grid and let it go, a web server
That is my idea of a true grid.
Great until everyone needs cycles at the same time (Score:4, Insightful)
I wonder how long SETI@home will last if home PC users realize they can "sell" cycles to meet for-pay demand for computational power.
Re:Great until everyone needs cycles at the same t (Score:2)
Sun has been experimenting with EBay for quite a while now. It would be pretty neat if they could figure out a way to auction off chunks of their grid on some sort of how-much-and-how-soon basis, like you say. If a movie company or fluids dynamics contractor needs the whole thing yesterday, they would be willing to pay a premium for not having to make a grid of their own and get a few thousand CPUs _right_now_.
Re:Great until everyone needs cycles at the same t (Score:2)
Re:Great until everyone needs cycles at the same t (Score:2)
Grid laptop (Score:2)
Re:Grid laptop (Score:1)
Re:Perhaps... (Score:2)
P2P? (Score:2, Interesting)
The guy in TFA talks about P2P being another type of grid and that a family could create a distibuted environment for shared data. He also talked about trust.
My idea is that with adding strong encryption you get basically small priate network that is almost impossible to crack. DVDs + CDs + Encrypted P2P among a small group of people == Old Skool Sneakernet (aka borrowing your friend's stuff). You and your friends can share all the entertainment among yourselves as you like. All you need is a P2P-type
Re:P2P? (Score:2)
Spammers have it figured out (Score:1)
Somebody please analyze what the malware world is doing, and share it with the grid computing gurus. The technology can't be THAT different, can it?
Re:Spammers have it figured out (Score:2)
The problem of sending 10 billion identical emails is basically parallizable to 10 billion pcs without a problem, timing isn't important, only small amounts of data must be transfered to and from a controlling central host,...
DDOS attacks require timing accuracy of a few seconds, better isn't necessary if you have a lot more capacity than needed (which is no problem if you don't pay for it).
Most legal applications for clusters, grid computing,... are much more difficult sinc
Convergence of Grid and Virtualized LSB (Score:5, Interesting)
Take a pinch of Standard Linux [linuxbase.org]
Wrap it up in Xen [cam.ac.uk]
Add a touch of SELinux [nsa.gov]
And a little bitty bit of Globus [globus.org]
Oh like a Sandboxed Platform [blogspot.com]
Oh Lordy, Lordy, mixed with Free and Open Source Code [freshmeat.net]
You know you lump it all together
And you got a recipe for a Multi Vendor Development scene [google.com]
It is coming though, you know, you know.
What we have is a great big melting pot
Big enough enough enough to take every vendor and all IT's got
And keep it stirring for a hundred years or more
And turn out Application Service [google.com] and Content Providers [ostg.com] by the score.
With apologies to Blue Mink [dustygroove.com] .
Re:Convergence of Grid and Virtualized LSB (Score:1)
What it's all about (Score:2, Interesting)
Oh come on. (Score:4, Informative)
The problem is that most of the CPUs out there run Windows, which is currently damned near useless for this kind of thing. It'll require a rewrite of the OS to take proper advantage of the potential of a network of windows boxes for general purpose computing. OTOH, a couple of shell scripts and SGE (http://gridengine.sunsource.net/) does the job on Linux and other Unix systems.
The network problem; somebody do the maths (Score:5, Interesting)
EXAMPLE: IBM is currently offering CPU/Hour service in Houston to oil and gas companies. Sounds great till you realize the multi-terabyte files that consume such a massive compute service are too big to be readily sent over the network. Instead they use vans to haul tape and disk over to IBM and then run the process on it.
What is the bandwith of a station wagon? Right now its faster than the internet on a 20 mile drive across Houston.
But even take it a step further and the ratio remains. What if I wanted to pay Sun/hr for CPUs while I worked on a big Maya render of 200 gigs. By the time I've sent that over cable modem have I gained a ton in performance time?
The problem I see is that we are making CPU massively parrallel but not networks. So will it EVER make sense to send a massive file to a commercial grid over a singular network connection.
Somone should do the math.
Re:The network problem; somebody do the maths (Score:1)
It's only the Internet that's holding back Disbursed Interconnected Grids (DIGs). But that's why there's an Internet2 to focus on greater bandwidth and reduce latency times. With such large data sets the other side to the coin is data compression or the ability to transform it into something more manageable while
Re:The network problem; somebody do the maths (Score:1)
Re:The network problem; somebody do the maths (Score:1)
I just wonder how much further that reduces the market which is already limited. It seems like the highly graphicaly intensive oil and gas crowd and rendering crowd might not participate as easily in the remote grid shift.
I'd actully love to be wrong!
Intresting (Score:1)
Like I said though I will probably look back and call myself a moron,
Imagine... (Score:1)
Quantify the deliverables (Score:1)
"Deployment of deliverables on-cycle and on-quota"
I HATE management-ese. It's nothing but BS. "...deployment in the science space," "...focus on vertical particulars, financial services for example..."
If you need to generate obtuse buzzwords to justify your job, I need to generate ways and means of deploying you at the unemployment line. This is worse, because the guy seems to come from academia, not industry.
Here's a worthy grid project, if anyone's looking. (Score:1)
Current Project:
Human Proteome Folding Project: A layperson's Explanation
Proteins are essential to living beings. Just about everything in the human body involves or is made out of proteins.
What are proteins?
Proteins are large molecules that are made of long chains of smaller molecules called amino acids. While there are only 20 different kinds of amino acids that make up all proteins, sometimes hundreds of them make up a single protein.
Adding to the complexity, protein