Grid Computing: Conceptual Flyover For Developers 79
An anonymous reader writes "This article relates many Grid computing concepts to known quantities for developers, such as object-oriented programming, XML, and Web services. The author offers a reading list of white papers, articles, and books where you can find out more about Grid computing."
Re:Hey (Score:3, Insightful)
The beowulf cluster joke is dead, long live the beowulf cluster joke!!
So what exactly is "grid computing"? (Score:5, Insightful)
Nope- that still does not tell me what "grid computing" is. This vague, loosely defined definition can describe just about every "next big thing" since the mainframe.
Re:So what exactly is "grid computing"? (Score:1, Insightful)
yeah its vague but that narrows it sopmewhat.
Re:So what exactly is "grid computing"? (Score:2)
However, it is much more than just plugging together a bunch of computers and calling it a Grid -- it may constitute a cluster, but a Grid requires more.
And that PS2 thing - it's cool and nice that you can do it, but it's not even a proof of concept because as long as you can harness any kind of computational power, you can make a cluster or a grid out of it.
It's probably a whole lot cheaper and optimal to do so with a bunch of o
Re:So what exactly is "grid computing"? (Score:1, Informative)
And yes, I understand the difference between a Cluster and a Grid.
Re:So what exactly is "grid computing"? (Score:4, Informative)
In a few words: grid computing is the use of many connected computers for one task.
Or, you might want to think about it as multithreading, but spread out over multiple machines.
The author is making a case for a standardization of how this should be handled.
Re: So what exactly is "grid computing"? (Score:5, Informative)
In a way it's a matter of taste, but I'd define it this way: "parallel" -> many CPU's, but quite close in 1 place, like in a SMP desktop. "distributed" -> with network in between, as in Beowulf cluster (possibly over the internet).
What would make "distributed" a Grid? The fact that it's 'everywhere', always working/available somewhere, like P2P networks. You can take your equipment off the network, but the network (ehh, grid) goes happily on doing its thing.
This becomes really useful when it's a easy to use and commonplace as the internet today. Send out some software, it grabs a piece of data here, grabs a program there, finds a server to do the computation, and reports back to you with the result. Got some cycles to spare? Put some in the Grid, earn money. Just wait and see, some day computing power will be supplied and consumed the way electric power is today.
Re: So what exactly is "grid computing"? (Score:1)
No it won't. Computing power is still going to increase exponentially and reduce in price as per Moore's statement. With the advent of new material science, such as diamond microprocessors, the power of each individual processor will continue to improve.
The added flexibility of x86ish based clustering will allow for small businesses to increase their ability to get work done and outcompete the la
Re: So what exactly is "grid computing"? (Score:2)
With networked co
Re: So what exactly is "grid computing"? (Score:2)
Two things:
1) Moore's law won't keep going forever.
2) Networking is currently going faster than Moore's law.
It is true that the relative amounts of CPU and networking available at any given moment do mat
Re: So what exactly is "grid computing"? (Score:1)
If you need to have ONE memory image, with all CPU's using the exact same memory image at the exact same time, then that's Grid computing.
If you have many CPU's, each with their own memory image, and they can perform their work no matter what the other CPU's are doing, then that's Distributed computing.
Put this way, I believe you can really see the differences between Grid and Distributed
Re:So what exactly is "grid computing"? (Score:2)
First, he vastly over values network cost. If one uses more rational numbers the tables turn.
Second, he only thinks of Grid technology as cluster management. This is *grossly* wrong. The Grid is about access to resources. A node with CPU to run a job is only one kind of resource. The network itself is a resource, schedulers on clusters are a resource, data movement managers are a resource, scientific instruments are resources, the list goes on and on.
Re:So what exactly is "grid computing"? (Score:2)
Re:So what exactly is "grid computing"? (Score:5, Interesting)
Consider this: most IT departments are being forced to do more with less. Budgets are tight, resources are thin, and skilled human resources can be scarce or expensive. To top it off, most corporate managers know that they have a super-abundance of idle computing power. It's well known in industry circles that most desktop machines only use 5% to 10% of their capacity, and most servers barely peak out at 20%. No surprise then that many of the big money people in corporate America balk at the thought of purchasing more equipment to get the job done. What these companies need is not more horsepower, but more efficient use of existing horsepower. They need a way to tie all of these idle machines together into a pool of potential labor, manage those resources, and provide secure and reliable access to the number-crunching muscle. Imagine if a corporation or organization could use all of its idle desktop PCs at night to run memory- and processor-intensive tasks? They would get more work done faster, possibly get to market faster, and at the same time cut down their IT expenses.
The idea seems to be to turn the whole network into a cluster. "Why buy more servers when you can gove some of the load to your desktops?" is a short summary.
Re:So what exactly is "grid computing"? (Score:3, Informative)
The idea seems to be to turn the whole network into a cluster.
This is answered in the FAQ at gridcomputing.com [gridcomputing.com]:
The key distinction between clusters and grids mainly lies in the way resources are managed. In case of clusters, the resource allocation is performed by a centralised resource manager and all nodes cooperatively work together as a single unified resource. In case of Grids, each node has its own resource manager and don't aim for providing a single system view.
Re:So what exactly is "grid computing"? (Score:3, Informative)
Todd Tannebaum just gave an exec lent keynote at Boise State's HPC Workshop. He explained that while computing power has increased on a system by system basis, the total available computing power to a single person
Re:So what exactly is "grid computing"? (Score:3, Informative)
As far as the user is concerned, you have 1 Black Box system containing everything.
The physical implementation could be a single supercomputer, or a whole host of different systems spread all over teh place. But the the end user, it's just a single computer that handles all their stuff for them.
That's a grid. Check out http://wwws.sun.com/software/n1gridsystem/ for a good overview of how this can be implemented.
Remember, a true grid system is more than just raw CPUs, it's all a
Re:So what exactly is "grid computing"? (Score:2, Informative)
The aim is get rid of the usual limitation to clusters (homogenious hardware/os) and make use of all the clusters in a virtual organization (think university network, distributed.net, seti@home protein folding and similar research institutions). You can then use a resource broker to specifiy you resource, say all big-endian machines with more than 1 gb of memory or all pentium machines.
Re:So what exactly is "grid computing"? (Score:3, Funny)
I like to think of "grid computing" as the idea of building technology with the global use in mind: pluggable security models, standard protocols for job management/data transfer/etc., and so on. how to build services with the future (where the future is sharing those services) in mind.
grid computing MAY take the form of: shared supercompu
Re:So what exactly is "grid computing"? (Score:1)
Ian Foster's definition is here http://www-fp.mcs.anl.gov/~foster/Articles/WhatIsT heGrid.pdf [anl.gov]
A book he edited is often considered the an excellent starting point, The Grid: Blueprint for a New Computing Infrastructure. His website http://www-fp.mcs.anl.gov/~foster/ [anl.gov]
Only reason why I know anything is because I have a 10 page report due before the end of the week for my High Performance Computing module. Yippee.
Re:So what exactly is "grid computing"? (Score:1)
Re:So what exactly is "grid computing"? (Score:1)
Change can be hard (Score:5, Interesting)
One thing I have noticed is that for many "old timers" there is the feeling of we have always done it the old way, why change. Any thoughts of how we drag that old donkey into the new methods when they don't want to go?
Re:Change can be hard (Score:5, Insightful)
Unfortunately for Grid Computing its still in the stage where people are struggling to explain what it is, nevermind what it does or how it can improve life. Thats always going to be a hard sell to me anyway. If its function is not obvious it makes you sceptical just how necessary it is.
Re:Change can be hard (Score:3, Insightful)
The overheads will be enormous though, at least initially. Security issues, data issues, even logging issues have to be thought through. Any system that implements this will be *fucking* complicated to work through.
In any case, the "old
You're stuck in the Von Newman model (Score:1)
It's a distributed hierachy of
Re:Change can be hard (Score:2)
Perhaps if they can add on some value to the basic service, there
Why is slashdot pushing grid computing so much? (Score:4, Informative)
Re:Why is slashdot pushing grid computing so much? (Score:1)
I'm not shocked that its quite an old story, whats more shocking is that it appears to be an original posting...
Re:Why is slashdot pushing grid computing so much? (Score:1)
Unready Hype (Score:5, Interesting)
We were taught that the test of research software is whether a full professor (or corporate executive or other obscenely busy person worth >> $100/hour) finds it useful enough that they take time to learn it - the uses I've seen for the Grid don't pass that threshold yet.
There are some exceptions: tightly-integrated applications put together in a couple of the hard sciences that really just do supercomputing with a friendlier face. There's enough payoff there for a physicist to be happy with the software.
For a geek, however, even there, most "grid UI research" is simplistic, derivative, and uninspired.
Apologies to my first-ever-advisor who is now a Grid bigwig.
Re:Unready Hype (Score:1)
Doesn't sound all that different than a lot of software I know.
Re:Unready Hype (Score:2, Informative)
It has been standard practice over the past couple of years to overhype Grid, a practice which I suppose was intended to bootstrap interest but which instead just tends to leave people feeling confused and vaguely betrayed as they discover that what was presented as a production capability turns out to be a research project in its early stages. The article is typical of the approac
Grid (Score:2, Funny)
"No Neo, try again"
"What is grid computing?"
"Bingo."
There's certainly alot of info to devour there, but I guess if companies like Google and Dreamworks are using it, then it has to be a Good Thing.
Google and Dreamworks (Score:2)
Security in Grids (Score:5, Interesting)
First and foremost, grids are designed to run in a distributed environment which makes security design and administration that much more complex.
Second, grids are currently in their infancy and there is little prior art to the types of attacks and problems that will affect them. Despite this, they are very juicy targets with the kind of storage and bandwidth that would make even a hard-bitten cracker weep for joy. (i apologise for the imagery)
Third, in my book security has to be a top-down approach - i.e. the guys on top lead the way and then everyone else follows. Grids have no tops or bottoms which makes this a bit tough to apply. In short there is no security hierarchy in a default grid environment. Responsibility HAS to be established explicitly. A simple example is who is responsible for the data held on one of the nodes? Is it the person who wrote the application, the person who owns the application, the person who owns the hardware?
Grids are fascinating in their security requirements (and those who think these are solved by web services have another thing coming! People are a huge aspect of the security of a system, and distributed system like grids have a very complex task of ensuring that people behave the way they should).
Re:Security in Grids (Score:2)
I'd have to say that in its essence, grid security is just traditional security on a larger scale with heavy X.509 use. sure, there's federation ideas and such, but there's nothing nothing "new" and unique to grid computing. we already have tons of users using public key authentication (agent systems, like "proxies" in grid-speak) from a distributed set of sit
Ancient Technology (Score:1, Insightful)
Grid Computing is a buzzword (Score:5, Insightful)
Re:Grid Computing is a buzzword (Score:2)
ENGINEERING JOBS L@@K!! (Score:2)
SpreadThin is a BayAreaStartup with a Mission. The Mission: to combine Nanotech, BioInformatics, and GridComputing to create the NewParadigm for MoleculeBasedServices! Your Task: to synthesize these concepts into a Marketable NewParadigm. RoomAndBoard + Equity.
Knowledge of WikiEmergencyMaintenance is DesirableButNotRequired. StartingImmediately.
Resumes in HR-XML Resume
Well it's ONE view on Grid Computing (Score:4, Informative)
The grid discussed here seems only to be built on the OGSA and Globus Toolkit, and Globus has not really covered itself in glory with their poor UIs etc.
Grid seems to address occasional demand for "much more power" from your computing resource, but does not really provide a consistent flexible computing resource.
The academic world uses External Grids to pool resources [bioinformatics.org] but private Enterprise has little to gain from these External Grids in exchange for a HUGE security problem.
And Internal Grids? These are so immature as to beggar belief. Why risk investing in these configurations when bang per buck is so uninviting.
Re:Well it's ONE view on Grid Computing (Score:1)
Yes, I do think that external mega-grids are all about bringing cpu power to bear on an occasional problem (more appropriate to poorly-funded academia) and they are a security nightmare. Also internal grids are unwieldy,
Re:The name sucks (Score:2, Interesting)
Kind of similar to a power grid no? plug toaster, insert bread, get toast - no need to worry about coal/oil/nuclear fuel burning, transfomers or megawatts...
Re:The name sucks (Score:1)
Another re-invention of the wheel (Score:2)
Re:Another re-invention of the wheel (Score:2)
I think the difference is the attempt to scale this thing up no end; all the data coming of the CERN LHC will be grid processed; this is one of those problems where the grid is ideal: The data is not secret, you need lots of CPU power and you have PhD students to throw at the engineering of the whole thing.
Classic web sites and the like are a different problem, one where the goal is not so much infinite scalability, but infinite flexibility. The deployment stuff I work on is designed to deal [smartfrog.org]
Re:Another re-invention of the wheel (Score:2)
Re:Another re-invention of the wheel (Score:1)
RPC is bad (Score:2)
Yes, and that's why it didn't work: thinking about distributed computing as a bunch of procedure calls that happen to be remote is wrong. The sad thing is that a sizeable number of people still thinks it's the way to go (e.g., all the SOAP adherents).
This to me is just a nother refashioning of age old ideas so the people involved can justify their re
Practical application... (Score:3, Interesting)
If there were an easy way to cluster their workstations they wouldnt need to invest in an underutilized server farm. They could just schedule their processor/disk intensive reports and processes for off hours or rely on grid load balancing to take the extra cycles from the computer of the CSO (Chief Solitaire Officer) so that the impact would be imperceptible to the average user.
The current problem with the concept of grid computing is the lack of an easy way to deploy it in a standard business environment. What the article and its links are driving at is coming up with a cheap and easily implimented mechanism to turn every office, and chain of offices into a grid.
In theory, you could sell your unused processor cycles the same way people who generate their own power sell power back to their power companies. You ISP could actually, someday become a processor cycle reseller and you could operate on a minimal set of hardare in the typical office enviroment becuase you can always pick up extra cycles from your ISP when you need them.
Ah, the pipe dream.
Before you say, "How useless"... (Score:1)
There are several competing definitions of "Grid" going around - from the happy-big-cluster idea that Apple calls Xgrid (bad name, good product, IMHO) to the TeraGrid and NCSA grids in the US to the LCG/Grid3/Nordugrid to SETI@home. They all speak different languages and are built on different models.
Most of the definiti
Does this really have any business applications? (Score:1)
On the other hand it does mean that new networks can be created using less resources, but at the moment the biggest interest in this would probably be from the scientific community to do really intensive proces
GRID = CORBA or DCE Repackaged (Score:2)
My experience with DCE at least was that it was a distributed environment that took a lot of coordination between systems, which unfortunately was not done very well in the environment I'm familiar with. As a result of this it did not prove robust e
Re:GRID = CORBA or DCE Repackaged (Score:1, Interesting)
Grid is a set of capabilities which must satisfy requirements at a much higher conceptual level. Whereas we might say that DCE and CORBA define the atomic structure, Grid provides the biochemistry.
If grids worked, hosting companies would sell them (Score:3, Interesting)
Hosting companies have large numbers of identical machines with high bandwidth interconnects. That's just what you want for "grid computing". They're already set up to allow customers to run applications on their machines, and are able to deal with the security problems. Load is very low during off-peak hours. The machines stay up; they don't suddenly get disconnected from the net because somebody turned their desktop off. They're all loaded with the same base software. It's the ideal situation for commercial "grid computing".
So why is nobody selling this? Because there's no market for it. There's no real commercial market for supercomputer time, distributed or otherwise. Once upon a time, from about 1960 to 1980, there were engineering computer service centers, where you bought time-sharing service on big mainframes. Control Data and UNIVAC were the preferred machines for this. But that business is dead. CPU time became too cheap.
A well-known commercial grid was Gateway Processing on Demand [archive.org], announced in late 2002 with great fanfare. Gateway offered "grid computing" on thousands of Gateway-owned machines. They quietly dropped that service some time last spring. Their former CEO admitted that it generated "not a lot" of revenue. Basically, it was an attempt to generate some revenue from Gateway's unsold inventory of machines.
Grid computing is one of those schemes where all the interest is on the sell side. Nobody wants to buy it. "Micropayments" and "portals" are like that. They didn't sell either.