Cell-based "Roadrunner" Tops Elusive Petaflop Mark 269
prunedude writes "The NY times is reporting that an American military supercomputer, assembled from components originally designed for video game machines, is more than twice as fast as the previous fastest supercomputer, the I.B.M. BlueGene/L. To put the performance of the machine in perspective, Thomas P. D'Agostino, the administrator of the National Nuclear Security Administration, said that if all six billion people on earth used hand calculators and performed calculations 24 hours a day and seven days a week, it would take them 46 years to do what the Roadrunner can in one day."
Summary should have a shout out (Score:3, Informative)
Comment removed (Score:5, Interesting)
Re:Summary should have a shout out (Score:5, Informative)
You're precisely correct. Cell's strength is in very predictable workloads (ones it can perform without branch mispredict penalties), very parallelizable workloads (ones that can be distributed over 6-8 SPU's / SPC's) that fit within 256 KB of local storage per SPU (manually managed cache, mapped to main memory). The non-double precision floating point enhanced version's (the version in the PS3) strength is further limited to integer and single precision floating point workloads. Roadrunner's Cell-DP eliminates that last limitation. While video games, encryption, nuke simulations and anything else that involves matrix manipulation can really stretch their legs on such a beast, general purpose computing won't find a benefit.
Re: (Score:3, Insightful)
BTW, the original Cell can do Double Precision in hardware. The big limitation it had was the DP was not PIPELINED so all DP instructions caused huge stalls in processing. You can use DP on the PS3 just fine and it's still fairly fast (especially compared to software DP) -- it's just not nearly as fast as SP.
The Cell's double precisio [hpcwire.com]
Re:Summary should have a shout out (Score:5, Interesting)
As you probably know your current desktop computer executes software 'in parallel' on multiple cores. Unless that software is written to use multiple cores then the parallelization will probably take place at the task level, in other words, unless special care was taken during the development each core will be executing a different process (or parts of a process, slicing itself up between multiple processes as long as the number of tasks is greater than the number of cpus).
The cell processor and the graphics cards you are talking about (I assume you refer to the nvidia 'cuda' architecture) are not your ordinary processor (though the cell comes a lot closer). They need special software to get the maximum out of their parallel features, which means that you can only exploit that speed when you are trying to solve a particular class of problems.
The issue is that not every problem is easily adapted to a parallel solution, and for some classes of problems it's simply impossible. For general purpose computing a general purpose cpu (with or without multiple cores) is good enough.
If you have a problem that you can solve in a parallel way (say computing chess moves, analyzing genetic strings, ray tracing an image or fluid dynamics, add your own examples here) then it makes sense to invest the extra time to code the problem up in such a way that if you have lots of cpus that they can each work on a small part of the problem.
The graphics cards are closer to a traditional vector processor than the cell, and thus even less suitable to be used as general purpose cpus.
Re:Cell processor (Score:5, Informative)
Re:Cell processor (Score:4, Informative)
Cell in addition has 8 SPUs. 1 is disabled in the PS3 for yield reasons, and another is reserved, so there are 6 available for general purpose computing.
Both run at 3.2GHz. I think Cell has at least 3x the vector/streaming power of the XBox 360 CPU, but only 1/3rd of the general purpose capability. Figures pulled from thin air, etc.
But can it run.... (Score:3, Funny)
Re: (Score:2, Insightful)
Crysis is a miserably boring game.
Re:But can it run.... (Score:4, Funny)
Re: (Score:2)
Re: (Score:3, Funny)
Re: (Score:3, Funny)
Re:But can it run.... (Score:4, Funny)
Re:But can it run.... (Score:5, Funny)
Re: (Score:3, Funny)
Re:But can it run.... (Score:4, Funny)
Re: (Score:3, Insightful)
exaflop, zettaflop, the yottaflop and the xeraflop (Score:3, Insightful)
Who cares? It's awesome sui generis.
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:5, Insightful)
They can hope for random breakthroughs, mostly based on chance/luck/etc..
Or they can follow the natural progression of things. If you want to make things explode you have to know the nature of the explosion. And to know the nature of explosions you have to know all about high-energy physics at a molecular level. And to know about high-energy physics you have to know about how molecules and atoms interact. Now, with all of these things you can either make them yourself and study the real explosion, or you can simulate it and confirm with real-world results..which is what they're doing.
They have the resources AND the desire to do so, and therefore, they are doing so. Private industries will rarely do things like this on their own. They're much more likely to wait for someone else to do the research, or research with grants and then patent the results for their own profit. Its the same reason NASA has spurred many developments and improvements in the rest of the civilian world.
This setup will make it easier to study weather, physics, etc, etc. On the other hand, it'll also make it easier to figure out how to make bigger sticks that are lighter and sharper.
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:2)
Re: (Score:2)
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:5, Informative)
Are you kidding? [warresisters.org]
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:5, Insightful)
Not really. The post you link to describes the defense budget as it dwarfs other spending, but doesn't really argue why or why not that spending is progressive/regressive.
The military was one of the first racially integrated public institutions in the U.S., it researched and funded the Internet, it's pouring money into synthetic fuels right now, and it's pushing the limits of computing power as seen in this article. There are numerous other scientific and social areas in which the military advances society, with far more practical results than do-gooders in other government or public institutions.
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:5, Insightful)
It's because the military doesn't have the scrutiny and oversight other institutions do, lets face it. Do public institutions besides the miilitary get secret prison's and liscense to do whatever the want? The military is not held back by moral qualms. We've seen this with all sorts of classified documents coming out of the government. The military has budgets that are kept secret. For anyone to claim the 'military helps us' vs public institutions, we'd have to do an analysis. But that would be fairly difficult and politically sensitive, now wouldn't it?
Re: (Score:3, Interesting)
Are you really arguing that the scientific and social advances from the military arise from secret prisons and lack of moral qualms?
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:4, Insightful)
Re: (Score:3, Insightful)
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:2)
Re: (Score:2)
Re: (Score:2)
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:3, Informative)
Each node has two Opterons and 4 PowerXCell 8 processors (an upgrade to the PS3's Cell processor). This allows a developer writing code for the platform to run in a number of different modes: all Opteron, all Cell, or something in between. The first of these (all Opteron) may constitute a significant amount of the early work on the machine by practitioners, as they can simply compile legac
Re: (Score:2)
The future (Score:2)
Re:The future (Score:5, Informative)
Cell 1 (the Playstation chip) didn't have the double precision floating performance to achieve the petaflop mark; Cell 2 is far better on that front.
Enough With The Fanboyism (Score:2, Interesting)
He was obviously asking about people with a clue, not stupid little fanboys who love to spout obviously false quotes and claims supposedly made by console makers.
Sony NEVER once claimed the PS2 was a supercomputer. Not ONCE.
The EE WAS powerful, cheap, and power efficient enough that at the time of its arrival on the market it fell under government scrutiny for its potential military uses. The EE utterly SHIT over any other chip on the market or would be on the market for another two year
So what is it for? (Score:2)
This contraption makes lots of people really, really, tired of punching on calculators?
calculators (Score:3, Funny)
But the TI-68 will cut it down to 23 years.
Re: (Score:2)
Change in paradigm (Score:5, Informative)
Re: (Score:2)
wikipedia (Score:2, Informative)
Re: (Score:3, Informative)
all BlueGene CPUs were running at less than a GHz. And it seemed those low power cores were key to HPC (high performance computing).
Supercomputing is on its way to a water cooled infrastrucure.
IBM is already selling a product under the name bluefire
http://www.eetimes.com/news/latest/showArticle.jhtml?articleID=207100873 [eetimes.com]
http://www.eweek.com/c/a/Infrastructure/IBM-Ships-First-WaterCooled-Supercomputer/ [eweek.com]
I hope we see more water & less air in the future
I feel bad for Whyle E. ... (Score:5, Funny)
so what else is new? (Score:5, Funny)
and roadrunner's always been cel-based, at least in the modern era. i bought one of those cels from the warner bros. store before they went under, nice one too with his tongue sticking out
Perspective? (Score:5, Funny)
Re: (Score:3, Funny)
Re: (Score:2, Funny)
Re: (Score:2)
ummm... (Score:5, Insightful)
probably because most of those people would either try to eat the calculator or sell it for food and medicine
Re:ummm... (Score:5, Funny)
Back it my day! (Score:3, Funny)
Whatever happened to nuked marsh mellows or sitting round with Geiger counters trying to make funny sounds?
Kids are lazy these days!
Not in perspective (Score:4, Insightful)
To put the performance of the machine in perspective, Thomas P. D'Agostino, the administrator of the National Nuclear Security Administration, said that if all six billion people on earth used hand calculators and performed calculations 24 hours a day and seven days a week, it would take them 46 years to do what the Roadrunner can in one day.
That does not put the performance of the machine in perspective at all. Technical details would be much more accurate and effective.
Re:Not in perspective - this is a media number (Score:5, Interesting)
I suspect the first example of this happening was trying to estimate how many angels could fit on the head of a pin.
Other meaningless analogies could be:
The simple fact is that a petaflop computer works faster than humans can conceive and any kind of analogy cannot be comprehended.
myke
Re:Not in perspective - this is a media number (Score:5, Informative)
No, not at all scary. It's apparently twice is fast as the BlueGene/L, which apparently set a record of 478.2 teraFLOPS. Let's assume it takes 1 floating-point operation to test a single key, which is a gross underestimate. We'll thus assume the Roadrunner can test 10^15 keys per second. Testing 2^128 keys would then take about 10^16 years.
Re: (Score:2)
myke
Re: (Score:3, Funny)
Unless the 128-bit cipher being used is weak, that is the worst case, and the average case is that it takes half that long.
Re: (Score:2)
A link from LANL itself -- some propaganda (Score:2)
http://www.lanl.gov/news/index.php/fuseaction/1663.article/d/200805/id/13277 [lanl.gov]
Take that petaflop with a grain of salt (Score:5, Interesting)
The real PITA with these machines is that the powers that be are trying to kill two birds with one stone: they want an R&D platform for advanced computing, but they also want to certify an aging and untestable nuclear stockpile. That rather requires a fairly static platform, and so far our experience with ASC has been that when a machine hits that sweet state, they yank it and give us the next one.
Computing the data pyramids (Score:2, Funny)
"Let my people goto!"
Re: (Score:2)
The result of their research: (Score:4, Funny)
WTF happened to plain old units of measure? (Score:4, Funny)
I'm glad to see the continuing trend of creatively "dumbing down" units of measure (in this case, flops) to the point where they are not only practically useless, but entirely divorced from reality. I would like to propose the following similar, hype-worthy measure for fuel economy:
Old: Miles per gallon
New: Number of miles from which one would smell the excrement from the number of cattle one could feed for a day with the amount of corn it would take to produce one gallon.
Re: (Score:2)
Yes - are they regular (infix) or Reverse Polish (postfix) notation calculators? Dumbed-down minds need to know!
Yes, It Does Run Linux (Score:3, Informative)
Now get out there and supercompute!
Old News (Score:3, Informative)
This was covered last year, and the Los Alamos website [lanl.gov] had a few interviews with some people involved on what the uses of Roadrunner are. They had a time-line of what phases are to be done, and as far as memory serves me, they were going with Opterons for the first phase, then performance assessment, then add the Cell processors in the third phase.
From these pictures [lanl.gov], it clearly shows they're using IBM Blades (4 chassis in each rack), and IBM already offers BladeQ [ibm.com] servers which use Cell processors for HPC applications. The IBM BladeQ servers pack double the CPUs of a PS3.
If you take a look at the Folding@Home project statistics [stanford.edu], you can see the performance of PS3 boxes, and almost relate...Explore scientific problems like climate change? (Score:2, Funny)
It will take new technologies. (Score:2)
NOT MILITARY! (Score:4, Interesting)
Re: (Score:2, Funny)
I misread and expected cell phone supercomputers (Score:2)
My first thought when I saw the article title was a cellphone based networked super computer. Something along the lines of Rainbow's End [amazon.com] or Halting State [amazon.com].
I wonder how many iphones would be needed to do a cellphone petaflop computer.
Human Brain (Score:3, Interesting)
Not sure about the software though...
Re: (Score:3, Funny)
Re: (Score:3, Funny)
That's probably a new contender for the stupidest metric ever, it beats 'libraries of congress per second' hands down.
Re:Question (Score:5, Informative)
Re:Question (Score:5, Informative)
After it's done with that (I wonder how they will determine what done is...), it will go classified and do nuke simulations.
Re: (Score:3, Funny)
Now there IS something of a vast global conspiracy (PNAC, Republicans, Bilderberg, etc), but, er, it's not on the p
Re: (Score:3, Interesting)
Data Collection anomalies (Score:3, Interesting)
and
I could see how these would be of concern to anybody interested in d
Re: (Score:3, Insightful)
In fact you could analyse such systemic bias by comparing data sets from varying geopolitical areas (say comparing Russian data to US data).
Re: (Score:3, Insightful)
Re:Question (Score:4, Funny)
Re:Question (Score:5, Funny)
Re:Question (Score:4, Funny)
Re:Question (Score:4, Insightful)
Wouldn't it be really neat to run some tests before you build it?
For instance, how cool would it be to have a simulation that could test a weapon being mishandled, or shot. At every single point from every possible angle at every possible velocity?
It would be nice to know that there is a possibility of detonation if it were to drop off of a loading rack.
Re: (Score:2)
So nobody has to shoot him.
Re: (Score:2)
Brett
Re:Question (Score:5, Funny)
The military will use this advanced technology to assist and perhaps automate the RTFA process, also known as Reading The Fucking Article, which would allow you to answer your query without posting.
Re: (Score:3, Funny)
They could tell you, but then they'd have to kill you.
Re: (Score:2)
DARPA Files Patent On Predictive Simulation [slashdot.org]
Modeling Urban Panic [slashdot.org]
Re:Question (Score:4, Insightful)
Uh
Re: (Score:2)
Uh...codebreaking?
Re: (Score:3, Informative)
Cell was the brainchild of Ken Kutagari of Sony and Peter Hofstee of IBM.
Re: (Score:2)
This CELL is not single precision (Score:5, Informative)
Things move fast in technology Jethro, including this 2nd gen of the CELL proc, this is what you missed:
Double Precision FP - 190TFLOPS (5 times faster than 1st CELL)
Memory: Expanded to 32gb
Memory: DDR2 instead of Rambus
65nm (I know, I know, but it's better than 90nm)
Re: (Score:3, Interesting)
Re: (Score:3, Informative)
a) PS3 "Classic Cell" 1 PPC64 w/ 2 threads and 7 SPEs (8, but one disabled, defective or not):
GFLOPS 64-bit (double): 3.2GHz * 1 FLOPS/Hz * 7 SPEs = 22.4 GFLOPS (huge penalty, because of simulation via unoptimized simple precission operation)
GFLOPS 64-bit (double) a optimized 32-bit operation [netlib.org]: 3.2GHz * 3.9 FLOPS/Hz * 7 SPEs = 87.36 GFLOPS
Re: (Score:3, Interesting)
Allow me to oblige ... (Score:5, Funny)
Some other equally useful analogies:
Take the same aforementioned people, and give them a OLPC. The amount of time it takes them all to calculate their degree of separation from Kevin Bacon [wikipedia.org], and divide by a googolplex [wikipedia.org]. , then round up. That is the number of people that think the calculator analogy in the article was a good one.
Take the inverse of the clock frequency and multiply it by the number of instructions required for Windows to boot far enough to attempt to obtain an IP Address dynamically. Add to that the time it takes for the DHCP request to reach your Billion made router [apcmag.com]. That is the amount of time it takes for it to hose your router. Take the inverse of the clock frequency and multiply it by the number of instructions it takes to apply a service pack. Add it to the boot time, calculated as described above. That is the amount of time it takes to achieve a BSOD.
HTH,
- Thomas P. D'Agostino