Cell-based "Roadrunner" Tops Elusive Petaflop Mark 269
prunedude writes "The NY times is reporting that an American military supercomputer, assembled from components originally designed for video game machines, is more than twice as fast as the previous fastest supercomputer, the I.B.M. BlueGene/L. To put the performance of the machine in perspective, Thomas P. D'Agostino, the administrator of the National Nuclear Security Administration, said that if all six billion people on earth used hand calculators and performed calculations 24 hours a day and seven days a week, it would take them 46 years to do what the Roadrunner can in one day."
Summary should have a shout out (Score:3, Informative)
Re:Question (Score:5, Informative)
Re:Question (Score:5, Informative)
After it's done with that (I wonder how they will determine what done is...), it will go classified and do nuke simulations.
Change in paradigm (Score:5, Informative)
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:5, Informative)
Are you kidding? [warresisters.org]
Re:err (Score:3, Informative)
Cell was the brainchild of Ken Kutagari of Sony and Peter Hofstee of IBM.
Re:The future (Score:5, Informative)
Cell 1 (the Playstation chip) didn't have the double precision floating performance to achieve the petaflop mark; Cell 2 is far better on that front.
Yes, It Does Run Linux (Score:3, Informative)
Now get out there and supercompute!
Old News (Score:3, Informative)
This was covered last year, and the Los Alamos website [lanl.gov] had a few interviews with some people involved on what the uses of Roadrunner are. They had a time-line of what phases are to be done, and as far as memory serves me, they were going with Opterons for the first phase, then performance assessment, then add the Cell processors in the third phase.
From these pictures [lanl.gov], it clearly shows they're using IBM Blades (4 chassis in each rack), and IBM already offers BladeQ [ibm.com] servers which use Cell processors for HPC applications. The IBM BladeQ servers pack double the CPUs of a PS3.
If you take a look at the Folding@Home project statistics [stanford.edu], you can see the performance of PS3 boxes, and almost relate...Re:Not in perspective - this is a media number (Score:5, Informative)
No, not at all scary. It's apparently twice is fast as the BlueGene/L, which apparently set a record of 478.2 teraFLOPS. Let's assume it takes 1 floating-point operation to test a single key, which is a gross underestimate. We'll thus assume the Roadrunner can test 10^15 keys per second. Testing 2^128 keys would then take about 10^16 years.
wikipedia (Score:2, Informative)
Re:Question (Score:1, Informative)
This CELL is not single precision (Score:5, Informative)
Things move fast in technology Jethro, including this 2nd gen of the CELL proc, this is what you missed:
Double Precision FP - 190TFLOPS (5 times faster than 1st CELL)
Memory: Expanded to 32gb
Memory: DDR2 instead of Rambus
65nm (I know, I know, but it's better than 90nm)
Re:exaflop, zettaflop, the yottaflop and the xeraf (Score:3, Informative)
Each node has two Opterons and 4 PowerXCell 8 processors (an upgrade to the PS3's Cell processor). This allows a developer writing code for the platform to run in a number of different modes: all Opteron, all Cell, or something in between. The first of these (all Opteron) may constitute a significant amount of the early work on the machine by practitioners, as they can simply compile legacy codes to the platform and ignore the Cell processors. Of course, to reap the full benefit of the machine, developers will exploit both the Cell chips and the Opteron chips.
Re:Question (Score:0, Informative)
And don't forget it's pretty much the same data as goes towards the weather forecasts. Those things on TV? Sure, they're not always right. But they're pretty good nowadays. You can't have it both ways, like some creationist happily microwaving his dinner.
Re:Question (Score:1, Informative)
Re:Change in paradigm (Score:3, Informative)
IBM is already selling a product under the name bluefire
http://www.eetimes.com/news/latest/showArticle.jhtml?articleID=207100873 [eetimes.com]
http://www.eweek.com/c/a/Infrastructure/IBM-Ships-First-WaterCooled-Supercomputer/ [eweek.com]
I hope we see more water & less air in the future
Re:This CELL is not single precision (Score:3, Informative)
a) PS3 "Classic Cell" 1 PPC64 w/ 2 threads and 7 SPEs (8, but one disabled, defective or not):
GFLOPS 64-bit (double): 3.2GHz * 1 FLOPS/Hz * 7 SPEs = 22.4 GFLOPS (huge penalty, because of simulation via unoptimized simple precission operation)
GFLOPS 64-bit (double) a optimized 32-bit operation [netlib.org]: 3.2GHz * 3.9 FLOPS/Hz * 7 SPEs = 87.36 GFLOPS
b) Roadrunner "New Cell" 1 PPC64 w/ 2 threads and 8 SPEs:
GFLOPS 64-bit (double): 4GHz * 4 FLOPS/Hz * 8 SPEs = 128 GFLOPS
P.S. ad hoc rewrite, based on my own Journal at Barrapunto (spanish
Re:But can it run.... (Score:1, Informative)
Re:Cell processor (Score:5, Informative)
Re:Cell processor (Score:4, Informative)
Cell in addition has 8 SPUs. 1 is disabled in the PS3 for yield reasons, and another is reserved, so there are 6 available for general purpose computing.
Both run at 3.2GHz. I think Cell has at least 3x the vector/streaming power of the XBox 360 CPU, but only 1/3rd of the general purpose capability. Figures pulled from thin air, etc.
Re:Summary should have a shout out (Score:5, Informative)
You're precisely correct. Cell's strength is in very predictable workloads (ones it can perform without branch mispredict penalties), very parallelizable workloads (ones that can be distributed over 6-8 SPU's / SPC's) that fit within 256 KB of local storage per SPU (manually managed cache, mapped to main memory). The non-double precision floating point enhanced version's (the version in the PS3) strength is further limited to integer and single precision floating point workloads. Roadrunner's Cell-DP eliminates that last limitation. While video games, encryption, nuke simulations and anything else that involves matrix manipulation can really stretch their legs on such a beast, general purpose computing won't find a benefit.