Cray Unveils XC30 Supercomputer 67

Posted by timothy on Thursday November 08, 2012 @04:58PM from the show-us-the-sheets dept.

Nerval's Lobster writes "Cray has unveiled a XC30 supercomputer capable of high-performance computing workloads of more than 100 petaflops. Originally code-named 'Cascade,' the system relies on Intel Xeon processors and Aries interconnect chipset technology, paired with Cray's integrated software environment. Cray touts the XC30's ability to utilize a wide variety of processor types; future versions of the platform will apparently feature Intel Xeon Phi and Nvidia Tesla GPUs based on the Kepler GPU computing architecture. Cray leveraged its work with DARPA's High Productivity Computing Systems program in order to design and build the XC30. Cray's XC30 isn't the only supercomputer aiming for that 100-petaflop crown. China's Guangzhou Supercomputing Center recently announced the development of a Tianhe-2 supercomputer theoretically capable of 100 petaflops, but that system isn't due to launch until 2015. Cray also faces significant competition in the realm of super-computer makers: it only built 5.4 percent of the systems on the Top500 list, compared to IBM with 42.6 percent and Hewlett-Packard with 27.6 percent."

Cray Unveils XC30 Supercomputer

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 67 Comments Log In/Create an Account

Comments Filter:

Does it have a bench-seat? (Score:4, Insightful)

by Jeremiah Cornelius ( 137 ) writes: on Thursday November 08, 2012 @05:01PM (#41924097) Homepage Journal

It's no Cray, unless it also doubles as stylish atrium furniture.

- Re: (Score:3)
  
  by srussia ( 884021 ) writes:
  
  It's no Cray, unless it also doubles as stylish atrium furniture.
  ...and space heater!
  - Re: (Score:3)
    
    by Jeremiah Cornelius ( 137 ) writes:
    
    Sit on it, Fonzie!
    http://dooki.com/supercomputers/cray/xmp/cray.xmp.4.jpg [dooki.com]
    - MORE CRAY PR0N! (Score:5, Informative)
      
      by Jeremiah Cornelius ( 137 ) writes: on Thursday November 08, 2012 @05:24PM (#41924473) Homepage Journal
      
      http://www.craysupercomputers.com/images/Systems/CrayXMP/CrayXMP_Feathered.jpg [craysupercomputers.com]
      http://www.craysupercomputers.com/images/Systems/CrayYMP8/CrayYMP8_Feathered.jpg [craysupercomputers.com]
      
      - Re:MORE CRAY PR0N! (Score:4, Interesting)
        
        by psergiu ( 67614 ) writes: on Thursday November 08, 2012 @06:11PM (#41925007)
        
        How about Cray T90 - looking like something out of David Lynch's Dune:
        http://www.craywiki.com/images/f/fb/T916.jpg [craywiki.com]
        now THAT was a computer any CEO was proud to show to visitors.
        Not a row of boring cabinets.
        
        
        Re: (Score:1)
        
        by Anonymous Coward writes:
        
        My favourite still has to be the Thinking Machines CM2:
        http://www.mission-base.com/tamiko/cm/cm2-hds.gif
        
        Re: (Score:3)
        
        by Jeremiah Cornelius ( 137 ) writes:
        
        HERETIC! Machines that THINK? You need a Mentat!
        
        Re: (Score:2)
        
        by bedouin ( 248624 ) writes:
        
        Sorta looks like something that would summon the Cenobites if opened for repairs.
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  It's not a Cray, full stop. Like Atari, the Cray name has been bought a completely unrelated company after the original company went bankrupt.
  - Re: (Score:2, Informative)
    
    by Anonymous Coward writes:
    
    It is however, the same group of very clever engineers.
    - Re: (Score:2)
      
      by jsfetzik ( 40515 ) writes:
      
      Yup. Cray's advantage has always been the engineering behind the interconnects and the highly optimized compilers. The people behind those are still around.
      - Re: (Score:2)
        
        by Celarent Darii ( 1561999 ) writes:
        
        By the way, for those who don't remember Cray computer and their engineers, I very much recommend this video: http://www.youtube.com/watch?v=J9kobkqAicU [youtube.com] Seymour Cray was a brilliant man and he attracted many brilliant engineers to work on his machines. The video gives some history from some actual workers at Cray and the first users of the Cray 1.
100-petaflops? Amazing! (Score:1, Funny)

by Anonymous Coward writes:

That's almost enough to run Vista
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  So then not enough left to play Crysis 2?
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    But a beowulf cluster of those ALMOST could run it!
- Re: (Score:2)
  
  by unixisc ( 2429386 ) writes:
  
  Yeah, if they are based on Xeon, shouldn't Vista run on them?
"Unveiled" my arse (Score:1)

by fatphil ( 181876 ) writes:

They've released the output of a raytracer, and little more by the looks of it.

Things that don't exist are not "capable" of anything. (Well, unless you're of a religious persuasion...)
- Re: (Score:3)
  
  by suso ( 153703 ) * writes:
  
  I'll be revealing my supercomputer that has finally broken the exaflop barrier in about an hour. (opens Blender)
details, details (Score:1)

by whistl ( 234824 ) writes:

While the article says they 'unveiled' it, it doesn't give any information about the hardware at all. I'm guessing it hasn't actually been built yet. Too bad. The Top 500 Supercomputers list is due to be updated this month.
- Re: (Score:3, Informative)
  
  by whistl ( 234824 ) writes:
  
  The Cray website (http://www.cray.com/Products/XC/XC.aspx) has more details. 3072 cores (66 Tflops) per cabinet, initially, and the picture make it look like they have 16 cabinets, making 49152 cores total. Amazing.
  - Re: (Score:1)
    
    by Desler ( 1608317 ) writes:
    
    They'll need more than 16 if this is a 100 petaflop computer. So either you are looking at the wrong machine or there's a typo somewhere.
    - Re: (Score:2)
      
      by corvair2k1 ( 658439 ) writes:
      
      The statement is that the xc30 can _make it_ to 100 PF. Nobody will build a 100 PF machine (i.e., 1600 cabinets, 8x more than Jaguar) with this product line, there will be upgrades before then. 32k sq ft of machine room space and cooling is too expensive.
- Re: (Score:2)
  
  by godrik ( 1287354 ) writes:
  
  Actually Super Computing is next week. So the ranking will be available probably on monday!
Damn (Score:2, Funny)

by Anonymous Coward writes:

That shit cray.
On your desktop in 11 years (Score:5, Insightful)

by michaelmalak ( 91262 ) writes: <michael@michaelmalak.com> on Thursday November 08, 2012 @05:37PM (#41924635) Homepage

In November, 2001, the fastest supercomputer was 12 TFlops [top500.org]. You can achieve that today for less than $5,000 on your desktop by ganging together four GPGPU cards (such as the 3 TFlops Radeon 7970 for less than $500 each). Go back to 1999 and it's only 3 TFlops and to match today you wouldn't even need a special motherboard.
So just wait 11 years for the prices to come down.

- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  Supercomputers measure double precision FLOPS while the GPGPU vendor cheat and report single precision. And that doesn't take into account the ugly "kernel" programming needed for GPGPU and memory synchronization.
  - Re:On your desktop in 11 years (Score:5, Interesting)
    
    by michaelmalak ( 91262 ) writes: <michael@michaelmalak.com> on Thursday November 08, 2012 @06:13PM (#41925025) Homepage
    
    Supercomputers measure double precision FLOPS while the GPGPU vendor cheat and report single precision.
    Ah, OK, Radeon is then 1 TFlop [rpi.edu] for double precision (which is new to the Radeon). So four Radeon 7970's beat the top 1999 supercomputer.
    
    - Re: (Score:2)
      
      by bws111 ( 1216812 ) writes:
      
      Except that 1999 supercomputer was capable of doing real work. You have 4 fast GPUs sitting in a box, doing nothing. What is feeding them work, coordinating their inputs/outputs, etc? That is where all the hard work is.
      - Re: (Score:2)
        
        by michaelmalak ( 91262 ) writes:
        
        What is feeding them work, coordinating their inputs/outputs, etc? That is where all the hard work is.
        OpenCL uses C99. It's tricky, maybe even "hard", but far from impossible.
        
        Re: (Score:2)
        
        by bws111 ( 1216812 ) writes:
        
        What I meant was, once you add in all the overhead of scheduling work, passing messages etc, you will find that you are running at a much slower speed than the raw speeds of the GPUs would have you believe. A GPU waiting for work, or memory access, or IO, or whatever is running at 0 FLOPS, regardless of how fast the processor is capable of running. If you can't keep those 4 GPUs running full speed doing actual work at all times, you have nothing near a 3 TFLOPS machine.
        
        Re: (Score:2)
        
        by michaelmalak ( 91262 ) writes:
        
        once you add in all the overhead of scheduling work, passing messages etc, you will find that you are running at a much slower speed than the raw speeds of the GPUs would have you believe
        Would you happen to know how that compares to real supercomputers?
        I don't have any first-hand experience with supercomputers -- only hearing about and reading about that they also struggle against Amdahl's law.
        
        Re: (Score:1, Informative)
        
        by Anonymous Coward writes:
        
        Well with supercomputers, the benchmark in the TOP 500 is LINPACK. Which will spit out the amount of double precision FLops. The theoretical performance is Ghz*cores*floating point ops/cycle = GFLops. thats in Gflops. A regular supercomputer with CPUs should never be below 80% of the maximum theoretical performance, if it is, something is wrong. A well tuned CPU cluster can get over 95% of the theoretical performance, a well tuned GPU cluster around 60%.
        Staying with a small scale ( 12 TFLops ), a real clu
        
        Re: (Score:2)
        
        by Meeni ( 1815694 ) writes:
        
        Correct in general, but extensive research in the last 5 years has lead to many production codes today. GPU accelerators can indeed live to (most) of their promises, and would typically reach 55 to 70% of peak in typical deployments (Tian-he is a good example ~55% efficient). Top notch designs can extract as good as 85% of peak in LINPACK, that is obtained by Sequoia, unvailed last year. We'll see how Titan will fare, its the new Supercomputer GPU giant, that will be announced this year to replace the Jagua
      - Re: (Score:2)
        
        by michaelmalak ( 91262 ) writes:
        
        Yeah, if only that box had some other processor specializing in scalar operations and connected to the vector processors via high bandwidth low latency link.
        An i5 has four cores and is connected to the Radeon via PCIe 3.0 x8.
      - Re: (Score:2)
        
        by timeOday ( 582209 ) writes:
        
        The linpack yield of current generation GPU clusters is about 50% [mcorewire.com]. So while your point is valid, "doing nothing" is a rather large exaggeration. For that matter, 50% is the yield on a cluster, so the yield on a single-bus machine is almost certainly higher.
        .
        From the following, it sounds like 1 Teraflop - not theoretical, but on Linpack - Is available on a desktop [gfxspeak.com], now or very soon:
        Intel has been working hard on its many-integrated core (MIC), which it describes as a 50+ core capable of one teraflops rea
        
        Re: (Score:2)
        
        by bws111 ( 1216812 ) writes:
        
        I stand by what I said, although maybe I worded it poorly. I did not mean that the config he proposed was uncapable of doing work. I meant that the only way to achieve the speeds he is talking about is by doing no work (in other words, not benchmarking, just going by what the box says).
- Re: (Score:1)
  
  by etash ( 1907284 ) writes:
  
  wrong. top500 measures double precision performance, not single.
- Re: (Score:2)
  
  by fuzzyfuzzyfungus ( 1223518 ) writes:
  
  The ugly trick is interconnect performance, unless you aren't planning to scale up very much at all or have the (atypical) good fortune to be attacking nothing but hugely parallel problems.
  It's been a while since the supercomputer crowd found rolling their own esoteric CPUs to be worth it(with POWER the possible exception); but if all the silicon you want to devote to the problem won't fit on a single motherboard, you quickly enter the realm of the rather specialized.
  At very least, you are probably looking
  - Re: (Score:1)
    
    by michaelmalak ( 91262 ) writes:
    
    At very least, you are probably looking at doing some networking as or more costly than a 10GbE setup
    There is no networking involved in a four-Radeon setup, just a special rackmount motherboard that has a dozen PCIe slots (because each Radeon is triple-width physically).
- It's more like 13 years (Score:2)
  
  by gentryx ( 759438 ) * writes:
  
  The Top500 reports actual performance as measured with LINPACK, hardware vendors report the theoretical performance of their chips, which in the case of GPUs is often quite a bit more than you'd be able to squeeze out with LINPACK.
  For comparison: Tsubame 2.0 consists of 1400 nodes with approx. 4200 NVIDIA Tesla C2075, which should yield -- according to your estimate -- 2.1 PFLOPS (4200 * 0.5 TFLOPS [nvidia.com]), yet it is listed at 1.2 PFLOPS [top500.org]. So just add two years to your estimate and you should be fine...
XC30 (Score:5, Funny)

by Cid Highwind ( 9258 ) writes: on Thursday November 08, 2012 @06:13PM (#41925033) Homepage

"Originally named 'Cascade'" ... and now named for a midsize Volvo.
It might not be the fastest supercomputer in the world, but at least it'll be safe.

- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  The Cray product may also be faster than the Volvo product!
- Re: (Score:2)
  
  by corvair2k1 ( 658439 ) writes:
  
  It's the number of nodes that can be connected into a single machine multiplied by the theoretical peak performance of each node (implying zero actual communication). The limit on the number of nodes can be limited by a range of things, from how many nodes are addressable by the networking hardware to an #IFDEF on the maximum number of nodes the software is willing to support.
But does it... (Score:2)

by KingMotley ( 944240 ) writes:

But does it run linux?
Imagine a beowulf cluster of those.
And not first post!
- Re: (Score:1)
  
  by emoreau ( 1247650 ) writes:
  
  That's exactly what those supercomputer are, Linux clusters.
It has also been reported... (Score:2)

by Progman3K ( 515744 ) writes:

That it runs Windows 8 nearly-acceptably
"Just 5%"??????? (Score:2)

by rubycodez ( 864176 ) writes:

For a company with a market cap of less than half a billion to have made 1 in 20 of the Top500 is an extraordinary achievement. IBM -> $215 billion, HPQ ->$27 billion
- Re: (Score:2)
  
  by RicktheBrick ( 588466 ) writes:
  
  Now lets asks how much power this computer will need? Lets say it can do a billion flops per watt. 100 petaflops is 100,000 trillion flops. A trillion flops is 1000 billion flops so a trillion flops is 1000 watts at a billion per watt. So 100,000 trillion flops would 100 million watts. So lets hope they can do at least 50 billion flops per watt so that would mean 20 watts per trillion flops or 2 million watts. At 10 billion flops per watt would mean 5 times that or 10 million watts. Now lets assume t
  - Re: (Score:2)
    
    by Meeni ( 1815694 ) writes:
    
    $500 million is aprox. the entire budget over the lifetime of the computer (including the electric bill, which is becoming increasingly the dominant cost to amortize). Typical build cost is around $100M.
    However, there is a false dichotomy in your comparison. The supercomputer is not designed to perform the job of 1 billion workstations. It is designed to perform a single task that could not be done on another machinery. Just like you cannot build a supertanker in a million bathtubes but need a shipyard, you
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  It's not really an achievement but a business model :-) they have 17% of the top 100, that is just their "sweet spot"
  Cray is the Ferrarri of Computing ....
Choice of CPUs (Score:2)

by unixisc ( 2429386 ) writes:

Why does Cray still stick to Xeons? This would have been a perfect application for Itanium III, and they would have hit their petaflop goals easier
- Re: (Score:2)
  
  by ajlitt ( 19055 ) writes:
  
  That's adorable.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Does it have a bench-seat? (Score:4, Insightful)

Re: (Score:3)

Re: (Score:3)

MORE CRAY PR0N! (Score:5, Informative)

Re:MORE CRAY PR0N! (Score:4, Interesting)

Re: (Score:1)

Re: (Score:3)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2, Informative)

Re: (Score:2)

Re: (Score:2)

100-petaflops? Amazing! (Score:1, Funny)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

"Unveiled" my arse (Score:1)

Re: (Score:3)

details, details (Score:1)

Re: (Score:3, Informative)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Damn (Score:2, Funny)

On your desktop in 11 years (Score:5, Insightful)

Re: (Score:1)

Re:On your desktop in 11 years (Score:5, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

It's more like 13 years (Score:2)

XC30 (Score:5, Funny)

Re: (Score:1)

Re: (Score:2)

But does it... (Score:2)

Re: (Score:1)

It has also been reported... (Score:2)

"Just 5%"??????? (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Choice of CPUs (Score:2)

Re: (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals