Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

The Supercomputer Race

Posted by samzenpus on Tue Sep 23, 2008 06:45 PM
from the greased-lightning dept.
CWmike writes "Every June and November a new list of the world's fastest supercomputers is revealed. The latest Top 500 list marked the scaling of computing's Mount Everest — the petaflops barrier. IBM's 'Roadrunner' topped the list, burning up the bytes at 1.026 petaflops. A computer to die for if you are a supercomputer user for whom no machine ever seems fast enough? Maybe not, says Richard Loft, director of supercomputing research at the National Center for Atmospheric Research in Boulder, Colo. The Top 500 list is only useful in telling you the absolute upper bound of the capabilities of the computers ... It's not useful in terms of telling you their utility in real scientific calculations. The problem with the rankings: a decades-old benchmark called Linpack, which is Fortran code that measures the speed of processors on floating-point math operations. One possible fix: Invoking specialization. Loft says of petaflops, peak performance, benchmark results, positions on a list — 'it's a little shell game that everybody plays. ... All we care about is the number of years of climate we can simulate in one day of wall-clock computer time. That tells you what kinds of experiments you can do.' State-of-the-art systems today can simulate about five years per day of computer time, he says, but some climatologists yearn to simulate 100 years in a day."
+ -
story

Related Stories

This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by Anonymous Coward on Tuesday September 23 2008, @06:49PM (#25129355)

    Is how many libraries of congress it can read in a fortnight.

  • Simulation (Score:5, Funny)

    by gringer (252588) on Tuesday September 23 2008, @06:55PM (#25129393)

    Simulate 100 years of climate in a day? Here's my code:

    echo -e "sunny\nrainy\ncloudy" | rl -rc 36525

    • What is rl?
    • Re: (Score:3, Interesting)

      You jest, but that's exactly the point. "Simulated years per day" is about as meaningless a metric as it gets, because, as you proved, that number depends on the complexity of the underlying climate model, and also on how well the software was written, i.e. if it is optimized for both the hardware and the model to be computed.

      Both these factors are hard/impossible to control and to standardize, and the only factor that does not change is the actual hardware and its peak/sustained performance, so it's the o
  • by Ritz_Just_Ritz (883997) on Tuesday September 23 2008, @06:55PM (#25129395)

    Sadly, while predicting the weather and better understanding it ultimately helps a lot of people, I suspect a LOT more computing power is thrown at more mundane things like predicting where the financial markets are going to be based on a gazillion data inputs. Probably even better funded are the vast datacenters around the world that fondle communications and other data for the spymasters. I doubt those computing resources are represented in the annual supercomputing lists. :)

    • by Quarters (18322) on Tuesday September 23 2008, @07:03PM (#25129469)
      But, in a roundabout way the financial market simulator will ultimately help the weather simulator's performance. Everyone knows that business apps are written in VB. That means the financial simulator folks need MUCH more powerful supercomputers to run their code at anything close to appreciable speed. That same machine will run well coded weather apps blazingly fast!
    • Re: (Score:2, Insightful)

      I suspect no computing power is being thrown at predicting where the financial markets are going.

      A lot is thrown at pretending to predict it, but it's brilliantly obvious that the output of such things is no better than chicken entrails or the last two presidential elections.

    • Re: (Score:3, Interesting)

      Sadly, while predicting the weather and better understanding it ultimately helps a lot of people, I suspect a LOT more computing power is thrown at more mundane things like predicting where the financial markets are going to be based on a gazillion data inputs. Probably even better funded are the vast datacenters around the world that fondle communications and other data for the spymasters. I doubt those computing resources are represented in the annual supercomputing lists. :)

      There are a couple of misperceptions here.

      Both the problems described, modeling years of weather models or modeling financial instruments, suffer from a definite flaw: they are not mathematical problems in the "high school" sense of the world, i.e. it is not possible to prove that there is only one finite solution that is demonstrably right.
      Financial models are "fit to reality": you take a long time series, make a few wild guesses, throw it into a Cray-2, and look what the model says. Lather, rinse, r

  • Flops not useful? (Score:5, Informative)

    by zippthorne (748122) on Tuesday September 23 2008, @06:57PM (#25129413) Journal

    But.. The whole point is to test the model, and the models change, don't they? Surely we're not just simulating more "years" of climate with the current batch, but improving resolution, making fewer simplifying assumptions, and hopefully, finding ways to do the exact same operations with fewer cycles.

    How can you possibly evaluate supercomputers in any other way except how many mathematical operations can be performed in some reference time? And.. some serial metric if the math is highly parallel, since just reducing the size of vectors in those cases wouldn't actually result in those flops being useful for other tasks.

    • Re:Flops not useful? (Score:5, Interesting)

      by geekoid (135745) <dadinportland.yahoo@com> on Tuesday September 23 2008, @07:03PM (#25129473) Homepage Journal

      That's just the problem, people want to hear raw numbers, but those are useless.
      How well can it do the specific task it needs to do is the actual question. It's a hard one, to be sure.

      • Re:Flops not useful? (Score:5, Informative)

        by jd (1658) <imipak@@@yahoo...com> on Tuesday September 23 2008, @08:31PM (#25130127) Homepage Journal
        To be honest, I thought most people already knew about and used HPC Challenge, which produces 7 different benchmarks covering different types of mathematical problem, memory bandwidth and communications bandwidth. I also imagined people would use MPI-IO [lanl.gov] for measuring MPI performance, that the numbers on the Top500 was simply because it's hard to track a vast number of stats in a meaningful way.

        Of course, if it's actually the case that people are dumb, lazy or in marketing, then that would explain why we don't get a full range of stats, even though the tools have existed for many years and are certainly widely known.

    • Re:Flops not useful? (Score:5, Informative)

      by corsec67 (627446) on Tuesday September 23 2008, @07:13PM (#25129573) Homepage Journal

      Flops wouldn't test how well the interconnects work.

      Since you say "increase the resolution of the model", you are expanding the size of the model, and how much data must be used by all of the nodes of the computer.

      Since how important the interconnect properties are is dependent on the model, with almost no communication needed, like for F@H, to a problem that needs all of the nodes to have access to a single shared set of data, it would be very hard to quantify performance in one number.

      Unfortunately, there are more than a few fields where marketers want a single number to advertise in a "mine is bigger than yours" competition, and come up with a metric that is almost worthless.

      • Re: (Score:3, Informative)

        Actually, Linpack is not embarrassingly parallel so it DOES test how well the interconnects work, to some extent.

        The top 500 list is interesting, but if you're building a supercomputer to make a certain rank you have too much money and you should really give me some.

        You build a supercomputer to perform some task or class of tasks. If it gets you on the list, cool.

    • Because the climate simulation model they use does a LOT of inter-process communication. Each piece of the calculation depends on what's going on around it.
      Ever see footage of manual calculation rooms NASA used to have*? Imagine if every one of the calculations those people were doing depended on the previous calculation they did, AND all of the previous calculations of their eight nearest neighbors.

      Now you know why that atmospheric model has a benchmark rated in "century/months" - the number of centuries

    • Re:Flops not useful? (Score:5, Informative)

      by Salamander (33735) <jeff@p[ ]typ.us ['l.a' in gap]> on Tuesday September 23 2008, @07:46PM (#25129839) Homepage Journal

      How can you possibly evaluate supercomputers in any other way except how many mathematical operations can be performed in some reference time?

      Simple: you evaluate how much actual work it can perform across the entire system per unit time, where "actual work" means a mix of operations similar to some real application of interest. The whole problem here is that practically no real application is as purely focused on arithmetic operations as Linpack. Even the people who developed Linpack know this, which is why they developed the HPCC suite as its successor. It's composed of seven benchmarks, including some (e.g. stream triad) that mostly stress memory and some (e.g. matrix transpose) that mostly stress interconnects. If you want to get an idea how your application will perform on various machines, you determine what mix of those seven numbers best approximates your application, assign appropriate weights, and then apply those weights to the vendor numbers. Then you negotiate with the two or three most promising vendors to run your application for real. SPEC should have put an end to simplistic "single figure of merit" comparisons, or if not them then TPC, SPC, etc. Sadly, though, there's still always someone who comes along and tries to revive the corpse.

    • Re: (Score:3, Interesting)

      "How can you possibly evaluate supercomputers in any other way except how many mathematical operations can be performed in some reference time? "

      It's much more subtle than that. Most programs, including weather simulations, use a large amount of data stored on disk and in RAM. The problem with LINPACK as a benchmark is that, for all practical purposes, it ignores this cost by using a few very specific linear algebra operations that have very low communication/computation ratios. The LINPACK number is onl

  • by straponego (521991) on Tuesday September 23 2008, @07:02PM (#25129459)
    A quality HPC vendor will give you the opportunity to benchmark your application before you buy a system or cluster. Most will have standard codes installed, but you should also be able to arrange for a login to build and run your own code on their test clusters. This is the only way to guarantee you're getting the best bang per buck, because the bottleneck in your particular applications may be memory, IO, interconnect, CPU, chipset, libraries, OS... An HPC cluster can be a big purchase, and it performance and reliability can make or break careers. Don't trust generalized benchmarks unless you know that they accurately reflect your workload on the hardware you'll be purchasing.
  • I agree (Score:3, Informative)

    by friedmud (512466) on Tuesday September 23 2008, @07:04PM (#25129479) Homepage

    I write massively parallel scientific code that runs on these supercomputers for a living... and this is what I've been preaching all along.

    The thing about RoadRunner and others (such as Red Storm at Sandia) is that they are special pieces of hardware that run highly specialized operating systems. I can say from experience that these are an _enormous_ pain in the ass to code for... and reaching anything near the theoretical computing limit on these machines with real world engineering applications is essentially impossible... not too mention all of the extra time it costs you in just getting your application to compile on the machine and debug it...

    My "day-to-day" supercomputer is a 2048 processor machine made up of generic Intel cores all running a slightly modified version of Suse Linux. This is a great machine for development _and_ for execution. My users have no trouble using my software and the machine... because it's just Linux.

    When looking at a supercomputer I always think in terms of utility... not in terms of Flops. It's for this reason that I think the guys down at the Texas Advanced Computing Center got it right when they built Ranger ( http://www.tacc.utexas.edu/resources/hpcsystems/#constellation [utexas.edu] ). It's about a half a petaflop... but guess what? It runs Linux! And is actually made up of a bunch of Opteron cores... the machine itself is also a huge, awesome looking beast (I've been inside it... the 2 huge Infiniband switches are really something to see). I haven't used it myself (yet), but I have friends working at TACC and everyone really likes the machine a lot. It definitely strikes that chord between ultra-powerful and ultra-useful.

    Friedmud

    • Re:I agree (Score:4, Funny)

      by Abreu (173023) on Tuesday September 23 2008, @07:24PM (#25129663)

      My "day-to-day" supercomputer is a 2048 processor machine made up of generic Intel cores all running a slightly modified version of Suse Linux.

      We all envy you.

    • Well, let's see (Score:5, Interesting)

      by Louis Savain (65843) on Tuesday September 23 2008, @07:29PM (#25129715) Homepage

      It's about a half a petaflop... but guess what? It runs Linux!

      This sounds kind of nice but why should this make it any easier to write parallel programs for it? You still have to manage hundreds if not thousands of threads, right? This will not magically turn it into a computer for the masses, I guarantee you that. I have said it elswhere [blogspot.com] but parallel computing will not come of age until they do away with multithreading and the traditional CPU core [blogspot.com]. There is a way to build and program parallel computers that does not involve the use of threads or CPUs. This is the only way to solve the parallel programming crisis. Until then, supercomputing will continue to be a curiosity that us mainstream programmers and users can only dream about.

      • Re: (Score:3, Insightful)

        "Until then, supercomputing will continue to be a curiosity that us mainstream programmers and users can only dream about."

        I'm not so sure that's a bad thing... most applications don't need the power of a super computer...

        At the same time, I agree that I wish that desktop development tools made it easier to do threading for multi-core machines. Every new computer comes with more than one core... but the development tools (languages, compilers, IDE's, debuggers) simply aren't helping the everyday joe progra

    • RoadRunner (Score:2, Insightful)

      by Anonymous Coward

      Is running relatively stock Fedora (the ppc distribution). True, it's ram hosted, but the OS is hardly specialized in terms of libraries and such. You could say the Cell SDK is a tad specialized, but the underlying platform is not so custom as implied.

      In fact, every single Top500 system I've ever touched has been far more typical linux than most people ever expect.

      In any event, the most compelling aspect of RoadRunner in my view is the flops/watt. Application developers who can leverage highly parallel c

      • Re: (Score:3, Informative)

        The specialization of the hardware / software combo is what I was referring to.

        Have you ever coded for one of these special architectures? It really is a bitch. Yes, Redstorm is even worse (special OS that doesn't even allow dynamic linking!)... but the non-generality of the cell-processors is going to kill the real world impact of Roadrunner.

        ASCII Purple was one of the previous machines at LANL that was a "one-off" build from IBM. It was a complete disaster. Porting code to the machine took much longer

        • Re: (Score:3, Informative)

          Sorry... got my supercomputers mixed up... ASCII Purple was at LANL...

          I was thinking of ASCI Q, but that was made by HP...

          Oh... just nevermind... I screwed it up well enough, just forget it ;-)

          Need to get some sleep.

          Friedmud

          • Re: (Score:3, Informative)

            As far as I know (as of 3 months ago) they're still running Catamount at Sandia... and it's for the reason you state: they developed it.

            Friedmud

    • Re:I agree (Score:4, Insightful)

      by CronoCloud (590650) <cronocloud AT mchsi DOT com> on Tuesday September 23 2008, @07:36PM (#25129765)

      The thing about RoadRunner and others (such as Red Storm at Sandia) is that they are special pieces of hardware that run highly specialized operating systems.

      from http://en.wikipedia.org/wiki/IBM_Roadrunner [wikipedia.org]

      The Roadrunner uses Red Hat Enterprise Linux along with Fedora as its operating systems and it's managed with xCAT distributed computing software.

    • Not just buying the thing, also the cost of maintaining it (standard hardware is most likely easier to maintain), and the power it and its cooling system uses, check green500 for that one. Actually, as a user, I have often found that most supercomputing clusters are inefficient for at least the first year-and-a-half due to imperfect queuing systems or network/filesystem incompatibilities. "Yeah, your run will likely crash every now and then but we don't know yet why". I do not blame the administrators, I b
  • Non-story... (Score:3, Insightful)

    by Kjella (173770) on Tuesday September 23 2008, @07:05PM (#25129497) Homepage

    ...ever looked at gaming benchmarks? Server benchmarks? Productivity benchmarks? Rendering benchmarks? In fact, any kind of benchmark? Seen how they all differ depending on the product and test run? Same with supercomputers, you got some synthetic benchmarks, and you got some real world benchmarks. But the weather simulation may not be a relevant benchmark at all if you're doing nuke simulations or gene decoding or finite deformation or some other kind of simulation. Synthetics are the lowest common denominator - you'd rather see benchmarks in your field, and most of all benchmarks with your exact application. That doesn't change that those are individual wants and synthetic benchmarks are the only ones with any value to everyone.

  • by Junta (36770) on Tuesday September 23 2008, @07:23PM (#25129657)

    Just with a lot more dollars behind it...

    Every one remotely engaged in Top500 systems knows how very specific the thing being measured is. It's most sensitive to the aggregate clock cycles and processor architecture, and not as sensitive to memory throughput/architecture or networking as many real world things are.

    http://icl.cs.utk.edu/hpcc/ [utk.edu]

    Is an attempt to be more comprehensive, at least, by specifying a whole suite of independently scored benchmarks to reflect the strengths and weaknesses of things in a more holistic way. Sure, it's still synthetic, but it can give a better 'at-a-glance' indicator of several generally important aspects of a supercomputer configuration.

    The thing probably inhibiting acceptance of this is that very fact, that it is holistic and the winner 'depends' on how you sort the data. This is excellent for those wanting to more comprehensively understand their configurations standing in the scheme of things, but hard for vendors and facilities to use for marketing leverage. Being able to say 'we built *the* fastest supercomputer according to the list' is a lot stronger than 'depending on how you count, we could be considered number one. Vendors will aggressively pursue pricing knowing about the attached bragging rights, and facilities that receive research grant money similarly want the ability to make statements without disclaimers.

    Rest assured, though, that more thorough evaluations are done and not every decision in the Top500 is just about that benchmark. For example, AMD platforms are doing more strongly than they would if only HPL score is counted. AMD's memory performance is still outrageously better than Intel and is good for many HPC applications, but Intel's current generation trounces AMD in HPL score. Of course, Intel did overwhelmingly become popular upon release of their 64-bit core architecture based systems, but still..

  • by Anonymous Coward

    Most of the locations listed are mostly educational institutions, r&d centers, and computer companies. The results were probably submitted unofficially. There are few exceptions, but they are just that--few. It makes you wonder what the Big Data companies (Google, Yahoo!, etc) actually have running. They have no reason to participate, after all...

    Consider something like Yahoo!'s research cluster [yahoo.com]. Why isn't it on this list? Why don't they run the tests?

  • That an article about featuring IBM supercomputers comes shortly after a few misguided individuals were posting that "IBM is no longer relevant, they are a OEM reseller nowadays" or that they "only make bloated, slow software"

  • Number of simulated years per day isn't exactly the metric you want. I can simulate a million years in a minute on my home pc, just not very accurately. As you get more accurate, the sim years/CPU day will decrease.

    So knowing the number of simulated years per cpu day doesn't tell you anything unless you know exactly what algorithm you're using.

  • by Raul654 (453029) on Tuesday September 23 2008, @08:56PM (#25130269) Homepage

    It's fair to criticize Linpack for being a one-trick pony. It measures system performance for dense linear algebra, and nothing else. Jack Dongarra (the guy who wrote Linpack and maintains the top 500 lists) is quite up-front about Linpack's limitations, and he thinks that using a single number as the end-all-be-all of a computer's performance is a bad idea. It's a simple fact of life that certian kinds of computers do better on certain problems. The good guys out at Berkeley even sat down a couple years ago and enumerated [berkeley.edu] all of the problems they found in real-world HPC applications (See the tables on pages 11-12). The real truth here is that people should stop treating Linpack like it's the final measure of system performance. If you are doing pure linear algebra problems, it's a pretty good measurement for your purposes; if you are not, then you use it at your own peril.

    • by Fishbulb (32296) on Tuesday September 23 2008, @06:54PM (#25129385)

      Don't hold your breath; it'll disrupt the predictions.

    • 48 hours forecasts are very accurate. Hurricanes aren't so bad either,.
      I remember when the big leap was talking about a 3 day forecast to viewers, not it's 7-10. Still with decent accuracy.

      Now I note you said 'predictions' and that will never happen. I am assuming you meant forecasts.

      • by geezer nerd (1041858) on Tuesday September 23 2008, @07:23PM (#25129653)
        I can remember when the big desire of weather simulation supercomputers was to take less than 24 hours to do a 24-hour forecast. IIRC back in the second half of the '70s there was a big government-funded effort to build special fluid-dynamics oriented new machines to break that barrier.

        44 years ago 1-5 megaflops was hot! What excitement we felt when the CDC6600 was installed at my university!

        Back in '85 I was part of a startup building a mini-Cray, reimplementing the Cray instruction set in a smaller, cheaper box. I remember we focused on the Whetstone benchmark a lot, and it turned out that the Whetstone code really was bound up by moving characters around while formatting output strings, etc. We paid very careful attention to efficiently coding the C library string handling routines, and that got us more performance payback than anything we could do to optimize the arithmetic. One needs to understand the benchmark being used.

    • by TapeCutter (624760) on Tuesday September 23 2008, @08:10PM (#25129961) Journal
      I just threw away a couple of mod points to bring you this announcement: Climate != weather, climate is the long term statistics of weather. Two different numerical analysis models, both computationally expensive.
          • Re: (Score:3, Insightful)

            Can't I? - On average the weather will be colder in winter 4008 than it will be in summer 4008.

            Are you sure about that? You see, you opened a hole so broad that your statement isn't accurate even today. It's always summer and always winter on the earth. So it could be the same temperature in summer and winter in 4008.

            Also, you didn't explain what future climate predictions you came to that conclusion over. I might have not been specific enough for a fan boy like you, but I was making the claim to future

              • Re: (Score:3, Insightful)

                No, in fact my faith in science as the provider of the best available explaination for systematic observations of the natural world means that I'm not even sure the Sun will rise tomorrow BUT it my faith also tells me not worry about it.

                I wasn't talking about your faith in science. I am talking about your faith in global warming and how it has to be true.

                Absolute certainty of future events is a sport played by politicians and opionion columnists and I suggest that is why you insist on using their stale

    • by Waffle Iron (339739) on Tuesday September 23 2008, @08:40PM (#25130183)

      IANAM (I am not a meteorologist)

      That's for sure.

      Here's an analogy: Say you pour two different colored cans of paint into a bucket and start stirring. Weather is like predicting the exact patterns of swirls that you'll see as the colors mix. Very hard to do looking ahead more than a couple of stirs.

      Climate is more like predicting the final color that will result after the mixing is done. Not nearly so intractable. The summary is talking about climate, not weather.