Supercomputing Technology

NEC SX-9 to be World's Fastest Vector Computer

An anonymous reader writes "NEC has announced the NEC SX-9, claiming it to be the fastest vector computer, with single-core speeds of up to 102.4 GFLOPS and up to 1.6 TFLOPS on a single node incorporating multiple CPUs. The machine can be used for complex large-scale computation - such as climate modeling, aerospace, environmental simulation, and fluid dynamics - by processing whole arrays with a single vector instruction. Yes, it runs a UNIX System V-compatible OS."
This discussion has been archived. No new comments can be posted.

  • Oh? (Score:2, Interesting)

    by SnoopJeDi ( 859765 )

    Yes, it runs a UNIX System V-compatible OS.


    Of course, but the true question is...

    Does it run Linux?

    Cue the redundant replies and grouchy mods.
    • by azulza ( 651826 )
      Of course, but the true question is...

      Does it run Vista without being a slow mofo?

      Cue redundant linux rants / MS bash

      • Re: (Score:1, Funny)

        by Anonymous Coward
        ...MS made Bash?
        • Re:Oh? (Score:5, Funny)

          by colourmyeyes ( 1028804 ) on Friday October 26, 2007 @04:11AM (#21125649)
          user@host:~ $ ls
          You are about to list the files in this directory.
          Are you sure you want to do this? [y/n] y

          Enter Administrator password:

          We're sorry, using MS Bash 4.00 Basic you do not have
          the proper privilege level to view system files.
          Please purchase MS Bash 4.00 Mega, Ultra, or Extreme.
          Would you like to purchase one of these products now? [y/n] y

          We're sorry, this product is not upgradeable. Please
          reinstall your operating system, choosing "clean install"
          during the upgrade process. Thank you for choosing the
          rich user experience provided by MS Bash 4.00.

          MS Bash must now restart your computer.
    • Quite possibly. (Score:5, Interesting)

      by jd ( 1658 ) <imipak@ y a hoo.com> on Friday October 26, 2007 @01:09AM (#21124777) Homepage Journal
      The architecture (a vector processor) is not in the vanilla kernel, but the kernel is fairly parallel, thread-safe and SMP-safe, so I really can't see any reason why you couldn't put Linux on such a platform. Because a lot of standard parallel software these days assumes a cluster of discrete nodes with shared resources, they'd do best to borrow code from Xen and possibly MOSIX to simulate a common structure.

      (This would waste some of the compute power, but if the total time saved from not changing the application exceeds the time that could be saved using more of the cycles available, you win. It is this problem of creating illusions of whatever architecture happens to be application-friendly at a given time that has made much of my work in parallel architectures - such as the one produced by Lightfleet - so interesting... and so subject to office politics.)

      • Re: (Score:3, Informative)

        by Kristoph ( 242780 )
        A user would pay the extremely high cost of a supercomputer - with its proprietary memory architecture and interconnects - precisely because it can scale up parallel processes much more effectively than a cluster. If the benefit of that did not outweigh the cost of tailoring software to fit the device, then these devices would never be made.

        ]{
        • Re:Quite possibly. (Score:5, Informative)

          by Calinous ( 985536 ) on Friday October 26, 2007 @02:25AM (#21125239)
          The cost of supercomputers is so high that several man-months of tailoring the software to run as efficiently as possible can sometimes be recovered in a couple of days of processing. For the kind of computation the supercomputer market demands, a 5% improvement in running speed can be worth millions.
          • Re: (Score:3, Insightful)

            by Chrisq ( 894406 )
            This was certainly the case when I used vector processors. It is possible that the vector processor does not run an OS at all. It has been many years since I worked on such a beast, but when I did, we ran a loader system with a standard OS which would cross-compile code for the processor and load it almost directly onto the hardware (there was actually a small program we called a monitor to deal with I/O, etc., but no multitasking, security, or anything). It would then run, and the results were read back into the f…
            • Re:Quite possibly. (Score:5, Informative)

              by bockelboy ( 824282 ) on Friday October 26, 2007 @08:48AM (#21127281)
              That's the current, popular Blue Gene/L architecture. The Blue Genes are composed of densely packed boards, each of which has a PowerPC chip and many vector processors. The PowerPC chips run a Linux-like OS and do some normal-looking I/O (filesystems, networking, etc.), while the vector processors churn lots of data and have simplistic I/O.

              The GP who suggests that Xen is used to distribute tasks obviously isn't familiar with the needs of big iron.
              • I think you are mistaken. There are no vector processors in Blue Gene/L. BG/L is composed entirely of IBM PPC 440 cores. Each node (out of 65,000) is composed of two PPC scalar cores. In most cases one runs the application, and one handles the message passing. The Blue Gene/P uses 4-core nodes, but is otherwise similar.

                The Cell processor has many scalar cores, which can be programmed to behave a little bit like a vector processor, though they really aren't. Cell processors are not currently used in Blue Gene.

      • Re:Quite possibly. (Score:5, Insightful)

        by SamP2 ( 1097897 ) on Friday October 26, 2007 @01:52AM (#21125097)
        CAN run Linux and RUNS Linux are not quite the same thing.

        To put things in perspective, 99% of PCs in the world CAN run Linux. :-)
        • by anarxia ( 651289 )
          Calling this a PC (Personal Computer) is a bit of a stretch :)
      • Re: (Score:3, Insightful)

        by deniable ( 76198 )
        The front-end OS for these things is pretty meaningless. Being Unix-like will keep the programmers and admins happy. The front end is only a shell for the code running on the back-end processing units. These do all of the work and rely on specific hardware, instructions, and libraries to do things in *actual* parallel. These things basically exist to run big number-crunching tasks for mathematicians and mathematicians in disguise like physicists. :) These people will generally be running their own code wi…
      • The NEC processor is modern in its memory protection, so Linux could easily be ported; however, there's a lot of time and money invested in SUPER-UX, so there's little incentive to do so. Even if Linux were ported, it wouldn't be like the Linux running on your desktop; it would be a stripped-down kernel and some basic libraries.

        I don't know what you're talking about with Xen and MOSIX. Neither seems at all applicable to the sort of software run on big-iron machines like this. NEC SX machines run code written for…
      • When we were porting System V to the ETA 10 supercomputer (a short-vector machine), one of the hardware engineers came running over one day with the devastating news that the vector square root instruction didn't work on all the test machines. We gravely told him that we would take all of them out of the Unix kernel.
    • Come on, we need to know: what is the default editor, vi or emacs? We need to know.

  • Forget esoteric units, how fast is it in Playstation3s per foot-second?
    • by thedarknite ( 1031380 ) on Friday October 26, 2007 @01:13AM (#21124805) Homepage
      What's with these newfangled measurements?

      I'd like to know what it is in Libraries of Congress per Jiffy
    • by PresidentEnder ( 849024 ) <(moc.liamg) (ta) (rednenrevyw)> on Friday October 26, 2007 @01:14AM (#21124829) Journal
      Your units don't cancel properly. Flops = floating point operations / second, PS3s / foot-second = physical object / (viscosity / weight). You could stretch PS3s to be units of processing power / time, which gives you processing power / time / viscosity, which we'll fudge to be about flops / viscosity.

      I dunno: maybe this thing could run faster at higher temperatures in lower gravity?

      (/pretending to know what I'm talking about)

      • by sqrt(2) ( 786011 ) on Friday October 26, 2007 @02:26AM (#21125241) Journal

        Your units don't cancel properly...
        Oh no, physics class flashback! No! NOOOOOOOO! I don't want to do this whole equation again!
      • I guess a more accurate question would be, how high a stack of PS3s running in parallel would you need to equal this thing's processing power? I so can't be bothered...
        • ~10000 would be a good guess.

          Quote [physorg.com]: "Mueller, an associate professor of computer science, has built a supercomputing cluster capable of both high-performance computing and running the latest in computer gaming. His cluster of eight PS3 machines - the first such academic cluster in the world - packs the power of a small supercomputer, but at a total cost of about $5,000, it costs less than some desktop computers that have only a fraction of the computing power.
          ...
          Mueller estimates that with approximatel…
    • Funnily enough - this isn't totally irrelevant.

      In 2000, IBM, Toshiba and Sony collaborated to create a Cell processor, consisting of one scalar processor and eight vector processors, for the Sony PlayStation 3.
      - Wikipedia.org
    • I'm not sure about that, but I hear it can make the Kessel run in 12 parsecs.
  • Logical question: (Score:3, Interesting)

    by r_jensen11 ( 598210 ) on Friday October 26, 2007 @01:09AM (#21124773)
    So, aside from having all of this power in one centralized spot, how does this compare to the combined power used for distributed computing projects like ClimatePrediction.net, Folding@home, and any other project on BOINC?
    • Something people constantly misunderstand about supercomputers is that they assume that all problems can be broken apart into manageable portions that can be split among thousands of computers. Many problems exist that have vast memory requirements and/or require interaction among all parts of the problem, and so have to be run on a single supercomputer.
      • Re: (Score:3, Insightful)

        by ajs318 ( 655362 )

        Something people constantly misunderstand ..... is that they assume that all problems can be broken apart into manageable portions that can be split

        Exactly! Having sex 39 times does not mean you will be able to get a baby in one week. Some operations are by nature sequential -- and while there is scope for some parallelisation, doing so in a highly-distributed fashion can end up increasing latency, because you end up spending more time splitting the data up and putting the results back together than act…

    • Re: (Score:2, Informative)

      by sophanes ( 837600 )
      Put simply, the problem set that vector processors are geared towards (problems involving large matrix ops) is the type clusters perform horribly at.
    • Re:Logical question: (Score:5, Informative)

      by deniable ( 76198 ) on Friday October 26, 2007 @01:42AM (#21125031)
      Well, distributed is often seen as poor man's parallel, but in this case they don't compare. Vector units hold large arrays of data and perform the same operation on all of them at once. Think of array or matrix operations being done in one step rather than needing loops. This is where a SIMD architecture takes off.

      The only unit I ever got to play with had a 64x32 grid of processors; you could add a row of numbers in log2(n) steps instead of n. It was cool because you could tell each processor to grab a value from the guy next to him (or n steps in a given direction from him), and so on. You could calculate dot products of matrices very quickly.

      The distributed stuff you mentioned is mostly farming. Take a big loop of independent steps, break them up and pass them out to a (possibly) heterogeneous collection of processing nodes. Collect the answers when they finish. Render farms work the same way. It's a good way to break up some problems, but it's not what a vector unit does.

      Now, I haven't touched this stuff for eleven years so my facts are possibly wrong. I'm sure someone will be along to correct me.
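      A minimal C sketch of that log2(n) tree reduction, for the curious (the function name and sample data are invented for illustration, not taken from the post above):

          #include <stdio.h>

          /* Tree reduction: sum n values in ceil(log2(n)) "steps", the way a
             SIMD grid adds a row of numbers. On real SIMD hardware every
             addition within one step happens simultaneously; the inner loop
             here only simulates that lockstep. */
          double tree_sum(double *a, int n) {
              for (int stride = 1; stride < n; stride *= 2)        /* one "step" per doubling */
                  for (int i = 0; i + stride < n; i += 2 * stride)
                      a[i] += a[i + stride];                       /* grab the neighbor's value */
              return a[0];
          }

          int main(void) {
              double row[8] = {1, 2, 3, 4, 5, 6, 7, 8};
              printf("%g\n", tree_sum(row, 8));   /* 36, reached in 3 steps instead of 7 */
              return 0;
          }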
    • Hate to burst your bubble, but while grid computing can certainly achieve impressive throughput, it is not quite AS fast as you might think.

      The entire SETI@home [wikipedia.org] project (the biggest grid computing project on the net) pumps out 274 teraflops. By comparison, Blue Gene/L [wikipedia.org] (first in the series) pumps out 360 teraflops, and newer versions will reach the petaflop range, much faster than anything anticipated for grid computing projects.

      Sure, you might say, just as supercomputers evolve, so does grid computing. The probl…
      • by renoX ( 11677 )
        Are the figures for the Blue Gene real, or just the theoretical maximum?

        Quite often what you can achieve on a particular problem is much less than what the computer is theoretically capable of (say 10%).
    • by Nirvelli ( 851945 )
      Don't forget the Storm project!
    • Eventually, the combined power of the BOINC architecture will be much larger than any supercomputer in terms of CPU, yet be totally insufficient for any of the supercomputer's tasks.

      Here's the experiment I've used to teach the concepts: take a deck of cards, shuffle it, and time yourself sorting it. Now have one other person help you sort it - it should be about twice as fast, maybe a little slower.

      Repeat again with an increasing number of people until you have one card per person. You now have a room full of b…
    • by LWATCDR ( 28044 )
      It doesn't compare at all.

      They are not used for the same type of problems. Some problems are ideal for cluster systems like the ones you have described. Others are ideal for vector systems like the SX. They don't compare well at all because they are not used for the same type of problem.

    • Well, the answer depends on your problem. These are sort of at the opposite end of the spectrum from distributed; there are a lot of solutions in between, in order from cheapest to most expensive per flop.

      Distributed computing needs to do a lot of computation on very tiny bits of data. You can pack up the problem set and send it over the internet, then do an hour or two of work on a CPU, and send another internet-sized transfer back. It's very economical, only cares about raw CPU performance, and can't be u…
    • by kjs3 ( 601225 )
      Not all problems lend themselves to a distributed solution.
  • by UnixUnix ( 1149659 ) on Friday October 26, 2007 @01:10AM (#21124783) Homepage
    "Easter Island's Weather Forecasting Service believes operation of the NEC SX-9 would realize a 53% savings under Windows Server 2008 compared to under UNIX"
    • 53% savings, 100% loss in function. I personally don't know of any version of Windows that can run on a vector CPU.
      • Exactly! :-)
      • by Bert64 ( 520050 )
        Exactly, you could save a lot of money by keeping this machine turned off!
  • "SCO files umpteen bazzillion dollar lawsuit against NEC"
  • by filesiteguy ( 695431 ) <perfectreign@gmail.com> on Friday October 26, 2007 @01:13AM (#21124811)

    I wonder how well it will do with the really cool vector games like Asteroids or BattleZone or Tempest or...

    ...Star Wars! Yeah, we could take this little baby, set up a sit-down booth, add some speakers, and we'd be set!



    "what's your vector, Victor?"

  • Why, that's more powerful than a cluster of 60 PS3s! I'll take three!

  • by ackthpt ( 218170 ) *

    with single-core speeds of up to 102.4 GFLOPS and up to 1.6 TFLOPS on a single node incorporating multiple CPUs.

    Don't be too proud of this technological marvel you have created for it is nothing compared to the power of the slashdot effect.

  • SCO!? (Score:3, Funny)

    by flyingfsck ( 986395 ) on Friday October 26, 2007 @01:40AM (#21125017)
    Did they buy a license from SCO?
  • There's an interesting paper [huji.ac.il] that analyzes the data accumulated on the top500 list site [top500.org], which ranks the 500 most powerful supercomputers twice a year. It shows that, over time, the share of vector machines within the list has been declining sharply, both in aggregate power and in number: from around 60% in 1993 to around 10% in 2003 (see Figure 3, page 6, in said paper). Still, vector machines refuse to die and always seem to maintain a presence in the top500, as is evident from the above Slashdot post. Will vector machines live forever?
    • Will vector machines live forever?

      Well, I actually doubt it. You could say 'those vector processors are used for matrix calculations and are wildly different from general purpose CPUs' and you'd be right.

      However, I could see a point in time where hybrids like the Cell (one scalar processor and eight vector processors) will become so cheap that the number of vector machines will decline even more.

      The idea will never die of course, I mean, hardware is so flexible nowadays that a good student could make…

      • Re: (Score:3, Interesting)

        by putaro ( 235078 )
        I haven't looked closely, but I would guess (based on having worked at a manufacturer of vector supercomputers many years ago) that all of the machines represented on the Top 500 list are hybrid machines. All of the vector architectures I'm familiar with had a scalar processor to handle most of the housekeeping and run the OS, compilers, and things like that. Vector processors aren't very good at doing things like that.

        Vectors excel at running through essentially loop operations. There are two components to thei…
        • The consequence of the parent post is that code can run very fast on a vector machine IF it is written in such a way as to take full advantage of the vector architecture, using smartly written loops. Now, it is a good idea to have such loops in your high-performance-computing code anyway, but since not everyone is writing a whole scientific software package from scratch each time a new computer is available, most codes used now are optimized for either shared-memory or cluster systems. (It would be nice if…
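          To make "smartly written loops" concrete, here is a hedged C sketch (both functions are invented examples, not code from any poster): the first loop has unit-stride accesses and no loop-carried dependency, so a vectorizing compiler can issue it as vector operations; the second does the same number of flops but contains a recurrence, so it cannot be vectorized as written.

              /* Vectorizes well: iterations are independent, accesses are unit-stride. */
              void axpy(int n, double a, const double *x, double *y) {
                  for (int i = 0; i < n; i++)
                      y[i] = a * x[i] + y[i];
              }

              /* Does not vectorize as written: each iteration needs the previous result. */
              void prefix_sum(int n, double *y) {
                  for (int i = 1; i < n; i++)
                      y[i] += y[i - 1];
              }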
          • Re: (Score:3, Interesting)

            by putaro ( 235078 )

            Realize that most scientific code probably still has lots of code in it written for the original CRAY system it ran on in the '80s, and you see why vector systems will live on for a while: code that was written for one will have to be used on a vector system. One has to have the luck to find a PhD student willing and able to rewrite the code for a new machine.

            Worse than that even. I was doing this back in the late 80's/early 90's and we spent a large amount of energy getting the FORTRAN compiler to automatically vectorize "dusty deck" (that would be code that was originally written on PUNCHCARDS) scientific code.

            Parallel programming is hard. Vectorized code is kind of like "parallel lite" in that it parallelizes very narrow operations without all that messy locking and message passing.

            Oh, there was one thing that the vector excelled at that OSes do a lot of…

      • by Rhys ( 96510 )
        What do you think your Pentium's MMX instructions are? They're vector operations. Every machine on the list is already a hybrid between the two. They aren't dedicated individual vector processors under the command of a master GP-CPU, but a different version of hybridization.

        I'd actually suggest that you'll probably see vector processors marginalized or pushed out eventually by stream processors: aka nvidia/ati graphics boards.
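        The desktop-SIMD point is easy to show in code. MMX itself is integer-only, but its floating-point successor SSE works the same way; this sketch (the function name is invented for illustration) issues four additions as one vector instruction:

            #include <xmmintrin.h>  /* SSE intrinsics: 4 x 32-bit float lanes */

            /* One ADDPS instruction performs 4 adds -- the same SIMD idea as a
               big vector pipe, just with a vector length of 4 instead of a few
               hundred elements. */
            void add4(const float *a, const float *b, float *out) {
                __m128 va = _mm_loadu_ps(a);               /* load 4 floats at once */
                __m128 vb = _mm_loadu_ps(b);
                _mm_storeu_ps(out, _mm_add_ps(va, vb));    /* 4 sums in one op */
            }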
      • Re: (Score:3, Informative)

        by flaming-opus ( 8186 )
        So here's what you're missing: vector processors aren't about doing a lot of math. True, they do that very well, but that's not where they excel. Where vector processors really shine is in memory bandwidth. Vector operations let you take that 4 TB/s of memory bandwidth and actually use it, not spend it all flushing out cache lines. On this machine, a single load instruction can fetch 2KB of data.

        Cell (and many GPUs, or future whatever) have the ability to do a LOT of math, but they do it on a ver…
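        The classic way to see the bandwidth point is a STREAM-style triad loop (sketch below is illustrative; the name is made up): each iteration moves 24 bytes of data for only 2 flops, so sustained speed is set by memory bandwidth rather than by the arithmetic units -- exactly the regime the parent says vector machines are built for.

            /* STREAM-style triad: 2 loads + 1 store (24 bytes) per iteration
               against only 2 flops; performance here is bandwidth-limited. */
            void triad(int n, double *a, const double *b, const double *c, double s) {
                for (int i = 0; i < n; i++)
                    a[i] = b[i] + s * c[i];
            }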
  • "up to" (Score:5, Insightful)

    by Duncan3 ( 10537 ) on Friday October 26, 2007 @01:46AM (#21125057) Homepage
    The only text that can ever follow the words "up to" in computing is "0.1 *". As in "speeds of up to 0.1 * 102.4 GFLOPS". Every time a marketing droid publishes a press release, a kitten dies.

    • I particularly liked:

      This new computer features the addition of an arithmetic unit and an increased number of vector pipelines. This has resulted in the development of the fastest single-chip vector processor with a computing performance of 102.4 GFLOPS per single core, and a wide memory bandwidth of 256GB/s. With a single node incorporating up to 16 CPU, computing performance in excess of 1.6TFLOPS is achieved.

      So, these are 'core solos'?
    • With a single node incorporating up to 16 CPU

      "up to" in computing is "0.1 *".
      Heh, so there are nodes with 1.6 CPUs? What does three-fifths of a CPU look like? Will they call it Tertium?
  • One can only imagine what a game of Tail Gunner or Tempest would look like on this machine.
  • Vector graphics are so 1980's... oh wait
  • Yes, it runs a UNIX System V-compatible OS


    I was reading the description of the system and thinking I would never be able to operate it, as I am such a dinosaur, until I saw the above line. My response is: "This is Unix, I know this!"
  • To keep this to the shortest set of discussion threads ever for ./, let's get right out there with:
    • Yeah, but does it run Linux?
    • I, for one, welcome our new complex large-scale computation climate-adjusting, environment-simulating overlord
    • Imagine a beowulf cluster of these
    • Useless, they didn't open source the hardware
    • Finally, a machine that can run Vista
    • Bet it infringes hundreds of chair-throwing M$ patents
    • Hmm, did I forget any ...

    • Dang, took too long to type. Oh well, have to revert to other, less trustworthy and stable things than the ./ karma system as the foundation for my self-esteem.
    • Hmm, did I forget any ...
      • Does it have hot grits and nekkid Amidalas inside?
      • Oog the caveman will beat it up
      • In soviet russia, vectors are faster than you (wtf?)
      • idk my bff jill
    • Just don't try to patent the multi-meme-karma-whoring method. I'll claim prior art and embarrass your lawyers. No, wait, you really can't embarrass a patent lawyer. No, that's not right, you can't embarrass any lawyer. Anyway, I'll do something.

  • by Tom Womack ( 8005 ) <tom@womack.net> on Friday October 26, 2007 @06:11AM (#21126327) Homepage
    http://www.nec.de/hpc/hardware/sx-series/index.html [www.nec.de]

    There are four PDFs there; the brochure is a four-colour glossy, but there is some real information. Sadly, the interesting-looking white papers are for the SX6, two generations earlier.

    SX-9 summary: 65nm technology, 3.2GHz clock speed, eight vector elements handled per cycle by two multiply and two add units, which is where the 102.4 GFLOPS/CPU figure comes from. 16 CPUs in a box about the size of a standard 42U rack.

    Totally absurdly fast (ten 64-bit words per cycle per CPU) access to a large (options are 512GB or 1TB) shared main memory; absurdly fast (128GB/second) inter-node bandwidth.
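    Those figures are internally consistent - a quick sanity check (trivial C written for this summary, not from the PDFs): 8 elements/cycle times (2 multiplies + 2 adds) is 32 flops per cycle, and 32 x 3.2GHz = 102.4 GFLOPS.

        #include <stdio.h>

        int main(void) {
            double ghz = 3.2;     /* clock speed from the summary above */
            int lanes  = 8;       /* vector elements handled per cycle  */
            int units  = 2 + 2;   /* two multiply + two add units       */
            printf("%.1f GFLOPS/CPU\n", ghz * lanes * units);           /* 102.4 */
            printf("%.1f TFLOPS/node\n", ghz * lanes * units * 16e-3);  /* 1.6   */
            return 0;
        }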
    • NEC keeps plugging along with these CPUs. Those things are incredible. They are, however, VERY VERY vector dependent. They do not run scalar code very fast at all. This many pipe sets and ALUs per pipe requires very long vector lengths to vectorize well. What I'd love to see is one of these vector CPUs tied very closely to a high-speed scalar CPU like an Opteron, Xeon, or POWER6. For a while it sounded like IBM was getting back into the vector game with a POWER6-derived processor, but that seems to have faded…
  • have obligatory cluster yay retarded replies. Firstly ... I highly doubt this is running Linux on the vector part of the core itself; more likely than anything it has a von Neumann machine on the core somewhere, or even separately. And I will say the obvious here, which I am sure everyone knows (especially those people yelling retarded cluster noises): parallelization is only as good as the algorithm (http://en.wikipedia.org/wiki/Amdahl's_law) ... a vector computer is like a bigger version of a GPU with…
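    For reference, the Amdahl's law point linked above boils down to one formula; a minimal C sketch (the 95%/1024-processor figures are just an example):

        #include <stdio.h>

        /* Amdahl's law: with parallel fraction p spread across n processors,
           overall speedup = 1 / ((1 - p) + p / n). */
        double amdahl(double p, double n) {
            return 1.0 / ((1.0 - p) + p / n);
        }

        int main(void) {
            /* Even 95%-parallel code tops out near 20x on 1024 processors. */
            printf("%.1fx\n", amdahl(0.95, 1024));
            return 0;
        }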
    • by juanfgs ( 922455 )
      you must be new here.
    • "...a vector computer is like a bigger version of a GPU..."

      So instead of using these in a Beowulf cluster we could use them to get a killer (no pun) framerate in Battlefield??
  • There is a video news release and interview with the project manager here: http://movie.diginfo.tv/2007/10/26/07-0502-r.php [diginfo.tv]

  • wow... "imagine a Beowulf cluster of those"...
  • by Anonymous Coward
    How many BogoMips does it have? =)
  • Unicos and Cray (Score:3, Interesting)

    by wandazulu ( 265281 ) on Friday October 26, 2007 @09:57AM (#21127937)
    Whenever I hear "supercomputer" and Unix, I think of using a Cray and UNICOS, which was the version of Unix that ran on them. UNICOS was, at least the version I used, the ultimate in bare-bones Unix. I think when people think of Unix today they think of something like Linux or the BSDs or OS X, where the environment is very rich with tools. Unix on a supercomputer is not much more than an interface between your C (or Fortran) program and the bare metal; they don't (again, in my experience) make it the kind of environment you *use*... you get your code on the machine, compile it, submit it, and log off and wait for an email.

    Maybe this NEC machine is different, but Unix on a supercomputer is like the cockpit of a Formula 1 race car: just there to provide a way to steer, comforts be damned.
    • Supercomputer OSes, like all Unix OSes, have gained functionality over the years. In the supercomputing world, data storage and I/O performance are almost as important as the computational job, so a lot of attention is paid to filesystems.
      SUPER-UX is pretty stripped down, but getting better. Cray UNICOS is no longer based on that System V stuff and is either Irix-based on the X1 or Linux-derived on the XT4. The compute nodes are pretty stripped down, but the login nodes are pretty much off the she…
    • The completeness of the *ix environment (real X11 port, vectorizing ANSI C compiler, etc) was one thing that drew customers to Convex vector machines back in the day.
  • The SX-9 closes in on the PFLOPS (one quadrillion floating point operations per second) range by achieving a processing performance of 839 TFLOPS.

    Pretty fast, but IBM will release its Roadrunner [wikipedia.org] at Los Alamos National Laboratory next year with 1.6 PFLOPS:

    The computer is designed for a performance level of 1.6 petaflops peak and to be the world's first TOP500 Linpack sustained 1.0 petaflops system. [...] It will be a hybrid design with more than 16,000 AMD Opteron cores (~2200 IBM x3755 4U servers, each holding four dual core…

    • There are several petaflop machines in their initial phases of roll-out right now, but peak performance isn't the only number worth paying attention to. The SX-9 is an amazing architecture, with orders of magnitude more bandwidth than the Roadrunner system, both in interconnect and in memory bandwidth. It's also a very expensive machine. The SX, however, has the advantage of being an update and refinement of a very established architecture: codes written for the SX-3 are going to perform well on the SX-9.

      Ro…
