Homebrew Cray-1 140
egil writes "Chris Fenton built his own fully functional 1/10 scale Cray-1 supercomputer. True to the original, it includes the couch-seat, but is also binary compatible with the original. Instead of the power-hungry ECL technology, however, the scale model is built around a Xilinx Spartan-3E 1600 development board. All software is available if you want to build one for your own living room. The largest obstacle in the project is to find original software."
The originals really are something else (Score:5, Interesting)
Back when I was an intern with SGI, we took a day off* to visit the Chippewa Falls Museum**, which has a good-size collection of Control Data and Cray Supercomputer relics along with other items relevant to my interests***.
I got to poke around inside of an original Cray-1. To me, the most interesting thing about it was just how insanely packed the internal wiring was; I'd been expecting the intricate plumbing, but the sheer volume of wires running from Point A to Point B was really impressive. I mentioned this to the guy giving the tour, who turned out to be a retired manufacturing supervisor -- he told me that the hardest part of his job was finding women with both enough skill and small enough hands to handle the internal wiring jobs. The thing had been assembled *by hand*, every connection in this crazy bulk of wired clipped or soldered into place one after another.
Anyhow, after that I sat on the couch. It was not comfy.
* My boss was *pissed* about this -- she went around telling anyone that would listen that "interns are here to work, not go sightseeing". This marks the one and only time in my career that anyone in HR has ever done anything worthwhile, calling her up and telling her this was part of the program and she didn't get a vote.
** Seymour Cray moved to Chippewa Falls, his hometown, when he was still with Control Data because he felt most middle managers wouldn't want to drive that far just to bother him. Visionary man, that Seymour Cray.
*** Stuff from Leinenkugals.
Re:The originals really are something else (Score:5, Interesting)
IIRC, they also deliberately hired both tall and short women, and assigned the appropriate subset of connections to each.
Re: (Score:2)
Re: (Score:3, Funny)
Anyhow, after that I sat on the couch. It was not comfy.
+1 Informative
Re: (Score:2)
I agree, I always wondered about that couch. :)
Re: (Score:2)
Good example of why I often wish that AC couldn't post on Slashdot.
Re: (Score:2)
Hence the reply filtering.
If one people would stop responding to AC's, we wouldn't even know they existed.
Re: (Score:3, Insightful)
It's not a problem with the ability to post AC; it's a problem with slashdot's moderation, which is hugely broken. -1, troll, is the same as -1, disagree - and they're both very common. If you turn off by points, you turn off the comments the moderators disagree with, and those are quite often every bit as interesting and valid as something modded up to five. Likewise, some things can only be said as an AC; and that's why the role needs to remain, and why again, if you turn them off, you miss comments of v
Re: (Score:2)
One mans troll is another's disagree.
I do not think that there is anything one can only say as an AC.
I mean what your afraid of loosing Karma? Karma doesn't affect your paycheck or get you put in jail.
Being an AC doesn't pass your comment through a TOR node so it doesn't offer you any protection from the government.
I have decided long ago to never post as an AC. There are times when I know my post will get moded down because I dare to criticize the one of the Slashdot sacred cows. There are also times when
Re: (Score:2)
I don't think a GNAA post is anyone's "disagree"; so I'm just going to have to disagree.
That only reflects your shallow understanding, based on your limited experience. Suppose the poster has been forbidden to use the Internet by the courts? Do you think it's wise to expose your username to Google under those conditions? And do you think the courts are always right, *especially* when it comes to f
Re: (Score:2)
"Suppose the poster has been forbidden to use the Internet by the courts? "
Actually in that case your just nuts to be posting on slashdot to start with. But if you are dumb enough to do it then create a new account with a different username.
If the court ordered you to not use the internet which has happened how often? I would hope that you are taking a lot more precautions than just posting as an AC. Frankly I would say that if you think posting as an AC gives you any protection from law enforcement then it
Re: (Score:2)
Re: (Score:2)
My SunServer 600 is less drastic, but an example of the same phenomenon. It was free, but it has a 1400-watt power supply for 90Mhz or processor and 192 MB of memory.
Re: (Score:2)
Anyhow, after that I sat on the couch. It was not comfy.
I guess the measure of the comfort of integrated seating on supercomputers is a kind of "bench"mark...
Re: (Score:3, Interesting)
Re:The originals really are something else (Score:5, Interesting)
BTW, the Leinie's Lodge is also well worth visiting! It's less than a half mile from the museum, and is actually the number one tourist attraction in at least a 50-mile radius.
Re: (Score:2)
I used to work in Engineering -- this takes me back. Who owns those building now? Celestica? The new Cray spin-off?
Re:The originals really are something else (Score:4, Interesting)
For those of you in/visiting the DC area you can check out a couple of old Crays @ the National Cryptologic Museum on the outskirts of Ft Meade. http://www.nsa.gov/about/cryptologic_heritage/museum/virtual_tour/index.shtml [nsa.gov]
On exhibit in the museum are two Cray supercomputers. The XMP-24 on display is the upgrade to the original XMP-22 that was the first supercomputer Cray ever delivered to a customer site. It was in operation from 1983 to 1993 and was arguably the most powerful computer in the world when it was delivered. It used serial processing to conduct 420 million operations per second.
The second generation Cray, the YMP, replaced the older version in 1993. It had a 32 gigabyte (32 billion bytes) memory capacity. In 1993 most personal computers held only 16 million bytes. The YMP used vector processing, a very powerful form of overlapping, parallel processing to conduct 2.67 billion operations per second. The YMP was decommissioned and went on display at the museum in 2000.
The museum is lots of fun and definitely worth a visit.
Re:The originals really are something else (Score:5, Informative)
Re:The originals really are something else (Score:4, Interesting)
a shorter way of saying this is "phase matched cables".
http://www.mwjournal.com/journal/article.asp?HH_ID=AR_7184&tite=Phase%20Matching%20and%20Tracking%20of%20Coaxial%20Cable%20Sets [mwjournal.com]
pay no mind to the fact that it's discussing microwave applications. it's just as relevant to high speed digital (or lower speed, further distance) digital electronics. keep in mind how far this interconnect had to travel due to part density and cooling needs.
of course for modern electronics one might use tunable LC networks on either end to accommodate variations in commodity sourced cables.
also, i take issue with ECL being described as inefficient. CMOS is inefficient when transitioning a lot. ECL merely has a constant power drain, which also happens to make it less noisy. It was also differential before LVDS was cool (or even around).
Re: (Score:2)
"Why," you may ask, "was the internal wiring so insanely packed?" The length of each point-to-point wire was individually calibrated, such that all the signals to each gate arrived at the same moment, so you didn't need flip-flops to latch values in the flow of the circuits.
Deskewing the signals didn't eliminate the need to flop the levels - by a long shot.
What it's about is arranging the placement of gates and routing of signals so you avoid situations where most of the inputs to a logic function have arri
Re: (Score:2)
"Why," you may ask, "was the internal wiring so insanely packed?" The length of each point-to-point wire was individually calibrated, such that all the signals to each gate arrived [after a known amount of time delay] ...
Cray did this from his earliest designs for Control Data.
His first machines were the CDC 160 (a low-power I/O processor) followed by the CDC 1604 (his first "mainframe"). I "cut my hacker teeth" on a 1604-B, serial number 6.
That was a 48-bit machine (with two instructions per word to save
one at UCAR Boulder (Score:2)
Re: (Score:2)
You mean NCAR don't you?
I visited there when my Grandma still lived in Boulder back in 1986, and saw the Cray which, IIRC, was still operational at that time.
NCAR Cray 1-A (Score:5, Interesting)
The new computing center is still under construction, so no supercomputers are located there yet. :-) Cray 1-A serial number 3 is in the hallway as an exhibit on floor 1B, and if you are near the NCAR Mesa Lab, it is open for self guided tours most days of the year.
Serial number 3 was in active use until its decommissioning in 1989 and ran COS (Cray Operating System). It was connected to the NCAR designed Hyperchannel network known initially as the NCAR Local Network (NLN) and later as the Mainframe and Server Network (MASnet). There were rows of 100 MB and 200 MB "washing machine" disk drives connected to it, and it had access to the NCAR Mass Storage System (MSS) for archival storage. Graphical output could be sent to plotters, large Xerox printers, B&W microfiche or B&W or color 35mm film. For a speed comparison, I once ran a Madelbrot generator to produce a 640x480 image on both the Cray 1-A and a Sun 60 workstation. The vectorized C code on the Cray took just under 8 seconds, the Sun several minutes to produce the image. Alas, we don't have much in the way of documentation anymore nor is there any of the original software.
Re: (Score:2)
Very interesting. Thanks for that. Sounds like you worked there.
How did the direct to film imaging work? CRT though optics to film?
Re: (Score:3, Informative)
Very interesting. Thanks for that. Sounds like you worked there.
How did the direct to film imaging work? CRT though optics to film?
Yes, I still do work there. The film output did indeed work like you suspect. They were made by Dicomed, and we ran them so hard that our in house electronics maintenance personnel were well acquainted with them. The Dicomeds supported a 4096x4096 vector graphic display, and the usable portion depended on the output format and frame size. Raster output could be performed by displaying each point for the required amount of time to produce the desired exposure. The color versions had a color wheel and re
Re: (Score:3, Funny)
Thank you. Thank you, thank you, thank you. I've always wondered how comfy those seats were, nearly as much as I've wondered why the hell they were there in the first place. Well now I know, and will die ever so slightly happier because of it. That's the most interesting thing I have ever read on Slashdot and I salute you for it. "It was not comfy". Thank you sir.
(And I'm being utterly
Re: (Score:2)
Re: (Score:2)
As I recall the Cray was curved with wedge-shaped boards and wiring on the inside of the curve to minimize the length of interconnecting wires and thus maximize logic speed. The outside of the tower was for access to the cards. And that meant the power supplies, cooling plumbing, and other support miscellany had to go somewhere else. "Somewhere else" was further out and at floor level - to be out of the way of acce
Re: (Score:2)
The wiring was the key to why it was so fast. All the interconnects. Something this *emulation* wont really have.
Re: (Score:3, Interesting)
Re: (Score:2)
Xilinx... (Score:5, Informative)
After months of free time development, I switched over to surplus Altera Stratix II video decoder hardware, got a copy of Quartus II, and was moving within weeks. Altera would be my suggestion for any geek who wants to try something similar!
Re:Xilinx... (Score:4, Interesting)
Re:Xilinx... (Score:5, Funny)
And you've been playing WoW on it ever since.
Re: (Score:2)
I built a P[D]P11 "supercluster" and started with Xilinx. The hardware is great, but their software toolset is horrendous.
Ran into a similar problem a few years ago with a FPGA programming tool. (I think it was with Xilinx but I'd have to check.)
We were trying an FPGA implementation of a heavily-channelized ASIC, using the same trick Cray used for the peripheral processors in his CDC-6x00 series: One set of fast logic switching between N sets of states stored in a N-word memory to handle N channels (in hi
Apparently... (Score:5, Funny)
Not even a cray supercomputer can survive the slashdot effect.
Re:Apparently... (Score:5, Funny)
Hey! The computer predates Slashdot by two decades, for craying out loud!
Re:Apparently... (Score:5, Funny)
Re:Apparently... (Score:4, Funny)
I hope you realise that jokes of this quality will get you downmodded. And when that happens, don't come craying to me.
Wow! (Score:5, Interesting)
Now that's closer to true News for Nerds, Stuff that Matters. Not the Mac fish tanks.
However, I am a little disappointed that he didn't do something with the tower part of his cray. Cooling perhaps? Blinkenlights?
How does it compare in performance with the original?
Re:Wow! (Score:5, Informative)
It's instructions execute accurately clock-for-clock, but running at 33 MHz instead of 80.
Re: (Score:3, Funny)
Why, cray tell, does it run so slowly?
Re: (Score:3, Informative)
Why, cray tell, does it run so slowly?
I have one of those, the Spartan board, not a Cray-1. I did not remember, but checked online and the Spartan board has a 66 MHz canned oscillator. So, his design probably uses two clock cycles per instruction cycle.
Probably also limited my the memory speed of whatever he's using for memory. 33 MHz equals what 30 ns access cycle?
Re:Wow! (Score:5, Informative)
Chances are (I can't get to the site) it just runs at 33MHz as its best-supported clock frequency. An S3E is a pretty cheap and slow FPGA - I remember writing a 32-bit CPU for one, and until I started optimising the logic-placement in the FPGA, it was only running at ~30MHz. I got it up to ~50MHz after tweaking and pipelining, but his design may do more than my simple CPU.
Simon
Re: (Score:2)
The S3E itself can be clocked internally at 300+ MHz. However, the maximum speed achievable depends on the architecture and layout of the circuit implemented. The maximum clock is dependent on the longest logic and routing delay through the circuit. Since the design is apparently a register for register copy of the original Cray architecture, the original ECL logic still has a speed advantage over the CMOS S3E.
Re: (Score:3, Informative)
The S3E itself can be clocked internally at 300+ MHz. However, the maximum speed achievable depends on the architecture and layout of the circuit implemented. The maximum clock is dependent on the longest logic and routing delay through the circuit. Since the design is apparently a register for register copy of the original Cray architecture, the original ECL logic still has a speed advantage over the CMOS S3E.
Not having seen the design, I don't know how it's been implemented, but it's possible to have a compatible design that implements all the original specifications without designing it the same way... It's also possible for an FPGA design to run faster than the original part - see the multiple-tens-of-MHz variants of the equally venerable 6502 (which maxed out at ~2MHz at the time) for example.
Clearly, the logic path dictates the final speed. That's why placement is so important, and why hand-placement is
Re: (Score:2)
Yes, the 6502 is equally venerable. No, the Commodore 64 or Atari 600 XL was not the same class of machine as a Cray 1 supercomputer. The 6502 did not have vector registers nor twelve pipelines. It didn't handle 32 megabytes of memory. It was a microprocessor, which is a name it gets from being three or four classes of computer below a supercomputer.
supercomputer > mainframe > minicomputer > microcomputer workstation > home and hobby microcomputer
Re: (Score:2)
To address your points individually, however:
Re: (Score:2)
Yes, "a chip" of that vintage. A 2 Mhz MOS microprocessor. Yet the Cray wasn't a single microprocessor. Getting a 50x speedup of a single MOS microprocessor on a single newer FPGA chip is one thing. Getting a 50x speedup from a wire-and-lead NAND array machine running at 80 MHz on a single FPGA isn't quite the same thing.
Not only were the data items 64 bits, but the vector registers could each be fed a 64-bit word each clock cycle. The four 64-bit instruction buffers could be completely filled with the 16-b
Re: (Score:2)
Simon.
Re: (Score:2)
Re:Wow! (Score:4, Interesting)
Why, cray tell, does it run so slowly?
You do realize the Cray-1 is from the late 1970s, right? 80MHz with this architecture was smoking fast.
Re: (Score:2)
Yeah, home computers were non-existant (I remember reading an article in Popular Mechanics about the cray 1 in 1976).
We got our first home computer (a TRS-80 model I) in 1978, and we were hardly the first. I was quite amused to read an article (one of Jon Bentley's Programming Pearls) some years later that showed that a TRS-80 could actually be faster than a Cray-1 (assuming the Cray-1 was using a sufficiently poor algorithm and the TRS-80 wasn't).
Re:Wow! (Score:4, Insightful)
Half the fun of explaining the Cray 1 during museum tours was comparing its cycle time to the time it took light from the nearby ceiling spotlight to hit the Cray. At 33 MHz that would require a really tall room.
Re: (Score:2)
he isn't running his webserver on it, is he? (Score:5, Insightful)
because it just got slashdotted...
No software (Score:5, Interesting)
Never the less, I have to admire the effort put into this.
Re:No software (Score:5, Interesting)
Re:No software (Score:4, Funny)
He has the needed software for the FPGA, but he has (so far) been unable to find some software to run on the machine. At least that is what I got from the TFA. It seems like no-one (including various 3 letter agencies) have copies of stuff so "old".
Maybe he'd have better luck finding old software if ageism wasn't rampant in the IT industry [slashdot.org] ;-)
Pretty cool! (Score:5, Insightful)
This is the sorta hack that should feature on the front page, instead of machining tin can case and similar tripes.
Hope he gets some software for that thing.
But (Score:2, Funny)
What version of Windows does it run? I might have a copy here...
Doesn't everything run Windows?
Re:But (Score:4, Informative)
well, this is a mid 70s computer, so it must have run CP/M 8D
if he really does want to run real Cray software, he'll have to implement the interrupts and context switching for Cray Operating System (COS) or the Unix Unicos
Re:But (Score:5, Insightful)
Doesn't everything run Windows?
Everything runs Linux, but Windows seems to run everything.
Re: (Score:2, Funny)
You misspelled ruin.
Re: (Score:2)
Sad, but true. I sometimes get scared when I see what runs on a networked Windows (XP). Neonatal monitors, respiration controllers, fMRI consoles, oscilloscopes, websites...
The problem with most of these things is that they never can get updated to the latest service packs (and thus all run without Service Pack or Service Pack 1) or have any other software (like virus scanners) installed because that would void the warranty/FDA approval/application support.
Re: (Score:2)
Doesn't everything run Windows?
Everything runs Linux, but Windows seems to run everything.
At NASA Lewis/Glenn there was a Cray XMP that was later upgraded to a YMP. To run programs on the Cray required that the program be compiled on an IBM 360. There were two IBM's there. I was involved with the installation of power cables to peripheral equipment and modifications to the 400Hz UPS.
Actually it runs Crysis (Score:2, Funny)
Where's the VAX? (Score:3, Interesting)
When's he going to build the VAX front end? TFA alludes to the fact that the Cray-1 needed a dedicated mini as a front end, and sounds like he might actually get a diskpack from one (or image thereof).
Re: (Score:2)
Yeah, but... (Score:2)
theoretically, ECL would trounce any CMOS tech fabricated with current accuracy. It's just that it's so horribly power-hungry, that nobody will do this. For one thing, it would need megagalactic cooling.
Re: (Score:2)
Although individual gates of ECL could run faster than individual gates of CMOS or even dynamic NMOS, ICs of useful complexity for single-chip modern CPUs are not likely to be competitive if implemented in ECL. The problem is power density. Modern fast CPUs are already dissipation limited. ECL is inherently a 5 volt technology (I suppose cleverness could improve that somewhat) so it starts with a 4:1 heat disadvantage compared to MOS, and it only gets worse because each gate must always have current flowing
Re: (Score:2)
theoretically, ECL would trounce any CMOS tech fabricated with current accuracy. It's just that it's so horribly power-hungry, that nobody will do this.
CMOS beats ECL in integrated circuits because you can put a LOT of gates VERY CLOSE TOGETHER without cooking them. This means that, though you lose on gate speed, you gain more back in short propagation time on the interconnects.
ECL is inherently hot because each gate starts with an analog current regulator, followed by logic that "steers" the current into
Re: (Score:2)
Nowhere to convect the heat to for a start.
Correct.
NCAR (Score:5, Informative)
Send an email to the folks at the CISL [ucar.edu] division of NCAR [ucar.edu].
They know a thing or two [ucar.edu] about Crays.
Re:NCAR (Score:4, Informative)
See my reply earlier on in this discussion. We didn't keep any of the Cray software, unfortunately.
Re: (Score:2)
See my reply earlier on in this discussion. We didn't keep any of the Cray software, unfortunately.
do you regret that now?
Re: (Score:2)
do you regret that now?
I do, but unfortunately I wasn't part of the group that managed the Crays, and I didn't think to track down the documentation and distribution media to archive copies to more modern media.
Really cool (Score:5, Insightful)
Re:Really cool (Score:4, Interesting)
I clicked the "+" next to the title, then selected "interesting"
Re: (Score:2)
You must have JavaScript turned on.
Plebes, pshaw!
Can't find the software? (Score:3, Informative)
From TFS:
All software is available [chrisfenton.com] if you want to build one for your own living room. The largest obstacle in the project is to find original software."
Um... why not just click on the little link provided there?
Re:Can't find the software? (Score:5, Interesting)
It's obvious from your comment that you haven't downloaded and inspected his source code. It includes some verilog files for making the FPGA behave like a Cray-1, and some python files for debugging it and loading opcodes into the simulated cray. However, if you want to run vintage 1970s computer applications----weather simulations, cryptanalysis, computational fluid dynamics, etc., you would be hard pressed to find any.
Chris - see the Supercomputer Centers, CMU, UCSD (Score:5, Informative)
I think there were (are?) four of Supercomputer Centers that had Cray 1 and later Cray X-MP machines. The Pittsburgh center did a lot of work with Carnegie Mellon, esp. the Robotics Institute.
I personally did one bit of work - porting a photometrically correct ray-tracer by Dr. Robert Thibadeau in the Image Understanding Laboratory from an Apollo workstation to the Cray at PSC - this would have been in 1989, I think. The one complication we had was that the Cray floating point format was different, so our first runs were all zeros. Other than that the code compiled and ran fine on the Cray. Of course, a run that took two weeks on the Apollo ran in about 40 seconds on the Cray.
A lot, maybe all of the work done on these machines was non-spooky research so perhaps you can track some of the professors at the associated universities, such as CMU, Northern Illininois, UCSD, Berkeley, etc. Also check out the weather folks - they have been among the biggest CPU cycle-burners for a long time. I worked briefly with one weather guy at a weather research facility in Wyoming but I don't recall any details - was it U Wy?
The SCs I recall are:
I'm sure that if you dig around in the universities you'll find folks who have stuff piled on a back shelf somewhere (probably in a tape format you can't read). Also look up in the old annals of the ACM SIG on supercomputing - that will give a line on researchers who were working on the Cray.
Re: (Score:3, Funny)
You mean that nobody has ported GCC to it yet? It must run NetBSD right?
Shocked I tell you just shocked.
Re: (Score:2)
I'm guessing the weapons simulations are part of the software collection for Crays he's never going to get.
Re: (Score:2)
Haha! Funny - I still had several Perq workstations stashed in a storage unit until a couple of years ago. I donated them to Free Geek [freegeek.org] because I was moving across country and had no way to move them. I tried to explain to them that they were unique, collectible, etc. and should be sold as such, but I have no idea if they actually followed through.
Perq was an interesting machine, and that was an interesting time.
CTSS? (Score:2)
If memory serves (and wikipedia can help for those who don't recall), the first systems were shipped with just an assembler. Livermore and Los Alamos pulled together the first OS for it. COS (the Cray Operating System) came a little later. The Labs stuck with their own OS for quite some time.
I haven't seen a source bundle, but since the OS shouldn't have been classified and it was paid for 100% tax dollars, someone probably could score a copy (Freedom of information request??).
ARTICLE TEXT (Score:5, Informative)
As part two (see previous attempt) of my ongoing series in ‘computational necromancy,’ I’ve spent the last year and a half or so constructing my own 1/10-scale, binary-compatible, cycle-accurate Cray-1. This project falls purely into the “because I can!” category - I was poking around the internet one day looking for a Cray emulator and came up dry, so I decided to do something about it. Luckily, the Cray-1 hardware reference manual turned out to be useful enough that implementing most of this was pretty straightforward. The Cray-1 is one of those iconic machines that just makes you say “Now that’s a super computer!” Sure, your iPhone is 10X faster, and it’s completely useless to own one, but admit it . . you really want one, don’t you?
The Cray-1A Architecture
Now, let’s get down to specs - What is this bad boy running? The original machine ran at a blistering 80 MHz, and could use from 256-4096 kilowords (32 megabytes!) of memory. It has 12 independent, fully-pipelined execution units, and with the help of clever programming, can peak at 3 floating-point operations per cycle. Here’s a diagram of the overall architecture:
cray_architecture
It’s a fairly RISC-y design, with 8 64-bit scalar (S) registers , 8 64-bit/64-word vector (V) registers, and 8 24-bit address (A) registers. Rather than a traditional cache, it uses a ’software-managed’ cache with an additional 64 64-bit words (T registers) and 64 24-bit words (B registers). There are instructions to transfer data between memory and registers, and then register-to-register ‘compute’ instructions.
One of the coolest aspects of this machine is that everything is fully pipelined. This machine was designed to be fast, so if you’re careful, you can actually get one (or more) instruction every cycle. This has some interesting implications - there’s no ‘divide’ instruction, for instance, because it can take a variable amount of time to finish. To perform a divide, you need to first compute the ‘reciprocal approximation’ (something we *can* do in exactly 13 cycles, it turns out) of the denominator value, and then perform a separate multiply of that result with the numerator.
The vector instructions are particularly cool. A vector Add operation might take only 5 cycles to start producing results (remember, each vector can hold 64 values, so it takes 5 + 64 cycles to finish adding). Why wait for it to finish though? We can take the result output from the adder, and “chain” it straight into another vector unit (say a multiplier). And *that* only takes another 10 cycles or so, so we can chain that result into yet another unit (say, reciprocal approximation). Now, rather than waiting for the first operation to finish, we’re computing up to 3 floating point calculations per cycle. Clever programmers could sustain about 2 floating point operations per cycle, or 160 million instructions per second.
vector_chainingVector Chaining in Action!
The Hardware
The actual design was implemented in a Xilinx Spartan-3E 1600 development board. This is basically the biggest FPGA you can buy that doesn’t cost thousands of dollars for a devkit. The Cray occupies about 75% of the logic resources, and all of the block RAM.
spartan3_1600
This gives us a spiffy Cray-1A running at about 33 MHz, with about 4 kilowords of RAM. The only features currently missing are:
-Interrupts
-Exchange Packages (this is how the Cray does ‘context-switching’ - it was intended as a batch-processing machine)
-I/O Channels (I just memory-mapped the UART I added to it).
If I ever find some software for this thing (or just get bored), I’ll probably go ahead and add the missing features. For now, though, everything else works sufficiently well to execute small test programs and such.
The Software
When I started building this, I thought “Oh, I
One of my favorite quotes (Score:3)
If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?
—Seymour Cray [google.com]
Re: (Score:2)
How much to cool the chickens and coordinate the yokes?
Since this is what many clusters do -- HPC that used to be on custom-built supers -- the quote is still as cool as ever but the question is now far from rhetorical.
now that the slashdot effect has subsided.... (Score:2, Funny)
damn... my wife doesn't know what a Cray is, doesn't care what a Cray is and certainly wouldn't want to help me paint it....
in other news, i approve of the Bailey's and Stoli bottles on the table behind the mini-cray...
wow (Score:2, Funny)
imagine a beowulf cluster of these! ;)
Seymour Cray and Steve Jobs (Score:3, Informative)
Ages ago, I heard this story. Can anyone confirm if this is true or not?
Seems Steve Jobs, upon the success of the first Macs, was getting ready for the next step and he went to Cray Computer to buy one (probably to help design the PowerPC?).
Anyway, Cray Computers were not just sitting on the shelf waiting to be sold, so it seems Jobs created an altercation and demanded to see the manager about getting one, so they called Seymour down to the lobby. Steve introduced himself and said words to the effect of “I’d like to use a Cray to design the next Apple Computer”. Seymour replied “Thats great. I used an Apple Computer to design my Cray”.
Re:Seymour Cray and Steve Jobs (Score:4, Informative)
Not sure about your quote, but Apple did have a Cray. That's why they're address is "1 Infinite Loop" - the joke was the Cray was so fast, it ran an infinite loop in seconds.
Then again, a quick Google came up with these links
http://www.clock.org/~fair/computers/sgi-cray.html [clock.org] (it was used for a supercomputing project, and it was Sculley)
http://www.thocp.net/biographies/cray_seymour.htm [thocp.net] claims the quote is "When told that Steve Jobs bought a CRAY to help design the next Apple, Seymour Cray said, "Funny, I am using an Apple to simulate the CRAY-3." http://www.spikynorman.dsl.pipex.com/CrayWWWStuff/Cfaqp3.html#TOC23 [pipex.com] seems to have a more detailed version of the Apple-Cray connection.
I guess the next question is - why didn't Microsoft have one?
Re: (Score:2)
This may be just a story or it may be the real deal. The folks I heard it from were in a position to know, so I favor the latter. Perhaps someone out there can be more certain on this.
Apple's Cray was used, among other things, to simulate plastic-flow and cooling for injection molding tasks. This was necessary because injection-molding complex shapes such that they did not have internal cleave-planes from cooling molten plastic from one injector coming into contact with cooling molten plastic from another
Where is the rest? (Score:2)
Cool electronics, shitty case mod (Score:2)
Building a binary-compatible Cray clone with an FPGA board: Awesome!
Putting it in a case that's...sorta roughly the same shape as a Cray-1: LAME!
Here's a MUCH better scale Cray-1 case mod housing two x86 PCs:
http://www.bit-tech.net/modding/case-mod/2010/07/28/cray-1-by-daryl-brach/7 [bit-tech.net]
Re: (Score:2)
Actually, the microprocessor CPU in the F-14 Tomcat fight was developed with arithmetic pipelines and multiple discrete logic units in the late 1960s. It wasn't a general-purpose machine, but it did use the method.
It could load both the multiplicand and multiplier into the multiplication unit in one cycle, do the multiplication the next, then push out the result while loading the next set. The divider did the same thing at the same time, without requiring an instruction stream since there was always data fl