Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Supercomputing Power

Tesla Unveils New Dojo Supercomputer So Powerful It Tripped the Power Grid (electrek.co) 106

An anonymous reader quotes a report from Electrek: Tesla has unveiled its latest version of its Dojo supercomputer and it's apparently so powerful that it tripped the power grid in Palo Alto. Dojo is Tesla's own custom supercomputer platform built from the ground up for AI machine learning and more specifically for video training using the video data coming from its fleet of vehicles. [...] Last year, at Tesla's AI Day, the company unveiled its Dojo supercomputer, but the company was still ramping up its effort at the time. It only had its first chip and training tiles, and it was still working on building a full Dojo cabinet and cluster or "Exapod." Now Tesla has unveiled the progress made with the Dojo program over the last year during its AI Day 2022 last night.

The company confirmed that it managed to go from a chip and tile to now a system tray and a full cabinet. Tesla claims it can replace 6 GPU boxes with a single Dojo tile, which the company claims costs less than one GPU box. There are 6 of those tiles per tray. Tesla says that a single tray is the equivalent of "3 to 4 fully-loaded supercomputer racks." The company is integrating its host interface directly on the system tray to create a big full host assembly. Tesla can fit two of these system trays with host assembly into a single Dojo cabinet. That's pretty much where Tesla is right now as the automaker is still developing and testing the infrastructure needed to put a few cabinets together to create the first "Dojo Exapod."

Bill Chang, Tesla's Principal System Engineer for Dojo, said: "We knew that we had to reexamine every aspect of the data center infrastructure in order to support our unprecedented cooling and power density." They had to develop their own high-powered cooling and power system to power the Dojo cabinets. Chang said that Tesla tripped their local electric grid's substation when testing the infrastructure earlier this year: "Earlier this year, we started load testing our power and cooling infrastructure and we were able to push it over 2 MW before we tripped our substation and got a call from the city." Tesla released the main specs of a Dojo Exapod: 1.1 EFLOP, 1.3 TB SRAM, and 13 TB high-bandwidth DRAM.

This discussion has been archived. No new comments can be posted.

Tesla Unveils New Dojo Supercomputer So Powerful It Tripped the Power Grid

Comments Filter:
  • by allaunjsilverfox2 ( 882195 ) on Sunday October 02, 2022 @12:21PM (#62931019) Homepage Journal
    Considering they literally make backup power supplies.
    • They probably just shorted the circuit, I can do that too.

      • Yeah I think it's more likely that Musk just didn't adequately coordinate with the local utility.

      • by HotNeedleOfInquiry ( 598897 ) on Sunday October 02, 2022 @04:06PM (#62931433)
        This. In any case, the circuit breakers feeding the unit under test should have tripped first, then the ones feeding the Tesla plant. That they would trip breakers outside their infrastructure shows poor planning.
        • That they would trip breakers outside their infrastructure shows poor planning.

          Why? If Tesla's system is designed for 2 MegaWatts and all the breakers are sized accordingly, but powering up their systems to 1.5 MegaWatts caused the local grid to exceed its capacity to both Tesla and all its other customers, then everything is working as intended. In a made up example, if the local substation is designed for 6MW and it was already supplying 5MW, then Tesla's additional 1.5MW would cause it to trip as designed.

          It could be that Tesla needs to find a supplier that can supply 2MW without

    • by AmiMoJo ( 196126 )

      Embarrassing for the headline writer too. Ancient mainframes used vast amounts of power for their vacuum tubes, it didn't make them powerful in terms of performance.

      We need to some some performance/Watt numbers for this thing to know if it's really any good.

    • by klubar ( 591384 )

      Isn't this kind of basic power engineering that is well solved for every data center and industrial users. You should know what your power draw is and the voltage/current of your feeds. It's not completely simple math, but a well-solved engineering problem.

      Doesn't show a lot of confidence in planning if you can't do electrical 101. Also, data center in Palo Alto? Doesn't seem like the ideal locations as the data center can be located almost anywhere. Real estate costs, grid capability and costs per megawatt

  • by thegarbz ( 1787294 ) on Sunday October 02, 2022 @12:27PM (#62931027)

    "Tesla Unveils New Dojo Supercomputer So Incompetently Run It Tripped the Power Grid"

    Doesn't matter how big your thing is. Tripping the substation in the city exposes your incompetence and inability to manage your contractually agreed load.

    Nice computer, but not a shining review of Tesla's business practices.

    • Re: (Score:2, Interesting)

      by leptons ( 891340 )
      "Texas power grid continues to prove it can't handle supplying power".

      At least nobody probably died this time, but powering things in Texas comes with a known risk. Their power grid is not reliable.

      https://en.wikipedia.org/wiki/2021_Texas_power_crisis [wikipedia.org]
      • by Xenx ( 2211586 )
        Palo Alto is in California. Different set of power grid issues.
      • "Texas power grid continues to prove it can't handle supplying power".

        Except that isn't the case here and has nothing to do with the topic at hand. Save your comment for actual grid problems rather than a customer triggering circuit protection.

    • by Rademir ( 168324 )

      Imagine a Beowulf cluster of these. ;-)

      They could take down the whole western grid!

    • Re: (Score:2, Insightful)

      by mobby_6kl ( 668092 )

      It's the kind of statement that pseudotechie Musk fanboys just love. Haha we just blew up another extremely expensive rocket, isn't science great!

    • by AmiMoJo ( 196126 ) on Sunday October 02, 2022 @02:54PM (#62931275) Homepage Journal

      I may be a complete waste of time and energy anyway.

      Look at how Waymo's self driving cars work, compared to Tesla's. Waymo uses lidar and high end cameras to capture a huge amount of information about the environment. While some machine learning is used to identify objects, it is working with a lot more information than Tesla's camera only solution.

      Recently there have been multiple incidents of Tesla cars ramming into motorcyclists at high speed because they see two tail lights and assume from the distance apart that they are seeing a car far off in the distance. Waymo's cars have lidar too, so they can see that it's a bike even in complete darkness and poor weather when their cameras cannot. They have algorithmic checks in place so that even if the machine learning thinks it sees a far off car, the algorithm can see that it's too narrow and the camera got the distance completely wrong thanks to the lidar, so ignore that.

      Secondly, Waymo uses algorithms to determine the path the car will take. It uses algorithms to determine the possible paths that other vehicles, cyclists and pedestrians may take. Tesla is trying to teach machine learning to drive. An algorithm can be audited and humans can make decisions about what kind of driver it should be by directly controlling its decision making logic. Machine learning is a black art where even the most accomplished developers can't stop their chat bots becoming Nazis within 5 minutes of meeting Twitter, or remove racial factors from decision making. Some of them can't even get their cameras to recognize darker skin, kinda important when it's your car's vision system.

      • Tesla's autonomous systems are obviously not as good as Waymo's. However, they have the only commercial system past Level 2 on the road (not counting the very few Honda and Mercedes official Level-3 cars) because they have enough moxie to pass off an attempted Level-3 system as a legal Level-2 while marketing as a Level-3. Well, at least until the government starts doing its job and calling Tesla on their marketing fraud. The government has shirked its responsibility for many years, so it's still not cle

      • by nasch ( 598556 )

        I may be a complete waste of time and energy anyway.

        Don't be so hard on yourself, bro.

      • Re: (Score:3, Insightful)

        by thegarbz ( 1787294 )

        Look at how Waymo's self driving cars work

        I have and I really root for Tesla in this case. We should be putting every effort into decent machine learning. Throwing very sensitive, expensive, and really high end sensing technology into every car is not a sustainable future.

        We live in a world where people simultaneously complain about things being both too dumb and too expensive. Heaping more costs on it isn't the answer.

        Tesla has a way to go but at they are at least leading towards a reasonable solution. Waymo's (and indeed basically every other com

        • by AmiMoJo ( 196126 )

          The tech will get cheaper as commercial users adopt it. Makes financial sense for taxis and haulage.

          That's the same plan Musk had, except he started with luxury consumer models instead of commercial ones. Interestingly rivals like BYD and CATL went with commercial vehicles, and now have the best battery tech.

          • The tech will get cheaper as commercial users adopt it. Makes financial sense for taxis and haulage.

            There is a fundamental floor to tech costs. LIDARS are highly complex pieces of gear. They won't ever be as cheap as not having them at all (LIDAR units still come with advanced cameras too so you don't even get to substitute the cost in this case).

            LIDARS in their current form will never make it to general purpose mass purchased consumer vehicles. They may find a home in the luxury vehicle market, but something needs to fundamentally change in the technology to make it viable.

            • by AmiMoJo ( 196126 )

              Radar used to be expensive and highly complex, now you can buy single chip solutions and it is mandatory on new cars.

              • Radar used to be expensive and highly complex, now you can buy single chip solutions and it is mandatory on new cars.

                Indeed it did. However radars are still quite expensive and still more expensive than nothing at all. Speaking of Radars are nothing like they were in the past so when you are thinking about your next reply consider the fact that I said "LIDARS in their current form"

                But congrats on moving the goalposts. You've jumped from economies of scale to magic technology changes to make your argument, and for the latter I absolutely agree with. I just caution you to not hold your breath, especially considering LIDARS

            • I'm curious what's going to happen when there's thousands of these radars on every street, driving around picking up thousands of stray signals.

              Hopefully the DSPs will be able to pick their coded signal out of the noise in time to make use of it.

              Has anyone seen any research into this area?

    • In the old days it was something like, "Car so powerful it will shake the doors off your neighbor's house."

      Of course if it were true it would just be a problem caused by a malfunctioning exhaust, poorly balanced engine, faulty motor mounts or any number of--DID I MENTION IT BLEW THE DOORS OFF MY NEIGHBOR'S HOUSE???

      Sales are through the roof. Which now needs to be patched. ALL RIGHT!!! NEW SHINGLES!!!

    • by gweihir ( 88907 )

      Indeed. Gross incompetence, and maybe even criminal malfeasance.

      • Yes to the former, no to the latter. When you're a high volume customer you will have a dedicated feeder (more likely two) from the substation. There's nothing criminal about breaching a contract, and if something were to have gotten damaged here it would certainly have been the fault of the energy provider not setting up the feeder protection correctly.

        Bottom line is you as an end user downstream of any protection (be it large 110kV feeder or the service fuse on your house) can never do anything on your si

        • by dgatwood ( 11270 )

          Bottom line is you as an end user downstream of any protection (be it large 110kV feeder or the service fuse on your house) can never do anything on your side of the electrical line to be considered criminal.

          Sure you can. Intentionally connect a generator back-feeding power that's significantly out of phase.

          • by gweihir ( 88907 )

            Bottom line is you as an end user downstream of any protection (be it large 110kV feeder or the service fuse on your house) can never do anything on your side of the electrical line to be considered criminal.

            Sure you can. Intentionally connect a generator back-feeding power that's significantly out of phase.

            For example. Or probe for resonances with a switched load.

          • Sure you can. Intentionally connect a generator back-feeding power that's significantly out of phase.

            Have you done that before? Or seen it done? I have*. The outcome was sending the generator and turbine back to Siemens for a $5million repair job after the sheer coupling sheered and the bearings couldn't handle the force causing the rotor to contact the stator (luckily only slightly) in the generator. Alstrom also replaced our breaker which is standard practice for that gear when it trips.

            The utility provider? Well nothing happened to them, they just gave us a call to say a breaker on our feeder tripped op

    • Or just so "power hungry". After all, powerful means it does something with all that power. That isn't required to trip the grid. A big iron bar across a transformer will do that and it can't compute at all.
  • Poor planning (Score:5, Insightful)

    by haruchai ( 17472 ) on Sunday October 02, 2022 @12:28PM (#62931029)

    They're not the least bit embarrassed by this?

    • Re: (Score:3, Insightful)

      by Anonymous Coward

      Much like the republican party, you can’t embarrass someone without shame.

      • Tesla probably did it as a publicity stunt.

        Also much like the party.

        • by gweihir ( 88907 )

          Tesla probably did it as a publicity stunt.

          Also much like the party.

          Yep. In any reasonable country, doing this intentionally would result in a huge fine and may well result in criminal charges for endangering critical infrastructure. There is absolutely no need for this type of "test". You ask. The power station people can tell you everything, down to the impedance of the network, because they need to know in ordert o do their jobs.
           

    • by cb88 ( 1410145 )
      If they tripped the grid... that is NOT their fault. Its the power company failing to meet their contractual agreements.

      Its quite hilarious that you attempt to pin this on any entity other than CA's power grid itself which is very well known to be unstable.
  • put in the right spot. That copper pipe must be doing zillions of floating point computations per microsecond!

  • by Lohrno ( 670867 ) on Sunday October 02, 2022 @12:41PM (#62931051)

    "Hey look at us we made a computer so powerful it tripped the breakers! We are doing so much sci fi shit over here."

    But I am curious what they're doing with it.

    • by sxpert ( 139117 )

      they explained it: the main use is training their neural network models

      • by Lohrno ( 670867 )

        I'm aware, just was curious about details more than anything. I know, we'll not get them just saying.

        • by cowdung ( 702933 )

          Watch the Tesla AI Day video from last Friday. They are pretty detailed in explaining what they are doing with this.

  • Comment removed based on user account deletion
    • Comment removed based on user account deletion
    • by quonset ( 4839537 ) on Sunday October 02, 2022 @01:37PM (#62931143)

      Had you bothered to read the article from which the headline was taken word for word, you would have found this:

      Bill Chang, Tesla’s Principal System Engineer for Dojo, said during

      “We knew that we had to reexamine every aspect of the data center infrastructure in order to support our unprecedented cooling and power density.”

      They had to develop their own high-powered cooling and power system to power the Dojo cabinets.

      Chang said that Tesla tripped their local electric grid’s substation when testing the infrastructure earlier this year:

      “Earlier this year, we started load testing our power and cooling infrastructure and we were able to push it over 2 MW before we tripped our substation and got a call from the city.”

      That is hardly an endorsement or marketing for Tesla. It means they failed on a number of issues, not least of which was to notify the local power company what they (Tesla) were planning to do, didn't coordinate with said power company on the timing of load testing, didn't have a back up plan despite touting their battery backups for municipalities, and didn't have a fallback plan in case something like this should happen. That's called poor project management and a failure.

      Put another way, "We'll just fuck around and if something goes wrong say, "Oops!"" Not someone you want to do business with.

    • I wonder if it's less scandalous than it looks. Substations are presumably sized to be economical for expected peak load plus a safety margin. Adding two megawatts could break that assumption with only normal incompetence.

    • In order for Tesla to have done this they would had to remove every breaker between their system and the grid, and likewise the power company would have to have nothing in place to protect their grid from rogue users like Tesla.

      They only tripped the incoming feeder from the city substation. Literally nothing needed to be removed or changed and absolutely everything worked as expected (electrical protection wise). It was gross incompetence on the part of Tesla to draw more than their contractually agreed upon demand, and even more so that a load testing activity wasn't co-ordinated with the grid operator, but the power company very much had a thing in place and it worked exactly as designed to protect the grid from Tesla's shenanig

  • What the hell kind of measurement is that? Maybe it was a hot day, and the load was around peak already. Also, how big was the power grid? The size of a football field or a banana? Maybe it was a weak-ass sub-station? Jeeze, it has so many variables.

  • by clawsoon ( 748629 ) on Sunday October 02, 2022 @12:55PM (#62931073)

    "Engineers build road so powerful it causes a rock slide!"

    "Mr. Christie builds Oreo so powerful it breaks all the cookie-making machines in the factory!"

    "Putin starts war so powerful that it causes Russia to run out of military equipment!"

  • by nukenerd ( 172703 ) on Sunday October 02, 2022 @01:39PM (#62931145)
    This is just Tesla seeking publicity. They love it.

    Substations belong to the electricity company. A factory like Tesla's will have a substation dedicated to it, probably several. Jeez, my suburban street has a substation, it's right next to my back garden. Every substation will have some kind of fuse or circuit breaker to limit maximum current. In turn, every property served by the substation will have some kind of fuse or circuit breaker to limit maximum current, to something less than that in the substation - unless the user puts a copper rod in it. In the UK it is illegal to mess with your property's main fuse.

    In this case, allowing for confused journalists, either the substation breaker or the downstream Tesla breaker was tripped. It was Tesla's incompetence that their computer's local supply's fuse was such that didn't blow first (it did have one didn't it?).

    Believe it or not, the electricty supply company guys do monitor consumption in the various parts of their system. No doubt they saw the sudden drop in the supply to Tesla and phoned them up to ask if there was a problem - not that is would have been an issue, but because their watch is so fucking boring without these little events. I don't suppose the rest of their grid was affected in the slightest unless the design of it is really fucked up.
    • by gweihir ( 88907 )

      This is just Tesla seeking publicity. They love it.

      Only because people are stupid. What Tesla demonstrated here was _incompetence_.

    • Jeez, my suburban street has a substation, it's right next to my back garden.

      A more apt comparison between you and Tesla would be your service fuse. When you get up to shenanigans your service fuse pops. When Tesla gets up to shenanigans the substation trips their (dedicated) feeder.

    • > This is just Tesla seeking publicity. They love it.

      It was a brief line in a 4 hour technical presentation.

      The news sites which will use anything âoeElonâ or âoeTeslaâ in a headline for clicks are the ones who love it.

  • by ukoda ( 537183 ) on Sunday October 02, 2022 @01:52PM (#62931167) Homepage
    The headline and most comments are basically incorrect. It was not a computer that tripped the power grid, it was testing of the thermal handling of the cabinet and it cooling systems. Basically the power was being dumped into a dummy load so they could see how well the cabinet and it cooling systems could move out any heat generated before putting the computing hardware in there.

    Tesla was not boasting about how much power their computers used but how thermally efficient their cabinet and cooling systems were, allowing them to pack more computing power into less space without it failing.
    • by thegarbz ( 1787294 ) on Sunday October 02, 2022 @04:24PM (#62931471)

      The headline and most comments are basically incorrect.

      Most comments here are shaming Tesla for not co-ordinating a test with the power company and for drawing enough power that they tripped the feeder from their utility provider. Those comments are completely correct, because what they did was so stupid that it completely overshadows the stupid headline.

      • by ukoda ( 537183 )
        At the time of my post most comments were assuming that the Dojo supercomputer trip the power feed, not a dummy load, which is incorrect and what I took exception to. Regardless, you are right in that if they had a dummy load able to sink that kind of power level they should have been talking with their power company about their planned test as tripping an up stream breakers should never have happened.
  • The title is misleading. From the article:
    “Earlier this year, we started load testing our power and cooling infrastructure and we were able to push it over 2 MW before we tripped our substation and got a call from the city.”

    The computer did not overload the circuit, they were testing to determine how much power would be reliably available. This seems like a good idea to me as you would not want to have power problems when doing a training run on a huge dataset. Most substations from what I se
    • by Junta ( 36770 )

      No one is slamming Tesla because their computer tripped, it's that the article treats it as a badge of honor that Tesla tripped a substation.

      When you do have an operation of this scale, you coordinate with the utility. You don't unilaterally do a test and surprise the utility and *then* 'got a call from the city".

      I consult on this scale of things frequently and this sort of outcome as characterized by Tesla would be a sign of doing it wrong.

      But either way, even if it was a coordinated test done correctly a

    • we were able to push it over 2 MW before we tripped our substation and got a call from the city.

      Still a lack of coordination from Tesla. "getting a call from the city" -- WTH are you guys doing going from 0 to 2 MW in a minute without letting us know in advance? If it was a fault from the city, the factory would call the city, not the opposite. At least at my place when power goes down for no good reason, it's us (the factory) calling the electric company, asking them what is wrong on their side, and let them know we'll ask for compensations for our business losses.

    • by gweihir ( 88907 )

      This is actually terminally stupid. What you do is you ask your supplier how much power you can safely draw, because they _know_.

      If you did something as demented here, you would probably get hit with a huge fine an a credible threat to disconnect you. Endangering critical infrastructure is not something smart people do.

    • This seems like a good idea to me

      Really? Can you not think of other, possibly less sociopathic, ways in which they may have obtained this information? Even if it was not available from the utility, do you think perhaps it might be a good idea to give them a heads-up that you are attempting to intentionally break their infrastructure before doing so? Have you lived in a civilized society long, or are you new here?

  • Just because the power grid is rickety, does not make your machine powerful. Case in point: in California, we have Pacific Gas and Explosion, if you even look at the power lines wrong, they will start ablaze.
    • Pedal Generator for Electricity. When I was young, I got a tour of the SAGE site at Norton AFB. The AN/FSQ-7 computer drew upwards of 3 MW. Back to the future.
  • ...a beowulf cluster of these!
  • elon creates yet another amazing product and somehow all the top comments are nothing but derision, on a tech site no less.

    this site has become festered with the woke mob, blinded by politics.

  • Tesla Unveils New Dojo Supercomputer So Powerful It Tripped the Power Grid

    You mean a supercomputer that *uses* so much power it tripped the grid. "Powerful" in context like this implies computing power, which isn't necessarily related to power consumption -- though it often is, but there's not a hard, direct correlation..

  • It is not a great accomplishment to have a computer or really anything consume a lot of power. It is a great demonstration of incompetence though to overload a local power grid.

  • Isn't California the place with the most stressed power grid anywhere? Why not next to a hydro station in Oregon? Or in the Texas desert with ten acres of solar panels?

  • And it didn't ask permission, or say it was sorry, which shows it has studied well the lessons to be learned from politics.

  • The California grid trips when people run battery powered flashlights.
  • Something tripped, but exactly why and where there other circumstances... Was something not up to code?
  • Nikola Tesla shut down Hover Dam.
  • My old job knew 20 years ago that power density and cooling capacity were the limiters of what could be stuffed inside a big ass room. Did Elon just yell at them to rack 'em to the ceiling and let 'em rip?
  • In June 2022, the HPE Frontier at Oak Ridge Nat. Lab hit 1.102 EFLOPS with a peak of 1.685 EFLOPS, yet draws 21 MW.

    So, the Dojo drawing 2 MW for the 1.1 EFLOPS seems quite a bit impressive.

Real Programmers think better when playing Adventure or Rogue.

Working...