Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
Technology Hardware IT

AMD Confirms It's Issuing a Fix To Stop New Ryzen Processors From Crashing Desktops (digitaltrends.com) 113

AMD says the company has been able to figure out why FMA3 code is causing system hangs on PCs using a new Ryzen desktop processor. From a report: Although AMD didn't provide a detailed report on the problem's root cause, the company said that BIOS changes will be distributed to motherboard manufacturers to resolve the issue. Customers are encouraged to keep an eye on their motherboard vendor's website for an update. "We are aware of select instances where FMA code can result in a system hang," the company said. "We have identified the root cause." AMD released three Ryzen-branded desktop processors at the beginning of March that plug into motherboards based on AMD's new AM4 socket. The trio of processors include the Ryzen 7 1800X, the Ryzen 7 1700X, and the Ryzen 7 1700. However, all three reportedly cause a hard system lock when running certain FMA3 workloads. The problem was replicated across all three processors and a variety of motherboards.
This discussion has been archived. No new comments can be posted.

AMD Confirms It's Issuing a Fix To Stop New Ryzen Processors From Crashing Desktops

Comments Filter:
  • by Penguinisto ( 415985 ) on Tuesday March 21, 2017 @04:11PM (#54083869) Journal

    Kidding, but I'm old enough to remember when running Windows 95 on the old AMD K6 boxen was a no-go...

    That said, does this fix affect performance any (no matter the OS)?

    • by Shinobi ( 19308 ) on Tuesday March 21, 2017 @04:23PM (#54083953)

      K5 had a bug like that too. And let's not forget Athlon/Athlon XP era and AGP issues. The Piledriver Opterons required a patch to fix a bug in the hypervisor system which allowed for escape from a VM. AMD has had just as many horrible bugs as Intel, which can be summed up like this: Making flawless high-performance chips is difficult.

      • Don't remember the agp issue, but vesa local bus was a pain in the butt
      • AFAIK Athlon/Athlon XP had no such bugs, the issues were bugs in the VIA chipsets. AMD's chipsets worked fine.

        I don't remember any bugs on Piledriver. I do remember that Bulldozer had a TLB bug though. The workaround was to disable the TLBs and it significantly hampered performance.

    • by Moof123 ( 1292134 ) on Tuesday March 21, 2017 @04:24PM (#54083957)

      Look at an intel errata list some time. There are huge numbers of bugs in all CPU's in recent times. Bios patches trap the errant instructions and use a work around. Nothing really to see here. I've had several intel instabilities get resolved with a bios flash. It is yet another reminder to always wait a few months after major revisions for the dust to settle unless your goal is to actually be an early adopter for the hell of it.

      • Since there are no X370 motherboards available, we're going to have to wait a few months anyway.
        • I've had multiple in my hands at my local Frys Electronics here in las vegas. I would have to say their available. The Asus x370 Prime, and the white MSI board with the dual m.2 are in stock as we speak.

      • AMD's issue here isn't necessarily that they have more of these problems or that the problems are more serious. It's that they have a reputation for having more of these problems and for them being more serious when they happen.

        A lot of people, self included to a degree (though I do try to counter it) have picked up trust issues around AMD products over the years. In many cases, including mine, that may well be because we tried running AMD CPUs which just run a bit hotter than the Intel equivalents with coo

        • by Anonymous Coward

          It doesn't matter how hoy the CPUs get. AMD has damn critical bugs in their chip microcode. They have a long history of failing to catch microcode issues in emulation before chip tape out. It must be horribly embarrassing for the engineers that work there.

        • by wbr1 ( 2538558 )
          I have been running 3-5 AMD FX series (various revisions, bulldozer, and up) at slight overclocks (100 to 200mhz), on stock AMD coolers for years. The only stability problems I have had were related to buggy drivers and bad RAM, not the CPU/overclock. Have not had a single one die either. Not sure what you are doing but in my experience that is pretty abnormal.
        • by robinsc ( 84714 )

          Remember intel inside can't divide ....

        • Really? How about using the damned stock cooler? That's what I've done and had no issues.

      • And some of them were security critical. Remember having to disable hyperthreading because it let one thread snoop on another's cache? Or having to disable transactional memory because it allowed arbitrary physical memory manipulation from an untrusted process.
        • by Shinobi ( 19308 )

          That's been the case on the AMD side too, like the hypervisor bug in Piledriver Opterons for example.

      • Sage advice for nearly every industry.

        Never buy the first model year of a new car design. You're bound to get engine or transmission issues at a higher rate than after they've worked things out a bit.

        Never buy the first release of a new game console. Just look at the XB360 RROD, or the PS1's CD-ROM issues.

        Never pre-order or buy day-one titles. Look at Ubisoft's Assassin's Creed (Brotherhood?) issues. Or Batman Arkham Knight on PC - with 2 stop sales and recalls/refunds. Or really any

    • by phorm ( 591458 )

      I remember it working but needing some weird patch of config change on Win95.
      On Win98 it worked without any special hacking needed.

    • Kidding, but I'm old enough to remember when running Windows 95 on the old AMD K6 boxen was a no-go...

      That said, does this fix affect performance any (no matter the OS)?

      I never had any issue or quirks with Win95 on my K6. Though that might have been second stepping.

    • And lets not forget the Bulldozer bug, I have been putting up with bluescreens and freezing for years because Gigabyte failed to produce a BIOS update for something that had a fix from AMD also: https://scalibq.wordpress.com/... [wordpress.com] I have a new system now so not longer have this issue, but it surely was frustrating to get it blue screen several times a day.
    • A+BxC in one instruction is the fix. Apparently this instruction fails when the Ryzen is at some power saving stage.

      I believe it is a bug in the Ryzen Microcode, that can be changed by modifying the motherboard. In otherwords, fix the MB for now and fix the instruction in the Ryzen ASAP. I believe there will be a microcode patch for Windows and Linux and whatever to fix the CPU at some near distant future.
      By the way, my CPU has an INTEL patch to correct faulty instructions. It should not happen

  • by Anonymous Coward

    C'mon Slashdot, give us at least some clue instead of simply copying and pasting someone else's article.

    • Apparently, I had to look into the forum posts that the FA referenced, and FMA instructions are Fused Multiply Add, whatever the fuck that is.
      • It's like a teleporter accident gone horribly right.
      • by Baloroth ( 2370816 ) on Tuesday March 21, 2017 @04:41PM (#54084057)

        Apparently, I had to look into the forum posts that the FA referenced, and FMA instructions are Fused Multiply Add, whatever the fuck that is.

        After looking at Wikipedia for 5 seconds, FMA instructions perform round(a+b*c) in a single operation, so you can a) speed up and b) get more accurate results whenever you need such a mathematical operation (which is actually reasonably frequently, in numerical computing).

      • > FMA instructions are Fused Multiply Add, whatever the fuck that is.

        Really? You're posting on /. and you can't google it?

        * Multiply-Accumulate [wikipedia.org]

        This commonly shows up when you are lerping (linear interpolating) between two values, a and b, you have a interpolation parameter usually called t:

        x = a + (b-a)*t

        Compilers will see this pattern and generate a FMA instruction for it, or you can write your own.

        Copying the code snippet from this StackOverflow Question: [stackoverflow.com] How to use Fused Multiply-Add (FMA) instruct

        • by Anonymous Coward
          The point isn't 'can I google it'. It's 'why the fuck should I have to?' I doubt more than a tiny minority of Slashdot readers already knew what FMA meant, so defining it in the summary would have taken 5 seconds and saved lots of aggravation.
        • No it won't. You'll get a link error.
    • by godrik ( 1287354 ) on Tuesday March 21, 2017 @04:50PM (#54084111)

      FMA instructions are Fused Multiply Add instruction. Usually their are on SIMD registers and allow you to do "a += b *c". Modern cores can do that on a vector in a single cycle. Actually, they may be able to do more than one FMA on a vector register in a single cycle.

      FMA are most commonly used to compute dot product, and are therefore very helpful in linear algebra. (And so they are useful in a ton of data mining algorithms.)

      • by tlhIngan ( 30335 ) <slashdot.worf@net> on Tuesday March 21, 2017 @05:53PM (#54084589)

        FMA are most commonly used to compute dot product, and are therefore very helpful in linear algebra. (And so they are useful in a ton of data mining algorithms.)

        Also known as the Multiply-Accumulate (MAC) instruction in DSPs. MAC is an extremely common instruction in signal processing kernels (the inner loop that does the calculations). It is vital to be able to do a lot of them per clock cycle. In fact, it's often why DSPs have special looping registers so you can do zero-overhead loops and thus doing a sequence of MACs without incurring branch (and branch prediction) times thus being able to do nothing but this instruction for very little overhead

  • So no one noticed this during testing? A system hang? Really? This isn't FDIV, you might not notice that.
    • by JustNiz ( 692889 )

      Yes that was exactly what I was wondering too. It seems to be clear evidence of AMD conducting only superficial testing prior to release.

  • Most products released these days aren't really ready to be released. (A trend made popular by our friends at Microsoft). It's sad that AMD in all likelihood rushed this CPU release. I'm going to way 3-6 months before trying out this CPU.
    • by JustNiz ( 692889 )

      Why bother? Just do what time and again I've found works best for a trouble-free life: totally avoid any/all AMD CPUs or GPUs.

      • by mark-t ( 151149 )

        That's only trouble-free for people for whom the price of Intel compared to AMD is already not a concern.

        So in other words, what you really mean is that what works best for a trouble-free life is to just be rich enough that you can buy your way past any problems.

        And who can argue with that?

        • by JustNiz ( 692889 )

          I've personally found time and again that buying cheap is nearly always a false economy, since you need to service it far more often and/or replace it far sooner (Sometimes even immediately) compared to a quality/reliable product. Espcially one with some headroom to meet future needs.
          It reminds me of a (Discworld) story of a poor Policeman (Sam Grimes) who could only ever afford $1 boots on his salary, and he got through 3 pairs a year. He married into a rich family so could finally afford a pair of $5 boot

          • by mark-t ( 151149 )
            Nice analogy.... except it doesn't apply here. The amount of time that I go before upgrading my CPU, which in my experience tends to amount to buying virtually an entirely new computer system (CPU, Mobo, memory, and often a video card, and sometimes even a new case and power supply), is about 2 to 3 years. While you might conjecture that I could upgrade slightly less frequently if I were to buy Intel, I doubt somehow doubt I'd be slowing down my upgrading to every 6 to 10 years, which is about how long I
            • You must be doing some hard core.. something? I have only bought amd for decades and my personal upgrade path is every 6-10 years.. I may upgrade my video card every 3-4 years, but beyond that it just seems like a total waste of money.. but then.. I abhor first person shooters and pvp..
              • i upgrade often, i choose AMD because im not rich and its my hobby. Intel until now has has a better product. Ryzen has caught AMD up, Hopefully VEGA does the same. I honestly dont have the money right now for a full Ryzen upgrade, so im not worried. But its a new architecture so we already knew there would be problems on the motherboard and cpu side. so im kind of glad that i have to wait. Its stupid to be biased. You make stupid decisions.

          • by Shinobi ( 19308 )

            The Sam Vimes theory of economy has a lot of basis in reality.

            Look at real quality furniture for example. I inherited a kitchen work table that's been in the family for over 120 years. It's been sanded down and re-polished a number of times, but it's still a large thick slab of wood on REALLY sturdy legs. Great for baking or cooking in large batches etc(like say 10 plates of various rolls, bisquits and cookies. At the time it was made, it would have cost a working class family more than a year and a half's

            • by JustNiz ( 692889 )

              >> Sure, I could buy something that would last us maybe 20-25 years at best

              It seems to me that even expensive furniture (at least in the US) is made with well-disguised cheapest possible materials such as composites and laminates, so actually only good for about 10 years at most.

      • by Carewolf ( 581105 ) on Tuesday March 21, 2017 @05:48PM (#54084553) Homepage

        Why bother? Just do what time and again I've found works best for a trouble-free life: totally avoid any/all AMD CPUs or GPUs.

        Heh, if you want to avoid bugs, better stay away from Intel also. They have a 1000 page errata list with every single processor.

  • by Anonymous Coward

    Hey AMD, it's because of your Platform Security Processor (PSP) backdoor, aka Intel's equivalent of Intel Management Engine. You know, you should really get rid of it. You really should.

It is easier to write an incorrect program than understand a correct one.

Working...