Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
×
Intel Networking Bug

Intel Gigabit NIC Packet of Death 137

An anonymous reader sends this quote from a blog post about a very odd technical issue and some clever debugging: "Packets of death. I started calling them that because that’s exactly what they are. ... This customer location, for some reason or another, could predictably bring down the ethernet controller with voice traffic on their network. Let me elaborate on that for a second. When I say “bring down” an ethernet controller I mean BRING DOWN an ethernet controller. The system and ethernet interfaces would appear fine and then after a random amount of traffic the interface would report a hardware error (lost communication with PHY) and lose link. Literally the link lights on the switch and interface would go out. It was dead. Nothing but a power cycle would bring it back. ... While debugging with this very patient reseller I started stopping the packet captures as soon as the interface dropped. Eventually I caught on to a pattern: the last packet out of the interface was always a 100 Trying provisional response, and it was always a specific length. Not only that, I ended up tracing this (Asterisk) response to a specific phone manufacturer’s INVITE. ... With a modified HTTP server configured to generate the data at byte value (based on headers, host, etc) you could easily configure an HTTP 200 response to contain the packet of death — and kill client machines behind firewalls!"
This discussion has been archived. No new comments can be posted.

Intel Gigabit NIC Packet of Death

Comments Filter:
  • Ouch (Score:5, Insightful)

    by Anonymous Coward on Wednesday February 06, 2013 @05:07PM (#42813479)

    I think an actual summary would have been a vast improvement over TFS.

  • by eksith ( 2776419 ) on Wednesday February 06, 2013 @05:10PM (#42813537) Homepage

    Whether it's your brand of switch, motherboard or even memory, never have the same across all machines if you can help it. The only time I'd recommend the same brand would be hard drives (due to concurrency issues), but then at least try go get them from different batches. If your lot of mobos will only handle one brand of memory for whatever reason even when cas latency is identical, then have two machines doing whatever it is you need to be doing.

    One kind of anything makes it easier to kill you swiftly in the end, whether it's by a ping of death or a biological disease.

  • by Anonymous Coward on Wednesday February 06, 2013 @05:13PM (#42813569)

    That's the dumbest sh*t I've ever heard .. idiot

  • by vlm ( 69642 ) on Wednesday February 06, 2013 @05:19PM (#42813647)

    the drivers are certified to work

    LOL this is a firmware bug, you can lock up the hardware even with no OS booted. Hilarious.

    and you get real support

    Yeah I love being told to reinstall windows on my linux boxes. Those guys sure are helpful !

  • by localman57 ( 1340533 ) on Wednesday February 06, 2013 @05:25PM (#42813717)
    It's actually a pretty good write up with a nice trace of his troubleshooting. If my customers gave me bug reports that included 10th of the level of detail he does in the article, i'd be over the moon.
  • Re:Ouch (Score:5, Insightful)

    by whois ( 27479 ) on Wednesday February 06, 2013 @05:26PM (#42813721) Homepage

    It's pretty bad even by slashdot standards:

    'Let me elaborate on that for a second. When I say “bring down” an ethernet controller I mean BRING DOWN an ethernet controller.'

    This statement is worse than useless, it's a waste of space and a waste of your time to read it (I'm sorry I quoted it). The next sentence is okay but then they go back to 'Literally the link lights on the switch and interface would go out. It was dead.'

    Literally, this is a waste of the word literally. And it being dead was implied by everything stated above. The rest is informative but still in a conversational style that makes it hard to read, and it's lacking in details such as:

    What model of Ethernet controller was tested. What Firmware version are they using? Has the problem been reported to Intel?

  • by PRMan ( 959735 ) on Wednesday February 06, 2013 @05:32PM (#42813777)

    or just buy premade servers from dell or HP. they aren't that much more expensive, the drivers are certified to work and you get real support

    ...and you're guaranteed that every shipment will have radically different hardware, despite having identical model numbers.

  • by omnichad ( 1198475 ) on Wednesday February 06, 2013 @06:01PM (#42814131) Homepage

    That's not the same bug. I'd explain, but that's what you get for saying "I wish this guy had done his homework."

  • Replacements (Score:3, Insightful)

    by phorm ( 591458 ) on Wednesday February 06, 2013 @06:39PM (#42814561) Journal

    Errrr, no. Have you ever tried to deal with replacements and/or issues within a large organization where everything is different? It's hellish.

    Try tracking an issue across an enterprise of architecture when all the architecture is DIFFERENT. You also don't want to mix RAM, and drivers can be a real b**** for different motherboards. Oh, and RMA's things, not fun.

    Different brands of RAM. Yeah, you try a rack full of servers playing mix'n'match and see how well that works.

    Lastly... how many vendors/brands of enterprise gear do you think are out there, and for the ones that do exist how well do you think they talk together. Maybe you're happy mixing HP Procurves with your Cisco stuff but I don't recommend it, and for some stuff there aren't a lot of vendors to choose from anyhow.

  • by countach ( 534280 ) on Wednesday February 06, 2013 @08:13PM (#42815477)

    It's just an Intel support strategy. Release NICs with random and minor outward differences. When you have a support issue, say that it is counterfeit. Really cuts support costs!

  • by TheLink ( 130905 ) on Wednesday February 06, 2013 @11:22PM (#42816869) Journal
    Hardware is just what you call something YOU don't configure/patch much even if someone else does :).

    To a PHB everything might be hardware. To a HDD maker HDDs aren't hardware, same for CPU makers and their CPUs.
  • by Cramer ( 69040 ) on Thursday February 07, 2013 @02:56AM (#42817829) Homepage

    And where do I get this mythical "firmware update" for the NIC CHIP? I'm sure the chip has code in it, but I've never even heard of a utility from Intel to update the in-chip code in a nic. (it's called "microcode", not firmware)

BLISS is ignorance.

Working...