Forgot your password?
typodupeerror
Microsoft IT

Microsoft 365 Endured 9+ Hours of Outages Thursday (crn.com) 36

Early Friday "there were nearly 113 incidents of people reporting issues with Microsoft 365 as of 1:05 a.m. ET," reports Reuters. But that's down "from over 15,890 reports at its peak a day earlier, according to Downdetector." Reuters points out the outage affected antivirus software Microsoft Defender and data governance software Microsoft Purview, while CRN notes it also impacted "a number of Microsoft 365 services" including Outlook and Exchange online: During the outage, Outlook users received a "451 4.3.2 temporary server issue" error message when attempting to send or receive email. Users did not have the ability to send and receive email through Exchange Online, including notification emails from Microsoft Viva Engage, according to the vendor. Other issues that cropped up include an inability to send and receive subscription email through [analytics platform] Microsoft Fabric, collect message traces, search within SharePoint online and Microsoft OneDrive and create chats, meetings, teams, channels or add members in Microsoft Teams...

As with past cloud outages with other vendors, even after Microsoft fixed the issues, recovery efforts by its users to return to a normal state took additional time... Microsoft confirmed in a post on X [Thursday] at 4:14 p.m. ET that it "restored the affected infrastructure to a (healthy) state" but "further load balancing is required to mitigate impact...." The company reported "residual imbalances across the environment" at 7:02 p.m., "restored access to the affected services" and stable mail flow at 12:33 a.m. Jan. 23. At that time, Microsoft still saw a "small number of remaining affected services" without full service stability. The company declared impact from the event "resolved" at 1:29 p.m. Eastern. Microsoft sent out another X post at 8:20 a.m. asking users experiencing residual issues to try "clearing local DNS caches or temporarily lowering DNS TTL values may help ensure a quicker remediation...."

Microsoft said in an admin center update that [Thursday's] outage was "caused by elevated service load resulting from reduced capacity during maintenance for a subset of North America hosted infrastructure." Furthermore, Microsoft noted that during "ongoing efforts to rebalance traffic" it introduced a "targeted load balancing configuration change intended to expedite the recovery process, which incidentally introduced additional traffic imbalances associated with persistent impact for a portion of the affected infrastructure." US itek's David Stinner said it appears that Microsoft did not have enough capacity on its backup system while doing maintenance on its main system. "It looks like the backup system was overloaded, and it brought the system down while they were still doing maintenance on the main system," he said. "That is why it took so many hours to get back up and running. If your primary system is down for maintenance and your backup system fails due to capacity issues, then it is going to take a while to get your primary system back up and running."

"This was not Microsoft's first outage of 2026," the article notes, "with the vendor handling access issues with Teams, Outlook and other M365 services on Wednesday, a Copilot issue on Jan. 15 plus an Azure outage earlier in the month..."
This discussion has been archived. No new comments can be posted.

Microsoft 365 Endured 9+ Hours of Outages Thursday

Comments Filter:
  • Nearly 113 huh... (Score:2, Insightful)

    by Anonymous Coward

    What an unusual number to approximate to.

  • Office 364 (Score:5, Funny)

    by awwshit ( 6214476 ) on Saturday January 24, 2026 @04:50PM (#65946986)

    Countdown 2026 begins!

  • by Teun ( 17872 ) on Saturday January 24, 2026 @04:59PM (#65946998)
    If this keeps up they'll have to rename it to, for example, Microsoft 364 (or 363, 362 etc.)
  • "there were nearly 113 incidents of people reporting issues with Microsoft 365 as of 1:05 a.m. ET," reports Reuters. But that's down "from over 15,890 reports at its peak a day earlier, according to Downdetector."

    Only a few more customers to drive away. Then you can put this on the shelf next to the Zune.

  • by mwa ( 26272 ) on Saturday January 24, 2026 @05:38PM (#65947068)

    It was a "targeted load balancing" change. Just like ICE's targeted operations, the new usage means "We fucked up but we're never going to admit it.

    • Is the missing word 'mistakenly' or 'incorrectly' or 'unfortunate' before 'targeted'?

  • Hyperdysfunction (Score:5, Interesting)

    by myootnt ( 2529336 ) on Saturday January 24, 2026 @06:27PM (#65947134)
    Imagine working in an organization that believes hyperconvergence is the key to efficiency and cost savings. Email? Microsoft, hosted. Team collaboration? Microsoft, hosted. Telephone? Microsoft, hosted. Storage? Microsoft, hosted. Compute? Microsoft, hosted. DHCP? Microsoft, hosted. DNS? Microsoft, hosted. On and on. You get the idea. Luckily, I'm the network guy and can just say "It's Microsoft. Again" over OOB channels, of course, because what wasn't working? Every in band communication channel.
  • by gweihir ( 88907 ) on Saturday January 24, 2026 @06:55PM (#65947178)

    I guess they were so keen to get everybody locked in because they knew they could not deliver. Expect things to get worse, far worse.

    • Microsoft has the goal of becoming too essential to fail so they can charge anything they like.

      • Microsoft has the goal of becoming too essential to fail so they can charge anything they like.

        They're well on their way. Give it another couple years and the government will have to pay them a few billion dollars every time they have an outtage. That's the big business dream. Too big to fail = government handouts for failure.

  • This is yet another reason why I'm glad I don't have any M$ products. Now if I could just get my employer to agree...

  • by Temkin ( 112574 )

    Clearing caches, ok... But... And maybe its because I'm just a BIND 9.x guy... How do I lower the TTL for a Microslop hosted record? Even for records I host, changing the TTL doesn't force the remote caches to expire early.

    T

  • by guygo ( 894298 ) on Sunday January 25, 2026 @11:43AM (#65947952)

    "The Enshittification will continue until morale improves!"

  • Install locally, save locally.

    Trusting the cloud opens you to all sorts of miserable prospects. Sure, clouds are fine for archiving. But using them as a primary storage container or relying on them for the most basic applications that drive your business?

    LOLS, what idiots.

It is better to never have tried anything than to have tried something and failed. - motto of jerks, weenies and losers everywhere

Working...