Amazon Web Services (AWS) says a massive overnight outage in its US-East-1 region —one of its busiest data centres—caused widespread internet disruptions across North America and beyond. The company finally restored all services by Monday at 3:01pm PT it says, sharing some more info on just what went wrong.
The outage began late Sunday night after a DNS error in Amazon’s DynamoDB database triggered cascading failures that broke more than 140 AWS services, including EC2, Lambda, CloudWatch, and SQS. Engineers worked through the night and restored full service by Monday afternoon, though some systems, like Redshift, Config, and Connect, needed more time to clear backlogs. Essentially these services power most websites and apps and any outage can cause them to not work properly.
AWS hosts a