AWS outage reminds us that the fate of the internet lies in the hands of the few


Back in the day, we ran websites off personal and corporate servers, usually located within our homes and offices. As the internet grew, we built server racks, co-locations and datacenters. Eventually, though, businesses and services of all sizes offloaded server efforts to third parties—or as they’re known now, cloud services.
The logic is solid. We live in homes, but do not physically build our own houses. The act of serving and scaling websites is not core to the service they provide. Well, it sort of is in that without servers there is no service. But the server is running through APIs, scripts, and other algorithms and programs developed by the company to deliver things like your Netflix stream, the details of your Coinbase wallet account, or the next Tinder prospect.
The ability of cloud services like Amazon Web Services (AWS) and Microsoft’s Azure to, if you pay enough, rapidly scale up (or down, as needed) makes them a smart strategic decision for any business of any size. You never know, for instance, when a small business is going to balloon into a big one and when it needs to service 10,000 simultaneous users instead of 500.
That’s the obvious upside of Cloud-based web services. The downside is what happened this week with AWS.
AWS outage
Tuesday afternoon, huge chunks of AWS crumbled. The AWS Health Dashboard provides a nice play-by-play of the nearly seven-hour outage. At the heart of it was not, at least according to Amazon, an attack, hack, or Denial of Service (DDoS) assault. It was a pair of misbehaving APIs in one sector of the massive service.
We all live in fear of a major DDoS or hack breaching these systems (really any system we rely on) and bringing them to their knees, but that’s rarely the case. When Cloudflare went down in 2019, it was initially assumed to be an attack on its system. However, we soon found out that it was just a bad software deployment, essentially human error.
Even with the AWS outage contained to what Amazon calls “US-EAST-1 Region,” the impact was significant and widespread. It was felt across consumer-facing platforms like Disney+ and, naturally, Amazon.com and some Alexa services.
When I posted the ongoing news on Twitter, I noticed how many people virtually slapped their heads and exclaimed, “That’s why [insert service] was out!”
It occurred to me that many of these users had no idea that AWS sits behind their favorite consumer and business systems. No one, by the way, has the exact number (outside Amazon), but recent reports claim AWS serves millions. Microsoft’s Azure also reports millions of users and the majority of Fortune 500 companies. Google Cloud has big names like Verizon, NewsCorp and Facebook.
Does something need to change?
The widespread use of cloud services is not a bad thing, though the lack of insight can lead to confusion and finger-pointing, like the guy who couldn’t amend orders in his system and got multiple error messages blaming his own systems (and not a third-party provider like AWS).
The combination of cloud systems’ wide reach and general lack of information and real-time feedback to affected customers is cause for some concern. The scale of one any one outage is probably cause for alarm, especially as we consider the inevitable next one.
Gone are the days when someone’s server rack goes down and one website hiccups. Now we have small failures in big cloud systems like AWS, Axure and Cloudflare that trigger a tsunami of outages.
One person on Twitter asked, “What happened to scaling and load balancing?” It’s a fair question. AWS is built on hundreds of separate cloud server clusters and has tons of redundancies, scaling, and load balancing. And still, sometimes, it isn’t enough. Complex systems can misbehave and are especially vulnerable to software updates that can collide with ageing code. For as powerful and distributed as all these cloud services are, AWS included, they’re still programmed, run, and serviced by fallible humans.
So how do we better inform the public and, more importantly, protect AWS, Azure, Cloudflare, and others from these kinds of errors, ones that lead not only to downed sites and services but the loss of millions of dollars?
It may be time to step back and look at cloud systems integrity, security, in the same way we watch out for water systems. None of them are too big to fail, it seems, but all are too important to damage, violate, or lose.
Back in the day, we ran websites off personal and corporate servers, usually located within our homes and offices. As the internet grew, we built server racks, co-locations and datacenters. Eventually, though, businesses and services of all sizes offloaded server efforts to third parties—or as they’re known now, cloud services.…
Recent Posts
- AT&T will let you split your bill with people on your plan
- Sandisk’s revolutionary new memory promises DRAM-like performance, 4X capacity at half the price
- The Best Meta Quest Games You Can Play Right Now (2025)
- ASUS is making a ‘Fragrance Mouse,’ and it’s coming to the US
- Lost Records: Bloom & Rage blends its teen drama with a heavy dose of ’90s nostalgia
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010