
Amazon outage postmortem


Batman_fan


http://www.businessinsider.com/amazon-aws-internet-outage-caused-by-engineer-typing-wrong-command-2017-3

Amazon took down parts of the internet because an employee fat-fingered the wrong command

 

Those servers supported other AWS products and services, and it caused a chain reaction, which meant that certain critical systems had to be rebooted — and while they were restarting, Amazon's S3 wasn't working as normal.

At one point the "dashboard," where Amazon tells its users which of its services are operational, wasn't working because of the S3 issue.

The issue wasn't an "outage" because the entire system wasn't down, only some services. Amazon said that it has included new safeguards so that it won't happen again.

Finally, we want to apologize for the impact this event caused for our customers. While we are proud of our long track record of availability with Amazon S3, we know how critical this service is to our customers, their applications and end users, and their businesses. We will do everything we can to learn from this event and use it to improve our availability even further.


1 hour ago, karthikn said:

Wonder how many got fired.. their families, damn..

nobody will get fired..

 

it started because of a typo in the command.. which brought down all the buckets.

 
