---
title: "Episode 22: The Chaos Engineering experiment that is us-east-1"
id: "9833"
type: "podcast"
slug: "episode-22-the-chaos-engineering-experiment-that-is-us-east-1"
published_at: "2018-08-08T06:00:00+00:00"
modified_at: "2023-03-06T18:42:02+00:00"
url: "https://www.lastweekinaws.com/podcast/screaming-in-the-cloud/episode-22-the-chaos-engineering-experiment-that-is-us-east-1/"
markdown_url: "https://www.lastweekinaws.com/podcast/screaming-in-the-cloud/episode-22-the-chaos-engineering-experiment-that-is-us-east-1.md"
taxonomy_shows:
  - "Screaming in the Cloud"
---

## About the Author

Corey is the Chief Cloud Economist at Duckbill, where he specializes in helping companies improve their AWS bills by making them smaller and less horrifying. He also hosts the "Screaming in the Cloud" and "AWS Morning Brief" podcasts and curates "Last Week in AWS," a weekly newsletter summarizing the latest in AWS news, blogs, and tools, sprinkled with snark and thoughtful analysis in roughly equal measure.

[https://podcasts.apple.com/us/podcast/screaming-in-the-cloud/id1361244178](https://podcasts.apple.com/us/podcast/screaming-in-the-cloud/id1361244178)

[https://overcast.fm/itunes1361244178/screaming-in-the-cloud](https://overcast.fm/itunes1361244178/screaming-in-the-cloud)

[https://pca.st/7l2e](https://pca.st/7l2e)

[https://open.spotify.com/show/3fBA9eNkGliCzp3Xuy1GVd](https://open.spotify.com/show/3fBA9eNkGliCzp3Xuy1GVd)

[https://feeds.transistor.fm/screaming-in-the-cloud](https://feeds.transistor.fm/screaming-in-the-cloud)

## Episode Summary

Trying to convince a company to embrace the theory and idea of Chaos Engineering is an uphill battle. When a site keeps breaking, Gremlin’s plan involves breaking things intentionally. How do you introduce chaos as a step toward making things better?

Today, we’re talking to Ho Ming Li, lead solutions architect at Gremlin. He takes a strategic approach to deliver holistic solutions, often diving into the intersection of people, process, business, and technology. His goal is to enable everyone to build more resilient software by means of Chaos Engineering practices.

## Episode Show Notes & Transcript

Trying to convince a company to embrace the theory and idea of Chaos Engineering is an uphill battle. When a site keeps breaking, Gremlin’s plan involves breaking things intentionally. How do you introduce chaos as a step toward making things better?

Today, we’re talking to Ho Ming Li, lead solutions architect at Gremlin. He takes a strategic approach to deliver holistic solutions, often diving into the intersection of people, process, business, and technology. His goal is to enable everyone to build more resilient software by means of Chaos Engineering practices.

Some of the highlights of the show include:

- Ho Ming Li previously worked as a technical account manager (TAM) at Amazon Web Services (AWS) to offer guidance on architectural/operational best practices
- Differences between the solutions architect and TAM roles at AWS, and the transition between them
- Role of TAM as the voice and face of AWS for customers
- Ultimate goal is to bring services back up and make sure customers are happy
- Amazon Leadership Principles: Mutually beneficial to have the customer get what they want, be happy with the service, and achieve success with the customer
- Chaos Engineering isn’t about breaking things to prove a point
- Chaos Engineering takes a scientific approach
- Other than during carefully staged DR exercises, DR plans usually don’t work
- Availability Theater: A passive data center is not enough; you have to exercise the DR plan
- Chaos Engineering brings failure testing down to a level where you exercise it regularly to build resiliency
- Start small when dealing with availability
- Chaos Engineering is a journey of verifying, validating, and catching surprises in a safe environment
- Get started with Chaos Engineering by asking: What could go wrong?
- Embrace failure and prepare for it; business process resilience
- Gremlin’s GameDay and Chaos Conf allow people to share experiences

Links:

- [Ho Ming Li on Twitter](https://twitter.com/horeal?lang=en)
- [Gremlin](https://www.gremlin.com/)
- [Gremlin on Twitter](https://twitter.com/GremlinInc)
- [Gremlin on Facebook](https://www.facebook.com/gremlininc/)
- [Gremlin on Instagram](https://www.instagram.com/thegremlininc/)
- [Gremlin: It’s GameDay](https://www.gremlin.com/gameday/)
- [Chaos Engineering Slack](https://gremlin.com/slack)
- [Chaos Conf](https://chaosconf.splashthat.com/)
- [Amazon Leadership Principles](https://www.amazon.jobs/principles)
- [Adrian Cockcroft and Availability Theater](https://aws.amazon.com/blogs/opensource/chaos-engineering-meetups/)
- [Digital Ocean](https://do.co/screaming)


