Join us and learn how to build confidence in your system by creating disruptive events.
Join us at AWS Resilience and Chaos Engineering Day to dive into six technical sessions focused on ensuring that your workload will operate as intended and deliver the resilience you expect. Hear from leading experts on how to build with resilience in mind and how to use chaos engineering to gain confidence in the system’s capability to withstand turbulent conditions in production.
Who should attend?
Whether new to AWS or an experienced user, you'll learn about building resilience and verification through chaos engineering. The technical content and use-case demos are ideal if you are a developer, system administrator, solutions architect, or engineer who wants to gain confidence in your system’s capability to withstand turbulent conditions or learn the latest resilience and chaos engineering trends.
Join Gunnar Grosch for the opening keynote of AWS Resilience and Chaos Engineering Day. He will show why chaos engineering has emerged as an essential practice for builders striving for resilience, how we see chaos engineering used today and outline what you will learn from this one-day event.
Gunnar Grosch, Sr Developer Advocate, AWS
Getting started with AWS Fault Injection Simulator
AWS Fault Injection Simulator is a fully managed chaos engineering service that helps you improve application resiliency by making it easy and safe to perform controlled chaos engineering experiments on AWS. In this session, see an overview of chaos engineering and AWS Fault Injection Simulator, and then see a demo of how to use AWS Fault Injection Simulator to make applications more resilient to failure.
Rohini Gaonkar, Sr Developer Advocate, AWS
Testing resiliency using chaos engineering
In this session, you learn how to use chaos engineering to set up failure injection testing and validate the resiliency of your service. Referencing the AWS Well-Architected Framework, you learn about the design principles and AWS resources that help ensure a resilient architecture. Then, with a three-tier architecture, this session demonstrates how to implement automation to simulate various failures, from the loss of a single Amazon EC2 server to the loss of an entire Availability Zone. The session focuses on the impacts of these failures and the resources that AWS provides to maintain high service availability and short time to recovery.
Seth Eliot, Principal Reliability Solutions Architect, AWS
Achieving Continuous Resilience in DevOps through ML and AI
How do you achieve Continuous Resilience in a DevOps stack? In this talk we will show you how Prime Video created technology that forecasts capacity, tunes auto-scaling and automates scale & chaos testing. We will also talk about how we use AI and ML to achieve continuous Resilience and the work we are doing for the next level of DevOps tooling for Incident Management.
Sudeepa Prakash, Sr Product Manager, Prime Video
Ali Jalali, Sr Applied Scientist, Prime Video
Pavel Vasilev, SDE, Prime Video
Geoff Robinson, Principal TPM, Prime Video
Improve Resilience with Controlled Chaos Engineering
The transition into more complex systems is accelerating, and chaos engineering has proved to be a great-to-have option in our toolbox to handle this complexity. This session will dive deeper into using chaos engineering with AWS Fault Injection Simulator to improve our application's resilience and performance. We will also look at how automated chaos experiments help us cover a more extensive set of experiments than we can cover manually and how it allows us to verify our assumptions over time as unknown parts of the system change.
Gunnar Grosch, Sr Developer Advocate, AWS
Chaos Engineering for Serverless Architectures
Serverless services offer scalability and flexibility to application teams without the overhead of managing servers. The abstraction offered by serverless services can pose a challenge for chaos engineering techniques because you cannot directly inject faults into the control plane or data plane for the service. But, using tools like failure-lambda and network proxies, can give application teams the ability to simulate complete failure or degraded service levels for serverless capabilities. In this session we will focus on applying the principles of chaos engineering to a serverless architecture. We will demonstrate techniques that application teams can use to enhance their chaos engineering experiments when applied to serverless architectures.
Jason Barto, Principal Solutions Architect, AWS