This presentation was recorded at YOW! 2022. #GOTOcon #YOW

Liz Fong-Jones - Field CTO at @lizthegrey

ABSTRACT
Setting a Service Level Objective for your service is only the start of your quantified reliability journey. What do you do when you've had a few too many incidents and blown your error budget? Or had a pile of near-misses that burned the team out even though the user-facing SLO wasn't violated? What if the incident trigger was the infrastructure refactoring meant to improve, not harm, reliability & maintainability?

In this talk, you'll learn how the team at Honeycomb handles incidents, chaos engineering, and the engineering feedback loop for reliability with social practices and architectural design. [...]

TIMECODES
00:00 Intro
02:18 Our confidence recipe
02:56 Measuring reliability
06:28 How to stay within SLO
09:23 Validating our expectations
13:44 Experimenting in prod
19:19 Not every experiment succeeds
26:46 Fast & reliable: Pick both!
29:05 Outro
29:36 Q&A

RECOMMENDED BOOKS
Charity Majors, Liz Fong-Jones & George Miranda • Observability Engineering
Kelly Shortridge & Aaron Rinehart • Security Chaos Engineering
Nora Jones & Casey Rosenthal • Chaos Engineering
Mikolaj Pawlikowski • Chaos Engineering
Russ Miles • Learning Chaos Engineering

#SLO #Observability #ServiceLevelObjective #SRE #ChaosEngineering #Reliability