This Meetup features talks by J Paul Reed & James Cunningham.
Doors open at 6:30pm. Catch up with other quantifiers over food & drinks. Talks start at 7:00pm & end at 8:00pm. Space is limited, so please RSVP.
This event will be live-streamed at heavybit.com/live/.
Detecting Whispers in Chaos
J Paul Reed, Managing Partner at Release Engineering Approaches
In this talk, we'll look at what decades of research in the safety sciences have to say about humans interacting with & operating complex socio-technical systems, including what aircraft carriers have to do with Internet infrastructure operations, how resilience engineering can help us, & the use of heuristics in incident response. All of these provide insight into ways we can improve one of the most advanced and most effective monitoring tools we have available to keep those systems running: ourselves.
Learn more about Paul here: http://jpaulreed.com/
Vetting your Pager
James Cunningham, Operations Engineer at Sentry
Sentry (sentry.io) receives a million requests a minute to process & store crashes from all around the world. It's the Operations Team's responsibility to make sure everything goes right, but it's also their responsibility not to burn themselves out when things go wrong.
Sentry collects fifty thousand custom metrics inside of DataDog, but alerts on fewer than fifty of them. James leads Sentry's observability initiative, creating & maintaining those alerts.
Learn about the lifecycle of an alert at Sentry, including:
How a variety of metrics are collected efficiently
How Sentry justifies a metric's degree of accuracy
Why a metric's logical purpose is defined
How alerts evolve from metrics, each articulating why it exists
When an engineer actually gets paged & what they're instructed to do