The DevOps movement has not only influenced the tools we use in modern development and operations engineering, but also how we work....
Join this interactive webinar!
Jason Yee is a technical writer and evangelist at Datadog, where he works to inspire developers and ops engineers with the power of metrics and monitoring. Previously, he was the community manager for DevOps & Performance at O’Reilly Media and a software engineer at MongoDB. When he’s not speaking at conferences or helping organize them, he likes to spend time on planes “travel hacking” and hunting for interesting regional whiskey.
The DevOps movement has not only influenced the tools we use in modern development and operations engineering, but also how we work. As part of how we work, DevOps has changed how we respond when systems inevitably stop working or don't work as expected. This presentation will provide methods and techniques for gathering information and effectively using that information to avoid and mitigate failure in the future.
I'll cover best practices for gathering systems-related data, including monitoring and logging. This presentation will also cover practices for gathering and recording people-related data; including methods we can adopt from police, accident investigators, and other safety management professions to learn the most from incidents.
After discussing how to gather data, I'll discuss how we can use the data to formulate actionable response plans and how to adjust existing organizational practices to avoid repeating failure.
I plan to keep the technical portions of this talk at a novice level so that it's accessible to both developers/engineers and those in non-technical roles who will be involved in incident response.
What is Codemotion?
Codemotion is a unique platform that connects IT professionals, tech communities and international companies.
Our conferences are open to all languages and technologies, and we engage a vibrant community of more than 450.000 developers!