In August ’24, just after our service went live, something went wrong and we effectively DDOS’d our Azure KeyVault instance for several hours, which really wasn’t helpful.
In this session we’ll walk through the incident and our journey from being alerted and the initial panic, through collaborative firefighting, to methodical investigation and finally resolution. On the way we’ll talk about the importance of telemetry, teamwork, and not placing too much trust in your service provider
In the end the issue was resolved. Our team learned a lot that day about how we handle incidents and learn from what happened. I hope you’ll leave the session thinking about how your team can take what happened to us and make sure you can handle your own incidents effectively.