Avature’s Coverage team is dedicated to maintaining and improving the quality of the monitoring tools and practices we rely on during on-call shifts and other incident-detection work. The team’s scope ranges from managing and continuously improving server and service monitoring and alerting to maintaining a holistic view of system reliability.
As a Cloud Reliability Engineer, you’ll implement tools and processes that improve observability, monitoring, and incident management, minimize emergency response time, and make incident handling as pain-free as possible for the teams involved.