Engineering Manager, Site Reliability Engineering - Observability
Job Description
Twitter Site Reliability Engineers (SREs) are Software Engineers who focus on Availability, Reliability, Disaster Recovery, and other challenges of Scale. They possess a breadth and depth of knowledge about Twitter’s production environment that allows them to craft tools, processes and frameworks to guide colleagues through safely releasing production code, provide guidance and support for monitoring distributed systems, reduce operational overhead, and enable teams to achieve their desired reliability outcomes.
The Observability team ingests and serves petabytes of data from all the services and systems across Twitter’s entire infrastructure. This data is critical for Twitter’s production services and includes system- and service-level metrics, logging, and tracing. You’ll be focused on creating an environment where Observability SREs, who are embedded with the Observability Software Engineering teams, can improve Reliability and meet the challenges of operating at our continuously increasing scale.
We believe passion and personality matter; as such, we need leaders who can manage teams of diverse, smart, and driven engineers while balancing day-to-day people management with moving the business forward both technically and culturally.
Your responsibilities include, but are not limited to:
Qualifications
Additional Information
All of your information will be kept confidential according to EEO guidelines.