Job Summary
A company is looking for a Staff Site Reliability Engineer.
Key Responsibilities
- Design and implement systems that enhance reliability, observability, traceability, and incident management
- Lead cross-team collaborations and drive impactful projects through technical leadership
- Define and enforce production standards, processes, and tools to ensure operational excellence
Required Qualifications
- 7+ years of experience in Production Engineering, Backend Engineering, SRE, DevOps, or a similar role
- Strong coding ability in at least one language (e.g., Golang, Python, Java, Typescript)
- Demonstrated experience delivering medium to large-scale projects that improve platform reliability and scalability
- Deep understanding of production reliability concepts, including SLIs, SLOs, and incident management
- Familiarity with working in dynamic, reliability-focused production environments (preferred)
Comments