Job Summary
A company is looking for a Principal Site Reliability Engineer.
Key Responsibilities
- Lead project work to build and maintain platform features for reliability and cloud infrastructure
- Mentor service owners on deploying and operating services at scale
- Participate in incident response, triage, and root cause analysis
Required Qualifications
- Significant experience operating Kubernetes in distributed environments
- Experience with systems in GCP or AWS
- Exposure to monitoring and observability infrastructure
- Understanding of infrastructure-as-code practices and tools
- Six years of systems experience in operations or development
Comments