Job Summary
A company is looking for a Site Reliability Engineer I (Resilience).
Key Responsibilities
- Lead technical initiatives to automate system engineering efforts for global infrastructure reliability
- Develop and maintain software and tools to scale the global Platform infrastructure
- Respond to major incidents and prevent customer impact while participating in an on-call rotation
Required Qualifications
- Experience in software engineering with a focus on Platform reliability
- Familiarity with public cloud and managed Kubernetes services
- Proven ability to work in distributed teams or remote environments
- Experience with Infrastructure-as-Code tooling such as Crossplane or Terraform is advantageous
- Background in system administration with professional Linux skills in distributed systems