Job Summary
A company is looking for a Lead Site Reliability Engineer - Remote.
Key Responsibilities
- Drive SRE practices, lead incident response, and ensure high availability and performance of cloud environments
- Develop and maintain software using Python and Node.js
- Automate infrastructure and operations using Infrastructure as Code with Terraform and GitHub Actions
Required Qualifications
- 5+ years of experience in automation and administration of Public Cloud systems (GCP preferred)
- 3+ years of experience in Site Reliability Engineering, including system design and incident response
- 3+ years of CI/CD and Infrastructure as Code experience with Terraform
- 2+ years of experience managing Kubernetes environments
- Availability to work rotating 24/7 call shifts approximately every 8-12 weeks
Comments