Job Summary
A company is looking for a Cloud Site Reliability Engineer to enhance and expand their global monitoring and observability platform.
Key Responsibilities
- Write, configure, and deploy code to improve service reliability and set standards for code quality
- Lead debugging, troubleshooting, and analysis of service architecture and design
- Implement and manage SRE monitoring applications and develop tooling for proactive issue detection
Required Qualifications
- Experience with Golang, Postgres, and OpenTelemetry
- Proficiency in cloud infrastructure, particularly GCP
- Knowledge of Infrastructure as Code (IaC) tools like Terraform
- Experience in implementing security best practices and compliance measures
- Familiarity with disaster recovery planning and execution
Comments