Job Summary
A company is looking for a Site Reliability Engineer II (Platform) to join their Platform team.
Key Responsibilities
- Build and maintain scalable platform infrastructure and services for multiple product teams
- Develop developer tools, automation systems, and self-service platforms to enhance engineering productivity
- Implement monitoring, logging, and alerting systems to ensure high availability of platform services
Required Qualifications
- Strong experience with Linux systems administration and cloud infrastructure, particularly AWS
- Proficiency with infrastructure-as-code tools such as Terraform, Ansible, or CloudFormation
- Solid programming and scripting skills in languages like Python, Go, or Bash
- Experience with containerization technologies including Docker and orchestration platforms
- Knowledge of monitoring and observability tools such as Prometheus, Grafana, or Datadog
Comments