Job Summary
A company is looking for a Senior Site Reliability Engineer.
Key Responsibilities
- Drive stability and scalability across global compute platforms, including data centers and cloud environments
- Implement automation for self-healing infrastructure and develop internal tools to reduce repetitive tasks
- Establish performance and reliability metrics for infrastructure components and ensure high uptime and quality of service
Required Qualifications
- Bachelor's degree in Computer Science, or equivalent education, experience, and training
- At least 4 years of experience managing distributed cloud environments such as GCP, AWS, vSphere, and Nutanix
- Deep expertise in container orchestration with Kubernetes
- Strong experience in software development for automation and infrastructure tooling using Go and Python
- Experience with Infrastructure as Code (IaC) and configuration management tools such as Terraform, Pulumi, and Chef