Job Summary
A company is looking for a Senior Site Reliability Engineer.
Key Responsibilities
- Architect, manage, and maintain cloud infrastructure supporting applications and internal operations
- Enhance observability through logging, metrics, and alerting systems while leading incident investigations
- Provide technical leadership and mentorship while driving best practices in infrastructure and process documentation
Required Qualifications
- 8+ years of experience in SRE or DevOps roles with a strong track record on major cloud providers
- Deep expertise in configuring and troubleshooting Kubernetes clusters in production environments
- Advanced proficiency with automation tooling, including Bash scripting, CI/CD pipelines, Docker, and Terraform
- Experience managing identity and access management (IAM) and single sign-on (SSO) across multi-platform environments
- Operational expertise with observability platforms like New Relic for performance optimization