Job Summary
A company is looking for a Senior Site Reliability Engineer to manage and maintain the performance, reliability, and security of its platform infrastructure.
Key Responsibilities
- Troubleshoots and resolves complex problems with systems and services, and initiates regular deployments
- Leads projects focused on building and maintaining observability and monitoring for applications
- Conducts post-incident reviews and documents findings for future decision-making
Required Qualifications
- Bachelor's degree in a quantitative or business field or equivalent experience
- 4 to 6 years of related experience in site reliability engineering or similar roles
- Experience with Kubernetes, Docker, monitoring and observability tools, Linux, and AWS is desired
- Knowledge of continuous integration and continuous delivery (CI/CD) tools and processes
- Experience leading and mentoring junior engineers