Job Summary
A company is looking for a Site Reliability Engineer.
Key Responsibilities
- Design, implement, and maintain monitoring, alerting, and incident response systems
- Ensure high availability, reliability, and performance of infrastructure supporting scalable services
- Collaborate with engineering teams throughout the full development lifecycle
Required Qualifications
- 6+ years of DevOps or site reliability experience
- Strong engineering background in Computer Science, Software Engineering, or Mathematics
- Deep understanding of distributed systems and containerization (e.g., Docker, Kubernetes)
- Experience with infrastructure as code using Terraform and/or Ansible
- Proficiency with programming languages such as Python or Go
Comments