Job Summary
A company is looking for a Site Reliability Engineer to join its Platform team and ensure the reliability and performance of its software systems.
Key Responsibilities
- Implement and maintain monitoring systems to proactively identify and address potential issues
- Automate repetitive tasks and processes to improve efficiency and reduce manual effort
- Respond to incidents, diagnose problems, and implement solutions to restore service quickly
Required Qualifications
- 5+ years of professional experience in a fast-paced SaaS or similar business environment
- 3+ years of hands-on experience supporting production systems as a Site Reliability Engineer or DevOps Engineer
- 3+ years of hands-on experience with cloud services and technologies (GCP, AWS, Azure, etc.)
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)
- Proficiency with Infrastructure as Code (IaC) tools and methodologies (e.g., Terraform, Pulumi, Puppet)