Job Summary
A company is looking for a Service Reliability Engineer to ensure the reliability and performance of its production services.
Key Responsibilities
- Manage a 24x7 multi-site production infrastructure, including deployment and maintenance
- Root-cause complex problems involving multiple stakeholders and systems
- Collaborate with product engineering teams to ensure operational standards and successful product rollouts
Required Qualifications
- 3-5 years of experience managing and troubleshooting Linux systems
- Experience with TCP/IP, HTTP, DNS, SMTP, and LDAP in a distributed computing environment
- Familiarity with virtualization technologies such as KVM, VMware vSphere, or OpenStack
- Experience with monitoring and alerting systems
- BS in Computer Science, Engineering, or a related technical discipline, or equivalent experience
Comments