Job Summary
A company is looking for a Site Reliability Engineer to support both on-premises and cloud infrastructure.
Key Responsibilities
- Contribute to initiatives aligned with the systems roadmap in a collaborative team environment
- Build and refine monitoring and alerting systems to ensure high availability and performance
- Lead incident response, conduct root cause analysis, and drive remediation to prevent recurrence
Required Qualifications
- Familiarity with security best practices and experience implementing security measures across infrastructure
- Experience in performance tuning and optimizing systems for scalability and efficiency
- Experience in designing and implementing disaster recovery and business continuity plans
- Ability to mentor junior team members and share knowledge to foster a collaborative learning environment
- Experience with Linux system administration, cloud networking, and automation using Python