Job Summary
A company is looking for a Site Reliability Engineer.
Key Responsibilities
- Design, implement, and maintain monitoring and alerting systems to ensure optimal system performance
- Lead incident response efforts and implement preventive measures to reduce future occurrences
- Build and maintain automation tools to improve deployment reliability and enhance system resilience
Required Qualifications
- 3+ years of hands-on experience in site reliability engineering, DevOps, or similar roles
- Strong knowledge of SRE best practices including SLIs/SLOs and reliability engineering principles
- Cloud platform experience with services like Compute Engine, Kubernetes, and Cloud SQL
- Experience with monitoring tools such as DataDog
- Backend development experience with Java, PHP, and/or Node.js
Comments