Job Summary
A company is looking for a Site Reliability Engineer to enhance its reliability posture through observability and incident response.
Key Responsibilities
- Build and improve platform tooling for service observation and operation
- Partner with product teams to enhance observability and incident response
- Automate operations and contribute to infrastructure reliability using core technologies
Required Qualifications
- 4+ years of experience in systems, infrastructure, or backend software roles
- Proficient in production-grade coding with Go, Python, or similar languages
- Experience with infrastructure-as-code, cloud services, and container orchestration
- Hands-on experience with observability tools and practices
- Familiarity with AI tools and a proactive approach to improving reliability workflows
Comments