Job Summary
A company is looking for a Senior Site Reliability Engineer (SRE) for a remote contract position.
Key Responsibilities
- Build, review, and maintain application design and architecture documents while ensuring disaster recovery (DR) capabilities are integrated into systems
- Lead complex projects focused on observability, monitoring, and performance improvement, including DR testing exercises
- Collaborate with development teams to optimize service reliability and maintain documentation for incident response and continuous improvement
Required Qualifications
- Bachelor's degree
- 4-6 years of related experience
- Proficiency in AWS, Kubernetes, and monitoring tools such as Prometheus and Grafana
- Experience with load balancing strategies and observability standards
- Familiarity with Rancher and Axway API Gateway is a plus