Job Summary
A company is looking for a Senior Site Reliability Engineer to help monitor, develop, and scale its platform.
Key Responsibilities
- Administer, monitor, and troubleshoot application and network components in a cloud-based environment
- Design, author, deploy, and monitor manifests for Kubernetes clusters and service mesh configurations
- Provide production support and develop monitoring and alerting architecture
Required Qualifications
- 5+ years of experience in UNIX/Linux systems and network administration
- Experience with AWS services and deploying/maintaining Kubernetes clusters
- Hands-on experience with Helm charts and service meshes
- Development experience in PHP and extensive experience with Docker/containers
- Proficiency in infrastructure-as-code tools and an understanding of observability principles