Job Summary
A company is looking for a Site Reliability Engineer to assist with infrastructure, monitoring, and cloud services.
Key Responsibilities
- Assist with infrastructure, monitoring, and cloud services, focusing on GCP and Kubernetes
- Build automation within the infrastructure to ensure high availability and resiliency
- Support the migration from PCF to GCP and work with various monitoring and performance management tools
Required Qualifications
- Experience with Terraform, including building modules from scratch
- Knowledge of Ansible or other configuration management tools
- Proficiency in Kubernetes and Python
- Experience with application performance management (APM) tools such as Dynatrace, Datadog, or New Relic
- Familiarity with Prometheus, Grafana, and GCP services like GKE, Pub/Sub, and BigQuery