Job Summary
A company is looking for a Junior Site Reliability Engineer.
Key Responsibilities
- Monitor production systems for performance and reliability issues, responding to incidents swiftly
- Establish and maintain Service Level Objectives (SLOs), Service Level Agreements (SLAs), and Service Level Indicators (SLIs)
- Build and maintain automation scripts, tools, and dashboards to enhance monitoring and response times
Required Qualifications
- 1+ years of relevant Site Reliability Engineering or production operations experience
- Basic cloud experience with AWS
- Experience with monitoring and troubleshooting production environments
- Knowledge of alerting and monitoring tools (e.g., Datadog, Prometheus, Grafana, Loki, Zabbix - an advantage)
- Experience with Node.js
Comments