Job Summary
A company is looking for a Principal Site Reliability Engineer, Network Observability.
Key Responsibilities
- Automate network observability processes in collaboration with NOC and software engineering teams
- Design and implement effective monitoring and alerting systems to proactively identify issues
- Manage the incident lifecycle, leading root cause analysis and implementing preventative measures
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
- Minimum of twelve (12) years of experience in a Senior Network Engineer or Senior Site Reliability Engineer role
- Strong understanding of system administration, Linux, and proficiency in scripting languages (Python and various shells)
- Expert knowledge of networking concepts and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S
- Experience with various monitoring platforms and automation tools for efficient operations
Comments