Job Summary
A company is looking for a Staff Systems Reliability Engineer.
Key Responsibilities
- Design and implement scalable, fault-tolerant AWS-based infrastructure using Terraform and/or CloudFormation
- Develop and maintain CI/CD pipelines and write automation tools in Python and/or Go
- Lead incident response efforts and collaborate with software engineers to ensure system reliability and security
Required Qualifications
- Minimum of 8 years of related experience with a Bachelor's degree or equivalent work experience
- Expert-level knowledge of AWS services and infrastructure-as-code
- Strong proficiency in Python and/or Go for automation and tooling
- Experience managing observability and alerting systems at scale
- Familiarity with regulatory requirements such as HIPAA and FDA 21 CFR Part 11
Comments