Remote Jobs

Senior Site Reliability Engineer

7/8/2025

No location specified

Job Summary

A company is looking for a Senior Site Reliability Engineer, AI Infrastructure.

Key Responsibilities

Develop and maintain large-scale systems for AI Infrastructure, ensuring reliability and scalability
Implement SRE fundamentals, including incident management and automation tools to enhance operational efficiency
Establish frameworks for operational maturity and lead incident response protocols to improve system resilience

Required Qualifications

Degree in Computer Science or related field, or equivalent experience with 12+ years in Software Development, SRE, or Production Engineering
Proficiency in Python and at least one additional programming language (C/C++, Go, Perl, Ruby)
Expertise in systems engineering within Linux or Windows environments and cloud platforms (AWS, OCI, Azure, GCP)
Strong understanding of SRE principles, including error budgets and Infrastructure as Code tools
Hands-on experience with observability platforms and CI/CD systems

Comments

No comments yet. Be the first to comment!

Similar Jobs

Maintenance & Pressure Testing Technician (3rd Shift)

6/27/2025

Remote Jobs

Oracle Integration Cloud Specialist

6/25/2025

Remote Jobs

Observability Engineer

6/26/2025

Remote Jobs

Senior Orchestration Engineer

7/8/2025

Remote Jobs

Senior Cloud Architect

6/25/2025

Remote Jobs

Senior AI ML Engineer

6/26/2025

Remote Jobs

Principal AI/ML Engineer

6/28/2025

Remote Jobs

Senior Prompt Engineer

7/1/2025

Remote Jobs

Licensed Civil Engineer

6/26/2025

Remote Jobs

Senior Observability Engineer

7/1/2025

Remote Jobs

Senior DevOps Engineer

6/28/2025

Remote Jobs

Senior Engineering Manager

7/8/2025

Remote Jobs

Senior Authentication Engineer

6/25/2025

Remote Jobs