Job Summary
A company is looking for a DevOps Lead - Bare-Metal & GPU Infrastructure (Linux).
Key Responsibilities
- Design and automate high-availability architectures for fleet reliability
- Build CI/CD pipelines for ultra-fast global configuration and container changes
- Guide a small SRE/DevOps team, setting coding standards and best practices
Required Qualifications
- 5+ years of experience in Linux SRE/DevOps with large bare-metal node fleets
- Deep knowledge of NVIDIA/AMD GPU servers and high-speed interconnects
- Proven record of maintaining 99.999% uptime in high-demand environments
- Expertise in Kubernetes on bare metal and Infrastructure-as-Code tools
- Strong programming skills in Go or Python, plus Bash
Comments