Job Summary
A company is looking for a Member of Technical Staff, Infrastructure & Data.
Key Responsibilities
- Build, manage, and scale GPU infrastructure using tools like Kubernetes, Terraform, or Pulumi
- Maintain and optimize ETL pipelines using Spark, Ray, or Airflow
- Operate and improve telemetry and monitoring stack (Datadog, Grafana, Weights & Biases)
Required Qualifications
- Experience managing large-scale, high-performance infrastructure
- Skilled in designing scalable systems for compute, data, and developer tooling
- Familiar with infrastructure stacks for AI model training and experimentation
- Experienced with Kubernetes, Terraform/Pulumi, Spark/Ray, and observability tools
- Bonus: experience as a Cluster Engineer, Data Engineer, or Developer Advocate in AI/ML environments
Comments