Job Summary
A company is looking for a Senior Machine Learning Ops Engineer to design and build the foundations of its machine learning operations.
Key Responsibilities
- Design, build, and maintain automated pipelines for training, testing, and deploying ML models
- Develop experiment tracking systems for performance metrics, data and model versioning, and documentation
- Set up monitoring and alerting for prediction quality, system health, and cost
Required Qualifications
- 8+ years of experience in designing and building production-grade ML pipelines and systems (5+ years may be considered)
- Strong knowledge of experiment tracking, model deployment strategies, data versioning, and monitoring
- Experience with ML infrastructure tools (e.g., MLflow, Kubeflow, Airflow)
- Familiarity with cloud platforms (e.g., GCP, AWS, Azure) and infrastructure-as-code practices (e.g., Terraform)
- Comfortable making architectural decisions and balancing best practices with practical trade-offs