Job Summary
A company is looking for a Research Engineer - Distributed Training.
Key Responsibilities
- Lead and participate in research to develop a decentralized training orchestration solution
- Optimize performance, cost, and resource utilization of AI workloads using advanced techniques
- Contribute to open-source libraries and publish research in top-tier AI conferences
Required Qualifications
- Strong background in AI/ML engineering with experience in end-to-end training pipelines
- Deep expertise in distributed training techniques and frameworks
- Experience in large-scale model training and distributed training techniques
- Solid understanding of MLOps best practices
- Passion for advancing decentralized AI model training and democratizing AI access
Comments