Job Summary
A company is looking for a Senior DGX Cloud AI Infrastructure Software Engineer.
Key Responsibilities
- Develop infrastructure software and tools for large-scale AI and GenAI infrastructure
- Optimize tools to enhance infrastructure efficiency and resiliency
- Analyze and triage failures from the application level to the hardware level
Required Qualifications
- Minimum of 12+ years of experience in developing software infrastructure for large-scale AI systems
- Bachelor's degree or higher in Computer Science or a related technical field (or equivalent experience)
- Strong debugging skills with experience in analyzing AI applications
- Proven track record in building and scaling large-scale distributed systems
- Experience with AI training and inferencing and data infrastructure services
Comments