Job Summary
A company is looking for a Senior DL Algorithms Engineer.
Key Responsibilities
- Optimize deep learning models for low-latency, high-throughput inference, focusing on LLMs, VLMs, and WFMs
- Convert, deploy, and optimize models for efficient inference using frameworks like TensorRT and SGLang
- Collaborate with teams to integrate AI models from training to deployment and develop automation for inference optimization
Required Qualifications
- Master's or PhD in Computer Science, Electrical Engineering, or related field (or equivalent experience)
- 3+ years of professional experience in deep learning or applied machine learning
- Strong foundation in deep learning algorithms and hands-on experience with LLMs and VLMs
- Proficient in building and deploying models using PyTorch or TensorFlow
- Solid programming skills in Python and C++, with experience in inference optimization techniques
Comments