Job Summary
A company is looking for a Senior Software Engineer, Machine Learning Inference.
Key Responsibilities
- Design, develop, and optimize NVIDIA TensorRT and TensorRT-LLM for inference applications
- Develop software in C++, Python, and CUDA for deploying LLMs and Generative AI models
- Collaborate with deep learning experts and GPU architects to influence hardware and software design
Required Qualifications
- BS, MS, PhD or equivalent experience in Computer Science, Computer Engineering, or a related field
- 8+ years of software development experience on a large codebase or project
- Strong programming proficiency in C++ (required), Rust, or Python
- Experience developing deep learning frameworks, compilers, or system software
- Knowledge of Machine Learning techniques and GPU programming with CUDA or OpenCL