Job Summary
A company is looking for a Senior AI Research Engineer, Model Inference (Remote).
Key Responsibilities
- Implement and optimize custom inference and fine-tuning kernels for language models across multiple hardware backends
- Design and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows
- Collaborate with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines
Required Qualifications
- Proficiency in C++ and GPU kernel programming
- Proven expertise in GPU acceleration with the Vulkan API
- Strong background in quantization and mixed-precision model optimization
- Experience with mobile GPU acceleration and model inference
- Familiarity with large language model architectures