Job Summary
A company is looking for a Member of Technical Staff - LLM Inference.
Key Responsibilities
- Drive breakthroughs in LLM inference optimization, with a focus on structured generation
- Deploy and optimize inference engines to enhance performance and reduce latency
- Collaborate in a remote environment to innovate and improve AI systems
Required Qualifications
- Proven experience with inference engines such as vLLM, SGLang, or TensorRT
- Hands-on knowledge of NVIDIA GPU architecture, including CUDA
- Experience with distributed inference and low-latency communication
- Background in LLM MLOps, including monitoring and scaling inference services
- Proficiency in Python and familiarity with containerization technologies