Job Summary
A company is looking for a Voice AI Evaluation Lead to benchmark and evaluate the performance of voice AI models.
Key Responsibilities
- Build and maintain scalable benchmarking pipelines for model evaluations
- Run regular evaluations of production and pre-release models on real-world datasets
- Collaborate with cross-functional teams to develop new evaluation methodologies
Required Qualifications
- Experience designing and executing evaluation pipelines for ML models
- Proficiency in Python and data analysis libraries
- Ability to develop automated evaluation systems and work with large-scale datasets
- Experience using LLMs for analysis or pipeline prototyping
- Proven success working cross-functionally with research, engineering, QA, and product teams
Comments