Job Summary
A company is looking for an MCP & Tools Python Developer - Agent Evaluation Infrastructure.
Key Responsibilities
- Developing and maintaining MCP-compatible evaluation servers
- Implementing logic to check agent actions against scenario definitions
- Creating or extending tools for writers and QAs to test agents
Required Qualifications
- 4+ years of Python development experience, ideally in backend or tooling work
- Solid experience building APIs, testing frameworks, or protocol-based interfaces
- Understanding of Docker, Linux CLI, and HTTP-based communication
- Ability to integrate new tools into existing infrastructure
- Familiarity with how LLM agents are prompted, executed, and evaluated