Job Summary
A company is looking for a MCP & Tools Python Developer for Agent Evaluation Infrastructure.
Key Responsibilities
- Developing and maintaining MCP-compatible evaluation servers
- Implementing logic to verify agent actions against scenario definitions
- Creating or extending tools for testing agents used by writers and QAs
Required Qualifications
- 4+ years of Python development experience, ideally in backend or tools
- Solid experience building APIs, testing frameworks, or protocol-based interfaces
- Understanding of Docker, Linux CLI, and HTTP-based communication
- Ability to integrate new tools into existing infrastructures
- Familiarity with how LLM agents are prompted, executed, and evaluated
Comments