Freelance Agent Evaluation Engineer

Mindrift

Apply Now
Canada
$90,000 - $90,000 / year
full-time
senior
Posted May 12, 2026
via himalayas

About This Role

We're building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You'll work on a part-time, non-permanent project, creating tasks for AI agents to evaluate and improve their coding abilities. Requirements • Degree in Computer Science, Software Engineering, or related fields • 5+ years in software development, primarily Python • Background in full-stack development, with experience building React-based interfaces and robust back-end systems • Experience writing tests, familiarity with Docker containers, CI/CD tools, and infrastructure tools Benefits • Opportunity to work on a challenging project, Flexible schedule, Compensation up to $45 per hour Originally posted on Himalayas

Ready to Apply?

Click the button below to visit the company's application page.

Apply for this Position