Full-time

Freelance Agent Evaluation Engineer

Posted by Mindrift • Illapel, Illapel, Chile

📍 Illapel, Illapel 🕒 June 04, 2026

About the Role

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What This Opportunity Involves

We're building a dataset to evaluate AI coding agents: how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:

Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history
Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair
Design tasks set in isolated environments - emulations of a developer's workstation: a Linux...

Job Details

Location: Illapel, Illapel
Job Type: Full-time
Category: Other-General
Posted: June 04, 2026
Deadline: July 14, 2026
