Share:

Tailor your resume to this posting—match keywords and layout for recruiters. Try Resume.io before you apply.

Job Description

We're building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You'll work on a part-time, non-permanent project, creating tasks for AI agents to evaluate and improve their coding abilities.RequirementsDegree in Computer Science, Software Engineering, or related fields5+ years in software development, primarily PythonBackground in full-stack development, with experience building React-based interfaces and robust back-end systemsExperience writing tests, familiarity with Docker containers, CI/CD tools, and infrastructure toolsBenefitsOpportunity to work on a challenging project, Flexible schedule, Compensation up to $45 per hourOriginally posted on Himalayas

Full Description

We're building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You'll work on a part-time, non-permanent project, creating tasks for AI agents to evaluate and improve their coding abilities.RequirementsDegree in Computer Science, Software Engineering, or related fields5+ years in software development, primarily PythonBackground in full-stack development, with experience building React-based interfaces and robust back-end systemsExperience writing tests, familiarity with Docker containers, CI/CD tools, and infrastructure toolsBenefitsOpportunity to work on a challenging project, Flexible schedule, Compensation up to $45 per hourOriginally posted on Himalayas

Required Skills

AI-Evaluation-Engineer Senior-AI-Agent-Engineer Evaluation-Engineer

Freelance Agent Evaluation Engineer

Job Description

Full Description

Required Skills

Similar Jobs

Director Security Engineer | DevSecOps

Python Backend Developer

Staff Engineer - Web Platform (Typecript/React)

Software Engineer II, Autonomous Freight Systems

Software Engineer, Hardware Test & Automation (Optical Payloads)

Lead Software Engineer (Java, AWS)