Freelance AI Evaluation Engineer (Python/Full-Stack)

Mindrift
Remote Hungary Full-time 🌐 English
MI
Salary: $100k - $100k/year
Experience: Mid-level
Added to JobCollate: March 14, 2026

AI Summary Powered by Gemini

This freelance role involves creating challenging coding test cases for AI systems, requiring strong Python development, Full-Stack experience with React, and a background in test automation. The opportunity offers flexible hours and a competitive hourly rate for a remote position.

Job Description

We're looking for a freelance AI evaluation engineer with experience in software development, test automation, and Full-Stack development to create challenging coding test cases for AI coding systems.RequirementsDegree in Computer Science, Software Engineering, or related fields5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systemsExperience writing tests (functional, integration – not just running them)Docker containers (running evaluations locally in containers)CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)English proficiency - B2BenefitsUp to $50 per hour equivalentEstimated 20 hours of work per projectFlexible work scheduleOriginally posted on Himalayas

Full Description

We're looking for a freelance AI evaluation engineer with experience in software development, test automation, and Full-Stack development to create challenging coding test cases for AI coding systems.RequirementsDegree in Computer Science, Software Engineering, or related fields5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systemsExperience writing tests (functional, integration – not just running them)Docker containers (running evaluations locally in containers)CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)English proficiency - B2BenefitsUp to $50 per hour equivalentEstimated 20 hours of work per projectFlexible work scheduleOriginally posted on Himalayas

Required Skills

Mid-Level-Full-Stack-AI-Developer-(Python-Azure) Python-AI-Engineer Software-Engineer-(AI) Mid-Level-Full-Stack-AI-Engineer Freelance-AI-Developer

Similar Jobs