Mogi I/O : OTT/Podcast/Short Video Apps for you
Senior LLM Evaluation Engineer (C++ Focus) - 3-Month Contract
Qatar · Contract · Mid-Senior
Work Type: Contractor | Permanent Remote
Compensation: USD 15 – 25/hour
Hours: 40 hours/week (PST overlap required)
Experience Required: 5+ Years
Contract Duration: 3 Months
Notice Period: Max 2 weeks
Note
Contract-based, fully remote role. Payment is based on actual hours worked. No paid leave or benefits. The contractor handles their own taxes and compliance.
About The Project
We're building LLM evaluation and training datasets built around realistic software engineering problems. We create verifiable SWE tasks from public repository histories using synthetic methods with human-in-the-loop validation, and we are expanding coverage across programming languages, difficulty levels, and task types.
About The Role
Seeking a Tech Lead-level software engineer experienced with high-quality public GitHub repositories. You'll drive hands-on engineering work including environment automation, issue triaging, and test coverage/quality evaluation to advance LLM capabilities for real-world coding tasks.
Day-to-Day Responsibilities
- Analyze and triage GitHub issues from trending open-source libraries
- Configure code repositories (Dockerization, environment setup)
- Evaluate unit test coverage and software quality
- Modify/run codebases locally to validate LLM bug-fix performance
- Collaborate with researchers to identify LLM-challenging repositories/issues
- Lead junior engineers on assigned projects
Requirements
- 5+ years overall software engineering experience
- Tech Lead experience with complex codebases
- Proficiency in C++ (primary) or similar systems languages
- Expertise with Git, Docker, and CI/CD pipeline configuration
- Ability to run/debug/test real-world projects locally
- Open-source contribution/evaluation experience
Key Skills
- Docker
- CI/CD
- Git
- C
Posted: Jul 11, 2025
Type: Contract
Level: Mid-Senior
Location: Qatar
Industries: Software Development
Categories: Engineering, Information Technology