-
View all jobs
Role Overview
Mercor is partnering with a leading AI research lab on Project Vesuvius, an initiative designed to evaluate and enhance the ability of large language models (LLMs) to generate structured, high-quality research plans for open-ended machine learning problems.
We are seeking Machine Learning Researchers and PhDs to serve as annotators who will assess and provide structured feedback on AI-generated research plans. The goal is to improve how LLMs function as brainstorming partners for machine learning research.
This is a remote, short-term engagement with flexible hours and opportunities to contribute to frontier AI evaluation and research.
Key Responsibilities
Compensation & Contract Terms
Mercor is a global talent marketplace connecting exceptional professionals with leading AI labs and research organizations. Our mission is to empower experts to contribute directly to the most influential and technically advanced AI projects worldwide. Mercor is backed by investors including Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. Thousands of professionals across disciplines — from research and engineering to law and design — have joined Mercor to build the next generation of artificial intelligence systems.
Mercor is partnering with a leading AI research lab on Project Vesuvius, an initiative designed to evaluate and enhance the ability of large language models (LLMs) to generate structured, high-quality research plans for open-ended machine learning problems.
We are seeking Machine Learning Researchers and PhDs to serve as annotators who will assess and provide structured feedback on AI-generated research plans. The goal is to improve how LLMs function as brainstorming partners for machine learning research.
This is a remote, short-term engagement with flexible hours and opportunities to contribute to frontier AI evaluation and research.
Key Responsibilities
- Evaluate and compare AI-generated research plans for clarity, feasibility, and technical soundness.
- Design and compile ML tasks based on real-world challenges and research competitions.
- Draft detailed, executable natural language plans for machine learning workflows.
- Implement and validate research plans in Python within a Docker environment.
- Assess outputs against structured rubrics, provide usefulness scores, and deliver concise, objective feedback.
- 5+ years of experience in applied machine learning or a PhD in machine learning or related fields.
- Strong understanding of ML research methodologies, experimental design, and evaluation practices.
- Excellent analytical and technical writing skills.
- Experience with reproducibility or benchmarking in ML research preferred.
- Detail-oriented and able to deliver high-quality, structured feedback independently.
- Type: Independent contractor
- Mode: Fully remote and asynchronous — work from anywhere, on your own schedule.
- Commitment: Maximum 80 hours per week
- Project Name: Vesuvius
Compensation & Contract Terms
- Hourly Pay Rate: Up to $140/hour
- Payment: Weekly via Stripe Connect
- Contract Type: Independent contractor engagement
- Structure: Remote, milestone-based evaluation with flexible scheduling
- Application Process
- Submit your resume or CV highlighting relevant ML research or engineering experience.
- Complete a short AI-based interview and a brief questionnaire about your experience with reproducibility and model benchmarking.
- Selected candidates will receive detailed onboarding materials and access to the project environment.
Mercor is a global talent marketplace connecting exceptional professionals with leading AI labs and research organizations. Our mission is to empower experts to contribute directly to the most influential and technically advanced AI projects worldwide. Mercor is backed by investors including Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. Thousands of professionals across disciplines — from research and engineering to law and design — have joined Mercor to build the next generation of artificial intelligence systems.
Key Skills
Ranked by relevance
ai
machine learning
artificial intelligence
python
docker
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
AI Software Engineer
2026-05-22
Full-time
Not Applicable
Italy
Software Development
Engineering
View Job Details
Related
Machine Learning Engineer - Remote (all genders)
2026-05-28
Full-time
Associate
Germany
Software Development
Information Technology
View Job Details
Related
AI Software Engineer (m/f/d) - Berlin
2026-05-21
Full-time
Mid-Senior
Germany
Staffing
Information Technology
Login to Apply
- Posted
- Nov 18, 2025
- Type
- Full-time
- Level
- Not Applicable
- Location
- Toronto
- Company
- Mercor
Industries
Software Development
Categories
Other
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
AI Software Engineer
2026-05-22
Full-time
Not Applicable
Italy
Software Development
Engineering
View Job Details
Related
Machine Learning Engineer - Remote (all genders)
2026-05-28
Full-time
Associate
Germany
Software Development
Information Technology
View Job Details
Related
AI Software Engineer (m/f/d) - Berlin
2026-05-21
Full-time
Mid-Senior
Germany
Staffing
Information Technology