-
View all jobs
Position: PhD Rater
Type: Part-Time
Compensation: $70–$120/hour
Location: Remote
Commitment: 30+ hours/week (primarily weekdays)
Role Responsibilities
- Design challenging, real-world STEM benchmark problems in domains such as data science, machine learning, finance, and software engineering.
- Implement tasks within an agentic development environment using Python.
- Create reproducible problem setups with clear specifications and executable tests.
- Evaluate and analyze AI model behavior, including reasoning traces and agent workflows.
- Diagnose reasoning failures, logic gaps, and problem-solving limitations in AI systems.
- Contribute to improving benchmark quality and evaluation frameworks for frontier AI models.
Requirements
- Active or recently graduated PhD.
- Deep expertise in data science, machine learning, finance, and/or Python-based software development.
- Strong research background in advanced STEM topics.
- Ability to commit reliably for 30+ hours per week.
- Demonstrated technical output such as high-quality open-source contributions or research work.
- Ability to analyze agent behavior traces and diagnose failures beyond surface-level errors.
Application Process
- Upload resume
- Interview
- Submit form
Key Skills
Ranked by relevance
ai
machine learning
python
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Site Reliability Engineer (SRE) Mid-Level / Senior, Portugal
2026-04-11
Full-time
Not Applicable
Portugal
IT Services
Engineering
View Job Details
Related
DevOps Engineer
2026-04-10
Full-time
Not Applicable
Spain
IT Services
Engineering
View Job Details
Related
🚀 ML / AI Engineer (GenAI & MLOps) | Lleva modelos a producción real - Modelo Hibrido (Madrid)
2026-04-10
Full-time
Associate
Spain
IT Services
Engineering
Login to Apply
- Posted
- Mar 30, 2026
- Type
- Contract
- Level
- Associate
- Location
- Canada
- Company
- Crossing Hurdles
Industries
Financial Services
IT Services
IT Consulting
Research Services
Categories
Engineering
Information Technology
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Site Reliability Engineer (SRE) Mid-Level / Senior, Portugal
2026-04-11
Full-time
Not Applicable
Portugal
IT Services
Engineering
View Job Details
Related
DevOps Engineer
2026-04-10
Full-time
Not Applicable
Spain
IT Services
Engineering
View Job Details
Related
🚀 ML / AI Engineer (GenAI & MLOps) | Lleva modelos a producción real - Modelo Hibrido (Madrid)
2026-04-10
Full-time
Associate
Spain
IT Services
Engineering