-
YO IT Consulting

PhD Rater - Remote

YO IT Consulting
Luxembourg · Full-time · Not Applicable

Seeking experienced researchers and technical experts to support a frontier-model evaluation project focused on agentic workflows. You will design and validate challenging benchmark tasks in data science, machine learning, finance, and coding to help identify reasoning and problem-solving gaps in advanced STEM models. The role involves building real-world tasks with executable tests and analyzing model or agent behavior.

Key Responsibilities

  • Design challenging, real-world STEM problems
  • Implement each task within an agentic development environment using Python

Contract and Payment Terms

  • You will be engaged as an independent contractor.
  • This is a fully remote role that can be completed on your own schedule.
  • Projects can be extended, shortened, or concluded early depending on needs and performance.
  • Payments are weekly on Stripe or Wise based on services rendered.

Key Skills

Ranked by relevance

machine learning
Login to Apply
Posted
Mar 31, 2026
Type
Full-time
Level
Not Applicable
Location
Luxembourg

Industries

Software Development

Categories

Research Analyst Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
Plaud
Related

Product Analyst - Singapore

2026-05-24

Full-time
Not Applicable
Singapore
Software Development
Research
View Job Details
Deloitte
Related

Technology Strategy Trainee

2026-05-22

Full-time
Internship
Luxembourg
Business Consulting
Analyst
View Job Details
YO IT Consulting
Related

Backend Engineer - Remote

2026-05-24

Full-time
Not Applicable
United Kingdom
Software Development
Engineering