-
YO IT Consulting

PhD Rater - Remote

YO IT Consulting
Finland · Full-time · Not Applicable

Seeking experienced researchers and technical experts to support a frontier-model evaluation project focused on agentic workflows. You will design and validate challenging benchmark tasks in data science, machine learning, finance, and coding to help identify reasoning and problem-solving gaps in advanced STEM models. The role involves building real-world tasks with executable tests and analyzing model or agent behavior.

Key Responsibilities

  • Design challenging, real-world STEM problems
  • Implement each task within an agentic development environment using Python

Contract and Payment Terms

  • You will be engaged as an independent contractor.
  • This is a fully remote role that can be completed on your own schedule.
  • Projects can be extended, shortened, or concluded early depending on needs and performance.
  • Payments are weekly on Stripe or Wise based on services rendered.

Key Skills

Ranked by relevance

machine learning
Login to Apply
Posted
Mar 31, 2026
Type
Full-time
Level
Not Applicable
Location
Finland

Industries

Software Development

Categories

Research Analyst Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
Banca Mediolanum
Related

Data Analytics & Reporting

2026-04-11

Full-time
Not Applicable
Italy
Banking
Research
View Job Details
Hostaway
Related

Senior Product Designer - Design Systems - 100% Remote - EMEA

2026-04-10

Full-time
Not Applicable
Finland
Software Development
Design
View Job Details
Service Driven Professionals
Related

Senior Backend Engineer .NET & Azure Cloud

2026-04-11

Full-time
Mid-Senior
Netherlands
Technology
Engineering