Mercor
Adversarial ML Engineer Remote
Mercor · Germany · 1 day ago
Part-time · Remote Friendly · Research
About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Red Team Specialist

Type: Full-time or Part-time Contract Work

Compensation: $56/hour

Location: Remote

Commitment: 20+ hours/week

Role Responsibilities

  • Red team conversational AI models and agents, focusing on jailbreaks, prompt injections, and bias exploitation.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure using taxonomies, benchmarks, and playbooks to maintain consistent testing.
  • Document reproducibly by producing reports, datasets, and attack cases for customer action.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

  • Language skills: native-level fluency in English and German.
  • Prior experience in red teaming, AI adversarial work, cybersecurity, or socio-technical probing.
  • Ability to explain risks clearly to both technical and non-technical stakeholders.

Preferred

  • Experience in adversarial ML, cybersecurity, or socio-technical risk analysis.
  • Experience with jailbreak datasets, prompt injection, or RLHF/DPO attacks.

Compensation & Legal

  • Hourly contractor, paid weekly via Stripe Connect.

Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
