Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: Red Team Specialist
Type: Full-time or Part-time Contract Work
Compensation: $56/hour
Location: Remote
Commitment: 20+ hours/week
Role Responsibilities
- Evaluate conversational AI models for vulnerabilities such as jailbreaks, prompt injections, and bias exploitation.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document reproducibly by producing reports, datasets, and attack cases that customers can act on.
- Collaborate with AI research teams to improve model safety and robustness.
Must-Have
- Fluent Language Skills Required: English & German
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Strong communication skills to explain risks clearly to technical and non-technical stakeholders.
- Ability to work independently and adapt across projects and customers.
- Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
- Cybersecurity skills: penetration testing, exploit development, reverse engineering.
- Expertise in socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
- Creative probing skills: psychology, acting, writing for unconventional adversarial thinking.
- Hourly contractor
- Paid weekly via Stripe Connect
- Upload resume
- AI interview based on your resume
- Submit form
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
- For any help or support, reach out to: [email protected]
,
Key Skills
Ranked by relevanceReady to apply?
Join Mercor and take your career to the next level!
Application takes less than 5 minutes

