AI Red Team Specialist

New
United StatesContract
Salary20 - 22 USD per hour
Apply NowOpens the employer's application page

Job Details

Languages
English, Marathi
Required Skills
Cybersecurity

Requirements

  • Fluent in English and Marathi.
  • Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
  • Ability to explain risks clearly to technical and non-technical stakeholders.
  • Adaptable and able to thrive across projects and customers.
  • Experience with Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
  • Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
  • Expertise in socio-technical risk: harassment/disinfo probing, abuse analysis, conversational AI testing.
  • Creative probing skills: psychology, acting, writing for unconventional adversarial thinking.

Responsibilities

  • Red team conversational AI models and agents to identify jailbreaks, prompt injections, misuse cases, and bias exploitation.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
  • Document reproducibly by producing reports, datasets, and attack cases for customer action.
View Full Description & ApplyYou'll be redirected to the employer's site
20 - 22 USD per hour
Apply Now