AI Red Team Specialist
United States, Contract
Salary: 20 - 22 USD per hour
Job Details
- Languages: English, Marathi
- Required Skills: Cybersecurity
Requirements
- Fluent in English and Marathi.
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to technical and non-technical stakeholders.
- Adaptable and able to thrive across projects and customers.
- Experience with Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model extraction.
- Background in Cybersecurity: penetration testing, exploit development, reverse engineering.
- Expertise in socio-technical risk: harassment and disinformation probing, abuse analysis, conversational AI testing.
- Creative probing skills: drawing on psychology, acting, and writing for unconventional adversarial thinking.
Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, misuse cases, and bias exploitation.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document reproducibly by producing reports, datasets, and attack cases for customer action.