AI Adversarial Specialist
United Kingdom, Contract, Entry
Salary: 20 - 22 USD per hour
Job Details
- Languages: Native fluency in English and Urdu.
- Required Skills: Cybersecurity, Generative AI
Requirements
- Native fluency in English and Urdu.
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to both technical and non-technical stakeholders.
- Experience in Adversarial ML (preferred).
- Background in psychology, acting, or writing that supports creative probing and unconventional adversarial thinking (preferred).
Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document findings reproducibly, producing reports, datasets, and attack cases that customers can act on.
- Review AI outputs on sensitive topics like bias and misinformation, with optional participation in higher-sensitivity projects.