AI Safety & Red Teaming Specialist
New
W
Weekday AIAI Safety
India. United StatesContractMiddle
Salary50 - 90 USD per hour
Apply NowOpens the employer's application page
Job Details
- Experience
- 2+ years
- Required Skills
- Prompt Engineering
Requirements
- 2+ years of experience in AI Safety, Adversarial Machine Learning, LLM Red Teaming, AI Security, or a related field.
- Hands-on experience researching, testing, or identifying vulnerabilities involving prompt injection, ethical jailbreaks, adversarial attacks, or tool-use exploitation.
- Strong understanding of modern LLM architectures, prompt engineering, and AI safety evaluation methodologies.
- Experience developing structured security assessments, regression testing frameworks, and adversarial evaluation strategies.
- Excellent analytical, documentation, and communication skills.
- Ability to collaborate effectively within cross-functional technical teams.
Responsibilities
- Design and implement advanced evaluation methodologies for AI system safety, including ethical jailbreak testing, prompt injection detection, LLM red teaming, and tool-use abuse scenarios.
- Develop cross-domain adversarial testing strategies to uncover complex, multi-turn attack patterns and model vulnerabilities.
- Build, maintain, and enhance regression test suites to continuously assess jailbreak susceptibility and prompt injection risks.
- Create comprehensive evaluation frameworks that simulate real-world adversarial threats to improve AI robustness and reliability.
- Collaborate with technical teams to translate security findings into actionable recommendations for AI safety improvements.
- Document testing methodologies, findings, and best practices through clear technical reports and presentations for both technical and non-technical stakeholders.
View Full Description & ApplyYou'll be redirected to the employer's site