AI Safety & Red Teaming Specialist

New

India. United StatesContractMiddle

Salary50 - 90 USD per hour

Apply NowOpens the employer's application page

Job Details

2+ years of experience in AI Safety, Adversarial Machine Learning, LLM Red Teaming, AI Security, or a related field.
Hands-on experience researching, testing, or identifying vulnerabilities involving prompt injection, ethical jailbreaks, adversarial attacks, or tool-use exploitation.
Strong understanding of modern LLM architectures, prompt engineering, and AI safety evaluation methodologies.
Experience developing structured security assessments, regression testing frameworks, and adversarial evaluation strategies.
Excellent analytical, documentation, and communication skills.
Ability to collaborate effectively within cross-functional technical teams.

Design and implement advanced evaluation methodologies for AI system safety, including ethical jailbreak testing, prompt injection detection, LLM red teaming, and tool-use abuse scenarios.
Develop cross-domain adversarial testing strategies to uncover complex, multi-turn attack patterns and model vulnerabilities.
Build, maintain, and enhance regression test suites to continuously assess jailbreak susceptibility and prompt injection risks.
Create comprehensive evaluation frameworks that simulate real-world adversarial threats to improve AI robustness and reliability.
Collaborate with technical teams to translate security findings into actionable recommendations for AI safety improvements.
Document testing methodologies, findings, and best practices through clear technical reports and presentations for both technical and non-technical stakeholders.

View Full Description & ApplyYou'll be redirected to the employer's site