Freelance AI Evaluation Scenario Writer

Posted 18 days agoViewed
BrazilMexicoPhilippinesSouth AfricaContractAI/ML
Company:Mindrift
Location:Brazil, Mexico, Philippines, South Africa
Languages:English
Skills:
PythonArtificial IntelligenceData AnalysisJavascriptMachine LearningQAQA AutomationJSONSoftware Engineering
Requirements:
Bachelor's and/or Master’s Degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. Background in QA, software testing, data analysis, or NLP annotation. Good understanding of test design principles Strong written communication skills in English. Comfortable with structured formats like JSON/YAML for scenario description. Can define expected agent behaviors (gold paths) and scoring logic. Basic experience with Python and JS. Curious and open to working with AI-generated content, agent logs, and prompt-based behavior.
Responsibilities:
Design realistic and structured evaluation scenarios for LLM-based agents Create test cases that simulate human-performed tasks Define gold-standard behavior and scoring logic to evaluate agent actions Analyze agent logs, failure modes, and decision paths Iterate on prompts, instructions, and test cases Ensure scenarios are production-ready, easy to run, and reusable
Similar Jobs:
Posted 1 day ago
United States, Latin America, IndiaFull-TimeData Strategy
Senior Consultant, AI & Data Strategy
Company:phData
Posted 1 day ago
Latin AmericaContractSoftware Development
AI Automations Engineer-LATAM
Company:Engine
Posted 1 day ago
BrazilPart-TimeAI Training
Freelance Machine Learning AI Trainer (Python)
Company:Mindrift