AI Query Evaluation Specialist (Copilot Competitive Intelligence)

Remote (U.S.-based), Full-Time, Middle

Salary: 80,000 - 95,000 USD per year

Job Details

Requirements

Strong English reading comprehension with the ability to interpret subtle differences in user intent
Demonstrated analytical thinking and logical reasoning skills
Experience working with structured data or annotation workflows
Familiarity with Microsoft Excel or similar data analysis tools
Strong user empathy and understanding of how diverse users formulate queries
Curiosity about and familiarity with modern AI tools (e.g., Copilot, ChatGPT, Gemini)
High attention to detail with a track record of delivering consistent, high-quality work
Reliable, proactive, and adaptable in fast-changing environments
Prior experience in data annotation, content evaluation, or dataset curation for AI or search products (Preferred)
Experience with AI evaluation, search relevance, or linguistic analysis (Preferred)
Basic statistical or data analysis knowledge (Preferred)
Demonstrated ability to quickly learn and interpret unfamiliar domains (Preferred)
Fluency in English plus at least one additional language: Japanese, Korean, French, Chinese, German, or Italian (Preferred)

Responsibilities

Support an initiative focused on evaluating and improving AI-powered search and assistant experiences
Work with real user queries to build and refine evaluation datasets
Benchmark Microsoft Copilot against leading AI systems
Review and analyze real user query logs to identify representative queries with clear intent
Curate and maintain high-quality datasets for AI system evaluation
Annotate queries across multiple evaluation dimensions (web search need, PII presence, domain expertise requirement, AI response quality attributes)
Ensure annotations are consistent, structured, and aligned with evaluation guidelines
Use tools such as Excel to organize, review, and summarize evaluation outputs
Identify trends in query patterns and provide feedback to improve dataset coverage and quality
Maintain high attention to detail, documentation quality, and evaluation integrity
