Design, test, and iteratively refine creative prompts to enhance AI model capabilities in reasoning, instruction following, and contextual understanding. Compare, choose and score AI-generated outputs, providing detailed rationales and to identify the superior response. Perform in-depth fact-checking and analysis to ensure model responses are accurate, relevant, and grounded in reliable sources. Develop ideal responses to serve as benchmarks for model training and performance analysis.