Senior AI Engineer - Agentic Evaluation & V&V

New
S
Slingshot AerospaceSpace Operations
Remote, USFull-TimeSenior
Salary150,000 - 250,000 USD per year
Apply NowOpens the employer's application page

Job Details

Experience
6+ years
Required Skills
Python

Requirements

  • 6+ years of experience in software engineering, machine learning engineering, applied AI, or equivalent experience
  • Strong Python engineering skills with experience building SDKs, libraries, or evaluation tooling
  • Experience designing evaluation frameworks, benchmarks, metrics, or test harnesses for AI/ML systems
  • Ability to analyze system behavior, identify failure modes, and evaluate performance in complex autonomous or semi-autonomous systems
  • Familiarity with modern agent frameworks, orchestration patterns, or protocol-based integrations
  • Experience working in cross-functional, multidisciplinary teams
  • Strong written and verbal communication skills
  • Bachelor’s degree in a relevant science or engineering field, or equivalent experience

Responsibilities

  • Extend and maintain Slingshot’s V&V SDK and evaluation framework for simulation-backed validation of agentic AI systems
  • Design and implement agent-level and end-to-end evaluations, including benchmark scenarios, scoring logic, and experiment harnesses
  • Build benchmark scenarios and tooling that measure planning, reasoning, and operational performance for autonomous mission planning systems
  • Translate astrodynamics and mission-domain concepts into executable evaluation scenarios and simulation configurations
  • Develop reusable SDK interfaces, adapters, and evaluation utilities that connect V&V systems, TALOS benchmarks, and agent workflows
  • Define and apply metrics for capability evaluation, failure analysis, regression detection, and comparative benchmarking
  • Partner with cross-functional teams to identify evaluation needs and contribute to improving coverage of critical capabilities
  • Contribute to best practices for evaluating complex, autonomous AI systems
  • Uphold strong engineering standards through testing, documentation, reproducibility, and maintainable system design
View Full Description & ApplyYou'll be redirected to the employer's site
150,000 - 250,000 USD per year
Apply Now