Staff Platform Engineer

This is a remote position, though it could also be a hybrid role from our Mexico City office.
Full-Time
Staff
Salary not disclosed

Job Details

Experience
7+ years
Required Skills
AWS, Python, Kubernetes, Go, CI/CD, Terraform, Datadog, LLM, LangChain

Requirements

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 7+ years of experience in cloud infrastructure and managing large-scale distributed systems.
  • Demonstrated experience architecting and scaling AI-driven systems in production.
  • Expertise in AWS (EKS, Lambda, Bedrock) and containerized/serverless architectures.
  • Strong expertise in Kubernetes at scale.
  • Knowledge of IaC tools (Terraform, Ansible) and agentic automation.
  • Mastery of Datadog and observability practices.
  • Experience with LLM orchestration frameworks (LangChain, LlamaIndex, CrewAI).
  • Strong coding expertise in Python and/or Go.
  • Proven ability to drive cross-functional initiatives.

Responsibilities

  • Design foundational patterns and guardrails for building and deploying AI agents in production.
  • Own agent governance including model selection, safety, and observability.
  • Architect agentic cloud infrastructure and mentor senior engineers in advanced patterns.
  • Lead large-scale distributed systems on AWS, focusing on performance and reliability.
  • Evolve the developer control plane (Cortex) into an AI-augmented self-service platform.
  • Drive AI-powered golden paths for CI/CD and infrastructure management.
  • Participate in on-call rotations and use post-mortems to improve system reliability.