Staff Platform Engineer
This is a remote position, though it could also be a hybrid role from our Mexico City office.
Full-Time
Staff
Salary not disclosed
Job Details
- Experience: 7+ years
- Required Skills: AWS, Python, Kubernetes, Go, CI/CD, Terraform, Datadog, LLM, LangChain
Requirements
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 7+ years of experience in cloud infrastructure and managing large-scale distributed systems.
- Demonstrated experience architecting and scaling AI-driven systems in production.
- Expertise in AWS (EKS, Lambda, Bedrock) and containerized/serverless architectures.
- Strong expertise in Kubernetes at scale.
- Knowledge of IaC tools (Terraform, Ansible) and agentic automation.
- Mastery of Datadog and observability.
- Experience with LLM orchestration frameworks (LangChain, LlamaIndex, CrewAI).
- Strong coding expertise in Python and/or Go.
- Proven ability to drive cross-functional initiatives.
Responsibilities
- Design foundational patterns and guardrails for building and deploying AI agents in production.
- Own agent governance including model selection, safety, and observability.
- Architect agentic cloud infrastructure and mentor senior engineers in advanced patterns.
- Lead large-scale distributed systems on AWS, focusing on performance and reliability.
- Evolve the developer control plane (Cortex) into an AI-augmented self-service platform.
- Drive AI-powered golden paths for CI/CD and infrastructure management.
- Participate in on-call rotations and use post-mortems to improve system reliability.