Senior Software Engineer – AI Infrastructure

New
BrazilFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
5+ years
Required Skills
RustMLOpsDistributed Systems

Requirements

  • 5+ years of experience building and operating large-scale production systems.
  • Strong expertise in Rust or similar systems programming languages.
  • Deep understanding of distributed systems, reliability engineering, and performance optimization.
  • Proven experience supporting high-throughput or large user-base production environments.
  • Hands-on experience with ML infrastructure, model serving, or MLOps in production.
  • Strong knowledge of observability tools, monitoring practices, and incident/failure management.
  • Experience working cross-functionally with infrastructure and applied engineering teams.
  • Strong ownership mindset and ability to operate in high-stakes production environments.

Responsibilities

  • Design and build the core infrastructure layer powering AI agent systems in production environments.
  • Develop and maintain high-performance backend services (primarily in Rust) for inference, orchestration, and execution workloads.
  • Architect distributed systems capable of handling high throughput, low latency, and global-scale traffic.
  • Build and improve ML infrastructure, including model deployment, monitoring, evaluation, and lifecycle management.
  • Implement observability, reliability, and failure recovery mechanisms for critical agent-driven workflows.
  • Optimize system performance across latency, cost, and scalability constraints.
  • Collaborate closely with applied AI and infrastructure teams to productionize experimental systems.
  • Contribute to architectural decisions shaping long-term platform evolution.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now