Senior Software Engineer – AI Infrastructure
New
BrazilFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years
- Required Skills
- RustMLOpsDistributed Systems
Requirements
- 5+ years of experience building and operating large-scale production systems.
- Strong expertise in Rust or similar systems programming languages.
- Deep understanding of distributed systems, reliability engineering, and performance optimization.
- Proven experience supporting high-throughput or large user-base production environments.
- Hands-on experience with ML infrastructure, model serving, or MLOps in production.
- Strong knowledge of observability tools, monitoring practices, and incident/failure management.
- Experience working cross-functionally with infrastructure and applied engineering teams.
- Strong ownership mindset and ability to operate in high-stakes production environments.
Responsibilities
- Design and build the core infrastructure layer powering AI agent systems in production environments.
- Develop and maintain high-performance backend services (primarily in Rust) for inference, orchestration, and execution workloads.
- Architect distributed systems capable of handling high throughput, low latency, and global-scale traffic.
- Build and improve ML infrastructure, including model deployment, monitoring, evaluation, and lifecycle management.
- Implement observability, reliability, and failure recovery mechanisms for critical agent-driven workflows.
- Optimize system performance across latency, cost, and scalability constraints.
- Collaborate closely with applied AI and infrastructure teams to productionize experimental systems.
- Contribute to architectural decisions shaping long-term platform evolution.
View Full Description & ApplyYou'll be redirected to the employer's site