Senior DevOps Engineer – Cloud Infrastructure

New

Based in Brazil ... flexibility to work from anywhere in Latin AmericaFull-TimeSenior

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

5+ years of experience in DevOps, Site Reliability Engineering, or infrastructure-focused roles.
Strong hands-on experience with AWS in production-grade, multi-region environments.
Advanced expertise with Terraform and Infrastructure as Code practices.
Deep knowledge of Kubernetes (EKS), container orchestration, and cloud-native architectures.
Proven experience designing and maintaining CI/CD pipelines in modern DevOps environments.
Strong understanding of observability practices, including monitoring, logging, and alerting systems.
Experience with GitLab or similar DevOps platforms for repository management and automation workflows.
Solid knowledge of networking concepts, IAM, VPC design, and cloud security best practices.
Experience working in regulated environments or industries with compliance requirements.
Proficiency in scripting languages such as Python, Bash, or similar.
Strong incident management experience with the ability to troubleshoot and resolve production issues under pressure.
Excellent communication skills and ability to collaborate effectively across engineering, security, and product teams.

Design, build, and maintain scalable AWS cloud infrastructure using Infrastructure as Code (Terraform), ensuring high availability, security, and maintainability across multi-region environments.
Manage and optimize Kubernetes (EKS) clusters, supporting containerized workloads with strong focus on resilience, performance, and autoscaling strategies.
Develop, maintain, and improve CI/CD pipelines to enable fast, secure, and reliable software delivery across engineering teams.
Administer and evolve core AWS services including networking, compute, storage, IAM, and databases (RDS), ensuring robust and compliant cloud architecture.
Implement observability solutions using tools such as Grafana and CloudWatch, including monitoring, alerting, and incident detection systems.
Lead incident response, root cause analysis, and post-incident improvements to ensure long-term system stability and reliability.
Collaborate with cross-functional teams to support compliance requirements (e.g., SOC 2, GDPR) and maintain strong security posture across infrastructure.
Contribute to architecture design, disaster recovery planning, capacity forecasting, and cost optimization initiatives.
Maintain technical documentation, infrastructure standards, and operational runbooks to support scalable engineering practices.
Mentor junior engineers and actively contribute to building a strong DevOps and SRE culture.

View Full Description & ApplyYou'll be redirected to the employer's site