Senior DevOps Engineer – Cloud Infrastructure
New
Based in Brazil ... flexibility to work from anywhere in Latin AmericaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years
- Required Skills
- AWSPythonBashKubernetesGrafanaCI/CDTerraformGitLab
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering, or infrastructure-focused roles.
- Strong hands-on experience with AWS in production-grade, multi-region environments.
- Advanced expertise with Terraform and Infrastructure as Code practices.
- Deep knowledge of Kubernetes (EKS), container orchestration, and cloud-native architectures.
- Proven experience designing and maintaining CI/CD pipelines in modern DevOps environments.
- Strong understanding of observability practices, including monitoring, logging, and alerting systems.
- Experience with GitLab or similar DevOps platforms for repository management and automation workflows.
- Solid knowledge of networking concepts, IAM, VPC design, and cloud security best practices.
- Experience working in regulated environments or industries with compliance requirements.
- Proficiency in scripting languages such as Python, Bash, or similar.
- Strong incident management experience with the ability to troubleshoot and resolve production issues under pressure.
- Excellent communication skills and ability to collaborate effectively across engineering, security, and product teams.
Responsibilities
- Design, build, and maintain scalable AWS cloud infrastructure using Infrastructure as Code (Terraform), ensuring high availability, security, and maintainability across multi-region environments.
- Manage and optimize Kubernetes (EKS) clusters, supporting containerized workloads with strong focus on resilience, performance, and autoscaling strategies.
- Develop, maintain, and improve CI/CD pipelines to enable fast, secure, and reliable software delivery across engineering teams.
- Administer and evolve core AWS services including networking, compute, storage, IAM, and databases (RDS), ensuring robust and compliant cloud architecture.
- Implement observability solutions using tools such as Grafana and CloudWatch, including monitoring, alerting, and incident detection systems.
- Lead incident response, root cause analysis, and post-incident improvements to ensure long-term system stability and reliability.
- Collaborate with cross-functional teams to support compliance requirements (e.g., SOC 2, GDPR) and maintain strong security posture across infrastructure.
- Contribute to architecture design, disaster recovery planning, capacity forecasting, and cost optimization initiatives.
- Maintain technical documentation, infrastructure standards, and operational runbooks to support scalable engineering practices.
- Mentor junior engineers and actively contribute to building a strong DevOps and SRE culture.
View Full Description & ApplyYou'll be redirected to the employer's site