Senior DevOps Engineer
Software Vison AI Ltd
Cloud Infrastructure
Remote, within LATAM time zones (GMT-3 to GMT-5), with meetings that overlap U.S. time zones
Contract
Senior
Salary not disclosed
Job Details
- Experience: 5+ years
- Required Skills: AWS, Docker, Python, Bash, GCP, Kubernetes, Azure, CI/CD, Linux, Terraform
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering (SRE), Cloud Infrastructure Engineering, or related roles.
- Strong hands-on experience with at least one major cloud platform: AWS, Azure, or GCP.
- Experience with Infrastructure as Code tools such as Terraform, CloudFormation, or Pulumi.
- Strong experience building and maintaining CI/CD pipelines using tools like GitHub Actions, GitLab CI, Jenkins, or CircleCI.
- Experience with containerization and orchestration tools such as Docker and Kubernetes.
- Strong knowledge of observability and monitoring tools such as Datadog, Prometheus, Grafana, CloudWatch, or ELK Stack.
- Solid Linux systems administration, networking, and cloud security knowledge.
- Strong hands-on experience with scripting and automation using Python (preferred), Bash, or similar languages.
- Experience developing internal automation tools and infrastructure workflows through code.
- Familiarity with Git workflows and modern DevOps best practices.
Responsibilities
- Design, implement, and optimize scalable cloud infrastructure and deployment solutions.
- Build, maintain, and improve CI/CD pipelines to enable fast, reliable software delivery.
- Manage cloud environments using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
- Automate infrastructure provisioning, configuration management, and deployment processes.
- Develop and maintain automation scripts and internal tooling using Python, Bash, or similar scripting languages.
- Implement and maintain monitoring, logging, and alerting systems.
- Collaborate with engineering teams to improve application scalability, performance, and resiliency.
- Support and troubleshoot production and development environments.
- Ensure infrastructure follows security best practices, operational standards, and compliance requirements.
- Participate in disaster recovery planning, backup strategies, and incident response initiatives.