Senior Site Reliability Engineer

New
Anywhere in Brazil - RemoteFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Languages
English
Experience
Over 5 years
Required Skills
AWSDockerPythonBashKubernetesCI/CDTerraformAnsible

Requirements

  • Over 5 years of experience in Cloud Computing, SRE/DevOps
  • Proficient in English communication (both written and spoken)
  • Detail-oriented with high initiative and self-motivation
  • Strong understanding of software engineering principles
  • In-depth knowledge of modern networking and operating systems
  • Proficiency in AWS, cloud environments, containers, Kubernetes, Docker, and DevOps engineering
  • Experience managing tests and CI/CD pipelines
  • Familiarity with automation tools and provisioners like Terraform, Ansible, or Chef
  • Solid troubleshooting and system engineering experience in UNIX/Linux production environments
  • Experience with monitoring, alerting, and incident management
  • Proficiency in automating tasks with scripting languages like Python, Bash, etc.

Responsibilities

  • Collaborate with and support our creative, tight-knit development team
  • Design, deploy, and operate Loadsmart's critical systems while balancing reliability, cost, and agility
  • Play a key role in driving reliability projects with engineering teams
  • Collect metrics and understand their business impact
  • Perform troubleshooting and root-cause analysis of system operation issues
  • Be accountable for the platform's Service Level Agreements and Objectives
  • Provide infrastructure support during off-hours as needed
  • Take ownership of software infrastructure projects
  • Seek, give, and receive constructive feedback through code and specification reviews
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now