Site Reliability Engineering (English required)

Posted 13 days ago

💎 Seniority level: Senior, 5+ years

📍 Location: Mexico, Colombia, Argentina, Uruguay

🔍 Industry: Software Development

🏢 Company: DaCodes

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWS, Docker, Python, Bash, Elasticsearch, Jenkins, Kubernetes, Ruby, Go, Grafana, Prometheus, CI/CD, Terraform, Networking, Ansible

Requirements:
  • 5+ years of experience in Site Reliability Engineering or similar roles.
  • Proficiency in cloud computing platforms like AWS, with advanced expertise in network infrastructure (load balancers, subnets, gateways, NAT, etc.).
  • Strong experience with containers and container orchestration (Docker, Kubernetes, ECS).
  • Advanced skills with CI/CD and infrastructure-as-code tools (Jenkins, ArgoCD, Terraform, CloudFormation).
  • Experience with monitoring tools such as Prometheus, Grafana, and Elasticsearch.
  • Proficient in scripting and development languages (Go, Python, Ruby, Bash).
  • Experience debugging systems and applications and maintaining high availability.
  • Strong problem-solving and troubleshooting abilities in cloud and on-prem environments.
  • In-depth understanding of networking (IPv4, IPv6, BGP, etc.).
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
Responsibilities:
  • Automate infrastructure management using tools such as Terraform, Ansible, and CloudFormation.
  • Develop and manage CI/CD pipelines using tools like Jenkins.
  • Architect and maintain scalable systems in data centers and cloud environments.
  • Manage containerized environments running on Kubernetes and ECS.
  • Automate routine tasks, optimize deployments, and ensure reliability of production systems.
  • Collaborate with cross-functional teams to improve performance, reliability, and scalability.
  • Analyze and debug issues, ensuring timely resolutions and minimal downtime.
  • Monitor applications, systems, and databases using tools like Prometheus, Grafana, and Elasticsearch.
  • Troubleshoot network issues and automate network configurations with pipeline tools.
  • Participate in technical discussions, bringing real-world solutions and contributing to architectural decisions.