Apply

Site Reliability Engineer

Posted 2024-11-12

View full description

πŸ’Ž Seniority level: Senior, 5+ years

πŸ“ Location: Mexico

πŸ” Industry: Digital engineering and modernization

🏒 Company: Encora

⏳ Experience: 5+ years

πŸͺ„ Skills: KubernetesJiraAzureGrafanaPrometheusRelease ManagementAnalytical SkillsCollaborationDevOpsTerraform

Requirements:
  • Proficiency in modern monitoring tools, project tracking, and version management.
  • Experience with Infrastructure as Code (IaC) tools and release management tooling.
  • Familiarity with incident alert tools and container orchestration platforms.
  • Proven experience managing applications in production environments.
  • Strong understanding of incident management and root cause analysis processes.
  • Ability to streamline processes, especially in change and release management.
  • Technologies include Azure Monitoring, App Insights, Prometheus, Grafana, JIRA, SVN, GitHub, Terraform, ARM/Bicep, Pulumi, ArgoCD, Harness, Octopus, PagerDuty, Opsgenie, Kubernetes, AKS.
Responsibilities:
  • Continuously monitor applications using automated tools to ensure optimal reliability.
  • Act promptly in production environments to resolve reliability issues.
  • Conduct thorough root cause analysis during ongoing incidents.
  • Oversee change management and release processes to ensure smooth deployments.
  • Partner with development teams to address system-related issues through automation.
  • Ensure systems are reliable and scalable with high-performance standards.
Apply

Related Jobs

Apply

πŸ“ LATAM

🧭 Full-Time

πŸ’Έ 41000 - 70000 USD per year

πŸ” Remote employment services

🏒 Company: RemoteπŸ‘₯ 1001-5000πŸ’° $300.0m Series C on 2022-04-05πŸ«‚ on 2022-07-08Human Resources Services

  • Knowledge and experience in Kubernetes, AWS (or similar Cloud Provider) and Terraform.
  • Knowledge of CI/CD tools (GitLab, Github, Jenkins or similar).
  • Experience with at least 1 back-end programming language (Elixir, Clojure, Java, Node.js, Python, etc.).
  • Experience with Bash or Python Scripting.
  • Excellent communication and interpersonal skills.
  • Ability to work independently and self-guidedness.
  • Curiosity and willingness to learn and develop.
  • Holistic debugging skills.
  • Security knowledge and capabilities from a defensive and offensive standpoint.

  • Managing and improving our existing infrastructure.
  • Helping us build the next generation of our platform: using tools like Kubernetes, Terraform and Docker.
  • Streamlining and automating our deployment processes.
  • Work closely with our Security team to keep on top of potential threats/patches.
  • Support our engineers and product teams to improve overall scalability, stability and reliability.

AWSDockerNode.jsPythonBashJavaKubernetesPostgresCI/CDTerraform

Posted 2024-12-03
Apply
Apply

πŸ“ Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Guyana, Mexico, Nicaragua, Panama, Paraguay, Peru, Suriname, Uruguay, Venezuela

🧭 Employee

πŸ’Έ 41500 - 70000 USD per year

πŸ” Employment solutions for remote organizations

🏒 Company: Remote - Referral Board

  • Significant and demonstrated experience as a Site Reliability Engineer, including architecting, implementing, and maintaining a platform.
  • Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
  • Knowledge of CI/CD tools, preferably GitLab CI.
  • Experience with at least one back-end programming language (Elixir, Clojure, Java, Node.js, Python, etc.).
  • Experience with one programming language for developing SRE tooling (Go, Python).
  • Excellent communication and interpersonal skills.
  • Ability to work independently and self-guided.
  • Curiosity and willingness to learn and develop.

  • Managing and improving existing infrastructure.
  • Helping build the next generation of the platform using tools like Kubernetes, Terraform, and Docker.
  • Streamlining and automating deployment processes.
  • Collaborating closely with the Security team to address potential threats.
  • Supporting engineers and product teams to enhance scalability, stability, and reliability.

AWSDockerNode.jsPythonJavaKubernetesGoCI/CDTerraform

Posted 2024-11-02
Apply
Apply

πŸ“ Americas

🧭 Full-Time

πŸ” Open source technology

🏒 Company: CanonicalπŸ‘₯ 1001-5000πŸ’° $12.8m Crowdfunding on 2013-08-22Internet of ThingsOpen SourceCloud ComputingLinuxSoftware

  • Software Engineering or Computer Science degree.
  • Linux experience and familiarity with Linux networking and storage.
  • Python software development experience.
  • Demonstrated drive for continual learning.
  • DevOps experience.

  • Bring Python software-engineering skills to the operations domain.
  • Architect and run OpenStack, Kubernetes, and software-defined storage.
  • Enable devsecops for applications running on the managed infrastructure.
  • Work in high-pressure operations environment with mission-critical services.

PythonSoftware DevelopmentKubernetesServerless

Posted 2024-08-07
Apply