Apply

Senior Site Reliability Engineer

Posted 5 months agoViewed

View full description

📍 Location: Spain

💸 Salary: $72,000 - $99,000 per year

🔍 Industry: Mobility

🗣️ Languages: English

🪄 Skills: AWSAWS EKSKubernetes

Requirements:
  • Think Unix, you know the networking stack, the OSI model, containers (and schedulers), and you know your way around monitoring, logging and the CAP theorem (bonus!).
  • Have strong programming skills in at least one language, and know your way around a few more or can learn them if the opportunity arises.
  • Automate yourself out of everything by nature, making machines do the toil.
  • Communicate effectively and asynchronously.
  • Care about the things that affect the company, your team, and yourself.
  • Embrace diversity and humbleness (and a bit of trolling).
  • Prefer taking iterative action over waiting for things to happen or to be perfect.
  • Strongly favor simplicity over complexity. Ie, adhering to the KISS principle.
  • Have a sense for identifying, exploiting and elevating bottlenecks.
  • Are not afraid of expressing yourself in English. We aren't expecting you to have the Queen's accent, but you'll be part of an international team and we communicate in English, so you should be comfortable with that.
  • Enjoy herding cats and shaving yaks. Ie, being a great influence to other product teams and teaching them best practices. As well as analyzing and simplifying our setup.
Responsibilities:
  • Evolving our infrastructure platform building self-service components that will be used by all the engineering team and by millions of users around the world.
  • Working closely with our Product and Infrastructure teams to architecture and develop world-class infrastructure components.
  • Designing and implementing tooling to improve the availability, scalability, observability and latency of our services, which are used by internal customers to deploy and operate their services.
  • Increasing reliability awareness with other teams, helping with the adoption of reliability principles and reviewing observability implementations or software architectures.
  • Defining SLIs, SLOs and SLAs as part of the services' lifecycle.
  • Sharing an on-call schedule for the platform services you own.
  • Solving problems in our highly available platform together with other teams, then build automations to prevent incidents from happening again.
  • Participating in our recruiting process to help grow our engineering team.
Apply

Related Jobs

Apply

📍 Spain

🧭 Full-Time

💸 72000.0 EUR per year

🔍 Mobility and transportation

  • Think Unix, you know the networking stack, the OSI model, containers (and schedulers), and you know your way around monitoring, logging and the CAP theorem (bonus!).
  • Have strong programming skills in at least one language, and know your way around a few more or can learn them if the opportunity arises.
  • Automate yourself out of everything by nature, making machines do the toil.
  • Communicate effectively and asynchronously.
  • Care about the things that affect the company, your team, and yourself.
  • Embrace diversity and humbleness (and a bit of trolling).
  • Prefer taking iterative action over waiting for things to happen or to be perfect.
  • Strongly favor simplicity over complexity. Ie, adhering to the KISS principle.
  • Have a sense for identifying, exploiting and elevating bottlenecks.
  • Are not afraid of expressing yourself in English.
  • Evolving our infrastructure platform building self-service components that will be used by all the engineering team and by millions of users around the world.
  • Working closely with our Product and Infrastructure teams to architecture and develop world-class infrastructure components.
  • Designing and implementing tooling to improve the availability, scalability, observability and latency of our services, which are used by internal customers to deploy and operate their services.
  • Increasing reliability awareness with other teams, helping with the adoption of reliability principles and reviewing observability implementations or software architectures.
  • Defining SLIs, SLOs and SLAs as part of the services' lifecycle.
  • Sharing an on-call schedule for the platform services you own.
  • Solving problems in our highly available platform together with other teams, then build automations to prevent incidents from happening again.
  • Participating in our recruiting process to help grow our engineering team.

DockerPythonAWS EKSGitJavascriptKubernetesRubyGoGrafanaPrometheusCI/CDLinuxMicroservices

Posted 14 days ago
Apply
Apply

📍 Spain

🧭 Full-Time

🔍 Mobility services

🏢 Company: Cabify👥 1001-5000💰 $16,473,668 Debt Financing about 1 year agoInternetLogisticsRide SharingTransportationMobile

  • Strong knowledge of Unix, networking stack, OSI model, containers, and monitoring.
  • Programming skills in at least one language; capability to learn others.
  • Natural tendency to automate tasks.
  • Effective and asynchronous communication skills.
  • Care for the company, team, and self.
  • Embrace diversity and humility.
  • Action-oriented and iterative problem solving.
  • Preference for simplicity over complexity.
  • Ability to identify and address bottlenecks.
  • Proficiency in English communication.
  • Evolving our infrastructure platform building self-service components.
  • Working closely with Product and Infrastructure teams to develop infrastructure components.
  • Designing and implementing tooling for service availability, scalability, observability, and latency improvements.
  • Increasing reliability awareness with teams and reviewing implementations.
  • Defining SLIs, SLOs and SLAs as part of services' lifecycle.
  • Sharing an on-call schedule for owned platform services.
  • Solving problems in a highly available platform and building automations to prevent incidents.
  • Participating in the recruiting process to grow the engineering team.

AWSAWS EKSKubernetesMicroservicesNetworking

Posted 2 months ago
Apply