ApplySenior Site Reliability Engineer (SRE) - Poland
Posted over 1 year ago
View full description
📍 Location: Poland
🔍 Industry: Observability software
🗣️ Languages: English
🪄 Skills: AWSBackend DevelopmentDockerNode.jsSoftware DevelopmentFrontend DevelopmentFull Stack DevelopmentJavaJavascriptTypeScriptC (Programming language)
Requirements:
Enterprise scale continuous delivery experience, javascript/node.js/typescript development experience, sustainable incident response knowledge, familiarity with cloud platforms and container+orchestration technologies, experience with apm and observability tools, linux systems engineering background, familiarity with incident response tools, ability to work autonomously and with a distributed team
Responsibilities:
Improve service delivery and reliability, monitor production systems, identify and resolve errors and instability, advocate for changes to improve reliability, automate processes, participate in on-call responsibilities
ApplyRelated Jobs
Apply📍 Poland
🔍 IT and Security
- Extensive experience with enterprise scale continuous delivery environments
- Development with JavaScript/Node.js/TypeScript in a Linux/Mac environment
- Experience with sustainable incident response in a blameless environment
- Experience with Configuration Management Tools like Terraform (preferred) or Puppet, Chef, Ansible
- Knowledge of cloud platforms (prefer AWS) and container + orchestration technologies
- Experience with APM and Observability and related tools such as, New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry etc.
- Background in Linux Systems Engineering
- Experience with Incident response related tools for instance, PagerDuty, FireHydrant, Blameless etc.
- Comfortable with a high level of autonomy and working with a distributed team
- Engage with teams and improve service delivery and reliability across their entire lifecycle
- Measure and monitor all production systems with an eye towards availability, latency and overall system health
- Seek out the cause of errors and instability in our production cloud services and drive teams towards better operational excellence
- Engage with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability
- Help Identify and drive down toil with creative innovation and automation
- On-call responsibilities
AWSNode.jsDesign PatternsJavascriptKibanaTypeScriptGrafanaPrometheusLinuxTerraform
Posted 14 days ago
Apply