Site Reliability Engineer

deepset | AI Platform
Berlin, Barcelona | Full-Time | Middle
Salary not disclosed

Job Details

Experience
2-5 years
Required Skills
AWS, Kubernetes, CI/CD, Terraform, Distributed Systems

Requirements

  • 2-5 years of experience working with large-scale production infrastructure.
  • Experience with distributed or service-oriented architectures.
  • Hands-on expertise with AWS.
  • Hands-on expertise with Kubernetes.
  • Hands-on expertise with CI/CD and GitOps (e.g., ArgoCD).
  • Working knowledge of Infrastructure as Code (Terraform preferred).
  • Solid troubleshooting skills across complex, multi-layered systems.
  • Pragmatic mindset balancing speed, simplicity, and reliability.
  • Strong sense of ownership and accountability for end-to-end systems.
  • Ability to work independently and align with team goals.

Responsibilities

  • Design, configure, and evolve infrastructure across SaaS, private cloud, and on-prem environments.
  • Deliver a production-grade, self-hosted platform compatible with Kubernetes.
  • Improve CI/CD pipelines, GitHub workflows, and GitOps setups to increase shipping speed.
  • Simplify systems and optimize infrastructure costs without compromising reliability.
  • Champion best practices in reliability, scalability, and security across the organization.