Site Reliability Engineer
deepset | AI Platform
Berlin, Barcelona | Full-Time | Middle
Salary not disclosed
Job Details
- Experience: 2-5 years
- Required Skills: AWS, Kubernetes, CI/CD, Terraform, Distributed Systems
Requirements
- 2-5 years of experience working with large-scale production infrastructure.
- Experience with distributed or service-oriented architectures.
- Hands-on expertise with AWS.
- Hands-on expertise with Kubernetes.
- Hands-on expertise with CI/CD and GitOps (e.g., ArgoCD).
- Working knowledge of Infrastructure as Code (Terraform preferred).
- Solid troubleshooting skills across complex, multi-layered systems.
- Pragmatic mindset balancing speed, simplicity, and reliability.
- Strong sense of ownership and accountability for end-to-end systems.
- Ability to work independently and align with team goals.
Responsibilities
- Design, configure, and evolve infrastructure across SaaS, private cloud, and on-prem environments.
- Deliver a production-grade, self-hosted platform compatible with Kubernetes.
- Improve CI/CD pipelines, GitHub workflows, and GitOps setups to increase shipping speed.
- Simplify systems and optimize infrastructure costs without compromising reliability.
- Champion best practices in reliability, scalability, and security across the organization.