Senior Infrastructure Engineer - Streaming, Caching, DBaaS
New
S
SentinelOneCybersecurity
Hybrid work in Prague (Karlin), Brno (Clubco) or remote in CZ/SK.Full-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ of experience
- Required Skills
- AWSPythonGCPKafkaKubernetesAzureGoRedisTerraform
Requirements
- 5+ of experience in infrastructure, platform, or DevOps engineering with a proven track record of operating distributed systems at scale.
- Hands-on experience with self-hosted Kafka and/or Redis running in Kubernetes, including performance tuning, scaling, and operator-based lifecycle management.
- Strong understanding of Kubernetes internals and best practices for managing both stateless and stateful workloads in production or air-gapped deployments.
- Multi-cloud experience with production expertise in at least one major provider (AWS, GCP, or Azure).
- Solid experience with Infrastructure as Code (IaC) and GitOps practices using tools like Terraform, ArgoCD, and CI/CD workflow automation (GitHub Actions).
- Strong scripting or development skills in at least one mainstream language like Python, Go, or Ruby.
Responsibilities
- Lead the design, operation, and end-to-end platform experience of mission-critical distributed data services, including Kafka or Redis, across Kubernetes clusters and multi-cloud environments.
- Unlock complete cloud portability for SentinelOne apps and services by building a highly automated, self-service infrastructure across AWS, GCP, and air-gapped on-prem environments.
- Manage and scale data infrastructure supporting petabytes/day ingestion to ensure low-latency, high-throughput, and cost-effective operations.
- Consolidate and optimize multi-tenant Kafka clusters to reduce cost, improve resilience, and streamline operations.
- Drive lifecycle automation for distributed data services using GitOps principles and tools like GitHub Actions, ArgoCD, and Terraform to minimize operational toil and pager burden.
- Define and implement standards for observability, high availability (HA), backup, and disaster recovery (DR) of workloads in Kubernetes.
- Partner with FinOps and engineering stakeholders to continuously optimize performance, cost, and operational overhead across data platform components.
View Full Description & ApplyYou'll be redirected to the employer's site