Apply

Site Reliability Engineer (Remote - Czechia)

Posted 5 days agoViewed

View full description

💎 Seniority level: Middle, 4+ years

📍 Location: Czechia

🏢 Company: Jobgether👥 11-50💰 $1,493,585 Seed over 2 years agoInternet

🗣️ Languages: English

⏳ Experience: 4+ years

🪄 Skills: AWSPythonElasticSearchKafkaKubernetesGoGrafanaPrometheusRedisCI/CDLinuxDevOpsTerraformNodeJS

Requirements:
  • 4+ years of experience in Site Reliability Engineering or a similar role
  • Strong expertise in AWS cloud services and scalable architecture design
  • Proficient in Infrastructure as Code (IaC) using Terraform and Terragrunt
  • Deep understanding of Kubernetes, including Helm, ArgoCD, and Istio
  • Hands-on experience with Kafka, Redis, and Confluent Cloud
  • Proficient in monitoring stacks (Prometheus, Grafana, Alert Manager)
  • Development experience with NodeJS, Python, or GoLang
  • Familiarity with OpenSearch, Elasticsearch, or Chaos Search
  • Excellent troubleshooting, problem-solving, and communication skills
  • Experience with agile teams and collaborative environments
Responsibilities:
  • Design, build, and maintain scalable infrastructure using Terraform and Terragrunt
  • Optimize and manage AWS environments for cost-efficiency, security, and availability
  • Administer and scale Kafka and Confluent Cloud for real-time data streaming
  • Deploy and maintain Redis to support caching and high-speed data processing
  • Implement monitoring and alerting with Prometheus, Grafana, Alert Manager, and OpsGenie
  • Manage Kubernetes clusters using Helm, ArgoCD, Istio, and Kustomize
  • Ensure safe and controlled deployments using LaunchDarkly for feature flagging
  • Collaborate with development teams to support feature integration and performance optimization
  • Drive continuous improvement by adopting emerging tools and best practices
Apply