ApplySenior Site Reliability Engineer - (Remote - Europe)
Posted about 12 hours agoViewed
View full description
💎 Seniority level: Senior, 5+ years
📍 Location: Germany, Spain, Portugal
🏢 Company: Jobgether👥 11-50💰 $1,493,585 Seed about 2 years agoInternet
🗣️ Languages: English
⏳ Experience: 5+ years
🪄 Skills: AWSPythonJavaKubernetesGoCI/CDRESTful APIsLinuxTerraformScripting
Requirements:
- 5+ years of experience in a Site Reliability Engineer or similar role.
- 3+ years of experience with AWS services and container orchestration tools.
- 2+ years of Kubernetes experience.
- Strong knowledge of observability tools and principles (monitoring, logging, tracing).
- Hands-on experience with Terraform for infrastructure as code.
- Proficiency in at least one programming language (e.g., Python, Go, Java).
- Experience in incident management, postmortem analysis, and risk mitigation.
- Familiarity with messaging systems like SNS, SQS, and experience with CI/CD tools.
Responsibilities:
- Develop and maintain systems that are reliable, scalable, and efficient.
- Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure optimal system performance.
- Conduct blameless post-incident reviews, identify root causes, and implement preventive actions.
- Automate operational tasks, incident responses, and contribute to system performance optimizations.
- Work with engineering teams to ensure systems are designed for reliability, scalability, and maintainability.
- Continuously evaluate and improve system performance, capacity, and cost efficiency.
- Participate in the on-call rotation, providing troubleshooting and resolution support for critical issues.
Apply