Senior Site Reliability Engineer (DevTools)

Based in Germany, Full-Time, Senior
Salary not disclosed

Job Details

Required Skills
Python, Java, JVM, Kotlin, Ruby, Go, GitLab

Requirements

  • Proven experience combining Site Reliability Engineering and Software Engineering responsibilities in production environments.
  • Strong programming skills and hands-on development experience with languages such as Java, Kotlin, Go, Python, Ruby, or similar.
  • Solid understanding of Unix/Linux operating systems, system internals, and infrastructure troubleshooting.
  • Strong knowledge of JVM-based applications, performance optimization, and operational best practices.
  • Experience designing, operating, and improving highly available and scalable systems.
  • Passion for enhancing user experience through engineering excellence and continuous improvement.
  • Ability to adapt quickly, solve complex technical problems, and perform effectively in fast-changing environments.
  • Strong analytical thinking, troubleshooting capabilities, and attention to detail.
  • Excellent communication and collaboration skills within cross-functional engineering teams.
  • Experience in Platform Engineering, developer platforms, or internal tooling environments is highly valued.
  • Familiarity with version control systems, CI/CD platforms, and build infrastructure such as GitLab, TeamCity, or equivalent solutions is advantageous.
  • Experience with Spring Framework, Java-based monolithic applications, or large-scale enterprise systems is considered a plus.

Responsibilities

  • Design, operate, and continuously improve large-scale developer infrastructure and internal tooling platforms.
  • Build and maintain reliable, fault-tolerant, and self-healing systems that ensure high availability and performance.
  • Analyze user feedback, identify pain points, and implement solutions that enhance developer experience and productivity.
  • Optimize system performance, reduce operational friction, and improve the efficiency of development workflows.
  • Develop, customize, and extend both open-source and commercial tools to better meet organizational needs.
  • Contribute to software development initiatives across multiple programming languages and technology stacks.
  • Monitor platform health, troubleshoot incidents, and implement preventive measures to improve reliability.
  • Collaborate with engineering teams to define meaningful operational metrics and validate improvements through measurable outcomes.
  • Support users by resolving technical issues, providing guidance, and ensuring platform stability.
  • Explore and integrate emerging technologies, including AI-assisted workflows and developer productivity solutions.