Senior Site Reliability Engineer (DevTools)
New
Based in GermanyFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- PythonJavaJVMKotlinRubyGoGitLab
Requirements
- Proven experience combining Site Reliability Engineering and Software Engineering responsibilities in production environments.
- Strong programming skills and hands-on development experience with languages such as Java, Kotlin, Go, Python, Ruby, or similar.
- Solid understanding of Unix/Linux operating systems, system internals, and infrastructure troubleshooting.
- Strong knowledge of JVM-based applications, performance optimization, and operational best practices.
- Experience designing, operating, and improving highly available and scalable systems.
- Passion for enhancing user experience through engineering excellence and continuous improvement.
- Ability to adapt quickly, solve complex technical problems, and perform effectively in fast-changing environments.
- Strong analytical thinking, troubleshooting capabilities, and attention to detail.
- Excellent communication and collaboration skills within cross-functional engineering teams.
- Experience in Platform Engineering, developer platforms, or internal tooling environments is highly valued.
- Familiarity with version control systems, CI/CD platforms, and build infrastructure such as GitLab, TeamCity, or equivalent solutions is advantageous.
- Experience with Spring Framework, Java-based monolithic applications, or large-scale enterprise systems is considered a plus.
Responsibilities
- Design, operate, and continuously improve large-scale developer infrastructure and internal tooling platforms.
- Build and maintain reliable, fault-tolerant, and self-healing systems that ensure high availability and performance.
- Analyze user feedback, identify pain points, and implement solutions that enhance developer experience and productivity.
- Optimize system performance, reduce operational friction, and improve the efficiency of development workflows.
- Develop, customize, and extend both open-source and commercial tools to better meet organizational needs.
- Contribute to software development initiatives across multiple programming languages and technology stacks.
- Monitor platform health, troubleshoot incidents, and implement preventive measures to improve reliability.
- Collaborate with engineering teams to define meaningful operational metrics and validate improvements through measurable outcomes.
- Support users by resolving technical issues, providing guidance, and ensuring platform stability.
- Explore and integrate emerging technologies, including AI-assisted workflows and developer productivity solutions.
View Full Description & ApplyYou'll be redirected to the employer's site