Staff Platform Engineer - WunderGraph Cosmo Platform
EMEA, European (CET) business hoursFull-TimeStaff
Salary100000 - 130000 EUR per year
Apply NowOpens the employer's application page
Job Details
- Required Skills
- AWSGCPKubernetesAzureClickhouseGoGrafanaPrometheusCI/CDTerraform
Requirements
- Proven experience architecting and operating scalable, highly available, and secure cloud-native platforms in production.
- Strong proficiency in Go.
- Deep expertise in Kubernetes.
- Thrive in the dynamic environment of a scaling, remote-first company.
- Deep expertise in a major cloud provider (AWS, GCP, Azure).
- Proficiency with Infrastructure as Code tools (e.g., Terraform, Pulumi).
- Strong understanding of system architecture and distributed systems.
- Understanding of the challenges of running high-performance API gateways.
- Familiarity with GraphQL Federation is a significant plus.
- Experience building or managing modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ClickHouse).
- A self-starter attitude and a leader’s mindset, comfortable with ambiguity.
- Excellent written and verbal communication skills, with ability to articulate complex technical concepts clearly.
Responsibilities
- Ensure reliability, performance, and scalability of the WunderGraph Cosmo platform by defining and meeting stringent SLOs.
- Architect internal systems for scale and build/operate key product infrastructure including the customer-facing telemetry pipeline and AI pipeline.
- Enable engineering teams to ship features for WunderGraph Cosmo fast, reliably, and with confidence through a world-class Internal Developer Platform (IDP).
- Take full ownership of core platform infrastructure and services, from architecture to operation.
- Drive the architectural vision for the platform, making key decisions on technologies like Kubernetes, Infrastructure as Code, and observability stack.
- Level up the entire team through mentorship, architectural guidance, and championing best practices.
- Architect, build, and operate the core cloud-native infrastructure for WunderGraph Cosmo and Hub, primarily using Go and Kubernetes.
- Own and evolve the observability stack (OpenTelemetry, Prometheus, ClickHouse) and infrastructure supporting AI-driven features.
- Build and optimize CI/CD pipelines to improve build times, automate quality and security gates, and create a seamless path to production.
- Champion and implement Infrastructure as Code (IaC) best practices using tools like Terraform, building reusable and maintainable modules.
- Embed security best practices into the platform by designing and implementing network policies, RBAC, and automated checks to meet enterprise and SOC 2 compliance standards.
- Mentor other engineers, provide insightful code and design reviews, and document platform features and architectural decisions.
View Full Description & ApplyYou'll be redirected to the employer's site