Senior Software Engineer - Grafana Databases, Managed Services
This is a remote opportunity. At this time, we are only considering applicants living in Ireland time zones.
Full-Time, Senior
Salary: 104,000 - 124,800 EUR per year
Job Details
- Experience: 6+ years of engineering experience
- Required Skills: AWS, GCP, Kafka, Kubernetes, Snowflake, Azure, Cassandra, ClickHouse, Go, Postgres, Terraform, Networking, Helm, Distributed Systems
Requirements
- 6+ years of engineering experience, including meaningful time in SRE, platform engineering, production engineering, infrastructure engineering, or distributed systems roles.
- Experience operating distributed systems in production (e.g., streaming systems, analytical databases, large-scale storage backends).
- Examples of these include Kafka, Redpanda, WarpStream, Postgres, ClickHouse, Snowflake, or Cassandra.
- Strong Kubernetes experience in AWS, GCP, or Azure, and familiarity with infrastructure-as-code tooling (Helm, Terraform, Jsonnet, etc.).
- Solid understanding of distributed systems design and large-scale system trade-offs.
- Proficiency in at least one programming language (Go preferred, but not required).
- Working knowledge of Linux internals, networking, cloud storage, and performance/scaling behavior.
- Experience participating in blameless incident response and writing high-quality post-incident reviews.
- Clear communicator who can collaborate across teams and work autonomously.
Responsibilities
- Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure
- Diagnosing and eliminating cross-layer failure modes (e.g., object storage latency, noisy neighbors, control-plane bottlenecks, query performance regressions)
- Designing safe upgrade and rollout strategies at scale
- Improving observability, automation, and operational ergonomics
- Partnering closely with database and platform teams to ensure safe scaling, partitioning, consumer fan-out, and query performance
- Working directly with distributed systems behavior, Kubernetes scheduling dynamics, storage engines, compression trade-offs, etc.
- Serving as a primary escalation point and on-call for relevant incidents
- Owning the relationship with all system vendors, including WarpStream Labs