Staff Software Engineer - Platform, SysEng
Grafana Labs, Observability Software
United States (Remote), EST + CST highly preferred | Full-Time | Staff
Salary: USD 174,986 - USD 209,983
Job Details
- Required Skills: Python, Kubernetes, Go, Microservices, Distributed Systems
Requirements
- Proven delivery of large distributed systems with clear evidence of technical leadership.
- Deep understanding of system design tradeoffs: latency, consistency, availability, scaling, and cost.
- Hands-on experience with cloud-native architectures including microservices, containers, and Kubernetes.
- Experience with infrastructure as code (IaC) and operational practices that keep production systems healthy.
- Skilled in Go; Python, C, C++, or Rust experience also valued.
- Comfortable defining and owning reliability metrics such as SLOs and SLIs.
- Practical experience using AI-powered developer tools to accelerate development workflows.
- Ability to influence cross-functional stakeholders and drive outcomes in a remote environment.
- Excellent written and verbal communication skills.
- Experience in open source or community-based projects is a plus.
- Familiarity with Kubernetes scheduling, Karpenter, Terraform, Crossplane, Tanka, or Jsonnet is a bonus.
Responsibilities
- Scale platform infrastructure to handle hundreds of millions of metrics, logs, and traces per second.
- Manage and improve Kubernetes clusters to support application engineers.
- Reduce regional build timelines to meet customer demands.
- Define and drive SLOs/SLIs and capacity planning for production services.
- Participate in on-call rotations to maintain system health.
- Lead technical designs and write design documentation.
- Integrate and leverage internal product offerings in day-to-day operations.