Staff Engineer (Engineering Observability)

New
AustraliaFull-TimeStaff
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
DatadogDistributed Systems

Requirements

  • Extensive experience in observability engineering, platform engineering, or similar infrastructure-focused roles in large-scale environments.
  • Strong expertise in SLIs/SLOs, telemetry design, distributed systems observability, and reliability engineering principles.
  • Hands-on experience designing and implementing observability pipelines using tools such as OpenTelemetry and modern telemetry stacks.
  • Experience with observability platforms such as Datadog (or equivalent tools).
  • Proven ability to design scalable architectures for logs, metrics, traces, and event-driven monitoring systems.
  • Strong influence and stakeholder management skills, with the ability to drive alignment across multiple engineering teams.
  • Passion for building engineering standards, frameworks, and reusable solutions that improve developer productivity.
  • Excellent communication skills and the ability to translate complex technical challenges into clear, actionable guidance.

Responsibilities

  • Define and drive the organisation’s observability strategy, aligning technical initiatives with key customer journeys and measurable reliability outcomes.
  • Design and evolve end-to-end observability architectures, including telemetry pipelines, instrumentation frameworks, and OpenTelemetry-based solutions.
  • Establish and enforce best practices for metrics, logs, traces, tagging, alerting, and incident response across engineering teams.
  • Develop SLI/SLO frameworks to improve service reliability, visibility, and operational decision-making.
  • Build scalable “paved roads,” reusable patterns, and tooling to simplify adoption of observability practices.
  • Partner with engineering, product, and platform teams to integrate observability into the full software development lifecycle.
  • Improve signal quality, alerting systems, and incident response workflows to accelerate detection and resolution of issues.
  • Provide technical leadership and mentorship, influencing engineering practices across multiple teams without formal authority.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now