Apply📍 Canada
🧭 Full-Time
🔍 Observability and data management
🏢 Company: Cribl👥 251-500💰 $150,000,000 Series D over 2 years agoReal TimeBig DataInformation TechnologySoftware
- Extensive experience with enterprise-scale continuous delivery environments.
- Development with JavaScript/Node.js/TypeScript in a Linux/Mac environment.
- Experience with Configuration Management Tools like Terraform (preferred) or Puppet, Chef, Ansible.
- Knowledge of cloud platforms (prefer AWS and Azure, GCP is nice to have) and container + orchestration technologies.
- Extensive experience designing and implementing Observability platforms based on OpenSource tools like Grafana, Prometheus, OpenSearch.
- Experience mentoring engineers and acting as Subject Matter Expert in areas of Monitoring and Observability.
- Experience with native monitoring services in AWS, Azure and other popular Cloud Platforms.
- Background in Linux Systems Engineering.
- Experience with Incident response tools, e.g., PagerDuty, FireHydrant.
- Experience with sustainable incident response in a blameless environment.
- Comfortable with a high level of autonomy and working with a distributed team.
- Engage with teams and improve service delivery and reliability across their entire lifecycle.
- Measure and monitor all production systems with an eye towards availability, latency, and overall system health.
- Design observability systems for different types of applications, using Cribl products and other OpenSource tools.
- Seek out the cause of errors and instability in production cloud services and drive teams towards better operational excellence.
- Engage with product and platform teams to evolve systems by lobbying for changes that improve reliability, resilience, and observability.
- Lead efforts enabling shift-left monitoring in the organization.
- Help identify and drive down toil with creative innovation and automation.
- On-call responsibilities.
AWSDockerNode.jsGCPJavascriptTypeScriptAzureGrafanaPrometheusLinuxTerraform
Posted about 1 month ago
Apply