Senior Software Engineer - Reliability

Posted about 2 months agoViewed
131325 - 201000 USD per year
United StatesFull-TimeSoftware Development
Company:Freenome
Location:United States
Languages:English
Seniority level:Senior, 5+ years
Experience:5+ years
Skills:
PythonCloud ComputingKubernetesGoGrafanaPrometheusDevOpsTerraformMentoringComplianceSoftware Engineering
Requirements:
Bachelor’s degree in Computer Science, Engineering, or equivalent experience 5+ years in software engineering or Infra/DevOps/SRE roles (Python or Go preferred) Experience deploying cloud infrastructure via automation (e.g. Terraform, Pulumi, Bicep/ARM) Incident management experience in cloud/software engineering Hands-on experience operating production workloads in cloud environments Familiarity with Kubernetes (AKS, GKE, or EKS) Strong troubleshooting and root-cause analysis skills in distributed systems Experience with observability platforms (e.g., DataDog, Prometheus/Grafana, OpenTelemetry) Ability to define and implement metrics, dashboards, and alerting
Responsibilities:
Define and implement observability practices for production systems Develop and maintain incident response playbooks and escalation procedures Define SLIs/SLOs and establish error budgets Automate operational tasks Contribute to production systems and designs Use Infrastructure as Code (IaC) to manage infrastructure Build out the SRE practice
Similar Jobs:
Posted 2 days ago
United StatesFull-TimeSoftware Development
Senior Full Stack Engineer
Company:Five9
Posted 2 days ago
United States, CanadaFull-TimeHealthcare Technology
AI Solutions Engineer (Remote Opportunity)
Company:VetsEZ
Posted 2 days ago
United StatesFull-TimeMental Health
Senior Data Engineer