Principal Site Reliability Engineer - Remote

Posted 7 months agoInactiveViewed
150000.0 - 160000.0 USD per year
United States of AmericaFull-TimeSoftware Development
Company:external-northamerica
Location:United States of America
Languages:English
Seniority level:Principal, 5 years in the SRE field, with a proven track record of progressively increasing responsibilities
Experience:5 years in the SRE field, with a proven track record of progressively increasing responsibilities
Skills:
AWSPythonBashCloud ComputingKubernetesAzureCI/CDDevOpsTerraformAnsibleScriptingSoftware Engineering
Requirements:
Bachelor’s degree in Computer Science, Engineering, or related field A minimum of 10 years of experience, including at least 5 years in the SRE field, with a proven track record of progressively increasing responsibilities Strong understanding and experience in automation tools and programming/scripting languages (e.g., PowerShell, Python, Bash) Strong understanding of Observability tools (e.g., Dynatrace, Datadog, New Relic etc.) and best practices Strong experience and understanding of software engineering, Infrastructure as Code (Ansible or Terraform) and build/deployment pipelines. Strong troubleshooting skills coupled with making data-driven decisions during incidents Strong understanding of cloud computing platforms (Azure or Google Cloud) and cloud-native setups (AKS, serverless, etc.).
Responsibilities:
Contribute significantly to the reliability, scalability and availability of Bright Horizons' digital infrastructure. Implement robust infrastructure, application and digital-experience monitoring in our enterprise-wide APM tool Dynatrace. Drive troubleshooting of critical incidents Drive the development and implementation of automation solutions to streamline processes. Create a roadmap to expand and consolidate tools. Collaborate with the above cross-functional teams. Work closely with Infrastructure and Architecture teams to design and implement roadmaps for scaling server and serverless architecture.
Similar Jobs:
Posted 34 minutes ago
United StatesFull-TimeDatabase DevOps
Technical Support Engineer (Remote, US-Based, Pacific Time Zone)
Company: Liquibase
Posted 41 minutes ago
US, CanadaFull-TimeGame Development
Senior/Staff Backend Engineer
Posted 43 minutes ago
United StatesFull-TimePeople Analytics
Analytics Engineer, People Analytics