Senior Site Reliability Engineer
Canada, Full-Time, Senior
Salary: 195,700 - 225,000 USD per year
Job Details
- Experience: 5+ years
- Required Skills: AWS, Python, Bash, Go, DevOps, Terraform, Datadog
Requirements
- 5+ years of experience in Software Engineering, Site Reliability Engineering, or DevOps roles.
- Strong communication skills and experience working with collaborative tooling.
- Hands-on experience with Infrastructure as Code, particularly Terraform and AWS.
- Solid understanding of containerization and orchestration tools (e.g., ECS or equivalent).
- Strong programming skills in languages such as Bash, Python, and/or Golang.
- Experience with observability tools and practices (e.g., Datadog).
- Strong problem-solving mindset.
Responsibilities
- Partner with software engineering teams to design, build, and maintain reliable and resilient services in production environments.
- Develop and improve infrastructure automation to enhance usability and reduce operational friction for internal teams.
- Build and maintain observability solutions, including monitoring, logging, and alerting systems.
- Identify and eliminate operational toil by automating repetitive manual processes.
- Contribute to infrastructure design through documentation, technical designs, and runbooks.
- Ensure systems and processes meet security and compliance requirements.
- Participate in on-call rotations to support incident response and ensure system uptime.