Senior Infrastructure Site Reliability Engineer

Posted 10 days ago

💎 Seniority level: Senior, Several years

📍 Location: Canada

💸 Salary: 125,000 - 175,000 CAD per year

🔍 Industry: Software Development

🗣️ Languages: English

⏳ Experience: Several years

🪄 Skills: Python, SQL, Bash, Cloud Computing, ElasticSearch, Git, Jenkins, Kubernetes, Go, Grafana, Prometheus, REST API, CI/CD, Linux, DevOps, Terraform, Compliance, Networking, Ansible, Scripting, Debugging

Requirements:
  • Several years of experience in SRE, DevOps, or related roles.
  • Proven experience working in hyperscale cloud environments.
  • Demonstrated ability to lead infrastructure projects.
  • Strong understanding of network protocols and configurations.
  • Experience with automation tools (e.g., Ansible, Terraform) and scripting languages (e.g., Python, Bash, Golang).
  • Experience automating component deployment across multiple environments using tools like Jenkins, CircleCI, or GitHub Actions.
  • Proficiency in observability and log analysis techniques to detect and resolve system issues.
  • Effective communication skills for both technical and non-technical stakeholders.
  • Familiarity with compliance requirements and frameworks: PCI, ISO 27001, HIPAA, SOC.
Responsibilities:
  • Manage full infrastructure lifecycle from design to decommission, ensuring systems are reliable and efficient.
  • Participate in an on-call rotation for the compute platform and related systems.
  • Automate routine tasks and develop tools to improve system efficiency and reduce the time spent on manual intervention.
  • Conduct performance tuning, troubleshooting, and capacity planning to keep systems reliable and efficient.
  • Participate in the creation and testing of disaster recovery plans.
  • Monitor and maintain observability systems to ensure issues are identified and resolved proactively.
  • Educate team members on security best practices and emerging threats.