Apply

Site Reliability Engineer, Edge

Posted 2024-11-13

View full description

💎 Seniority level: Senior, At least 3 years in an SRE role or at least 5 years in an adjacent role

📍 Location: United States

💸 Salary: 192000 - 288000 USD per year

🔍 Industry: Frontend Cloud and web services

🏢 Company: Vercel

⏳ Experience: At least 3 years in an SRE role or at least 5 years in an adjacent role

🪄 Skills: AWSProblem Solving

Requirements:
  • At least 3 years of experience in an SRE role, or at least 5 years of experience in an adjacent role (e.g., platform engineering), operating in a scaled environment.
  • Firm grasp of the SRE philosophy and mindset, with practical experience working on or directly with SRE teams that have proactively engaged in system design and improvement.
  • Strong sense of accountability and commitment to problem-solving, backed by curiosity to dig deep and identify root causes.
  • Willingness to proactively engage with development teams to influence the course of software design and operational practices.
  • Capability to manage risk, make decisions, and exhibit sound judgment.
  • Demonstrated ability to plan and deliver long-term projects.
  • Familiarity with networking protocols and application serving.
  • Experience deploying and operating systems on AWS infrastructure at scale.
  • Bonus: Experience working with Terraform, Kubernetes, Golang, and/or Lua.
Responsibilities:
  • Ensure that our products are built for reliability and scale by engaging in the end-to-end design, development, and deployment of new software.
  • Drive continuous risk mitigation and reduction through direct involvement in incident management, blameless postmortems, and follow-ups.
  • Drive measurable improvements to the reliability, performance, and efficiency of our production systems through instrumentation, analysis, and implementation of engineering improvements.
  • Devise repeatable, low-toil operational practices through the development of automated systems for software delivery, system failover, and capacity management.
Apply