ApplySite Reliability Engineer
Posted 5 months agoViewed
View full description
Requirements:
- A comprehensive understanding of Site Reliability Engineering
- Experience working with a cloud service provider (ideally Azure or AWS)
- Strong examples of implementing automation/solutions by code (preferably Python, C#, Java, or Go)
- Commercial experience working with compute technologies (such as Kubernetes, or Serverless)
- Designed, implemented, and/or supported solutions in a production environment
- Strong interpersonal and communication skills to work in a fast-paced and rapidly changing dynamic environment
Responsibilities:
- Monitoring our client’s services using modern tools and SRE practices.
- Responding to incidents originating from 2nd line support within the times set out in the SLA (being on-call).
- Performing and assisting in root cause analysis and blameless post-mortems to enable incidents to be understood and avoided in the future.
- Improving the testing and release procedure.
- Planning for and making changes to capacity to balance the demand vs. cost saving equation better.
- Undertaking improvements to the infrastructure and product.
- Making changes to client’s services based upon operational or business needs.
- Advising and supporting the further development of Ensono Digital’s Intellectual Property to ensure future projects benefit from what we learn.
Apply