Sr. Site Reliability Engineer
New
A
AuthZedAuthorization Infrastructure
AuthZed is a fully remote company with employees across the US, Canada, and Europe., flexible schedule to accommodate different timezonesFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- DockerKubernetesGrafanaPrometheusCI/CDTerraform
Requirements
- Proven experience as a Site Reliability Engineer or in a similar role.
- Strong understanding of networking, operating systems, and cloud infrastructure.
- Experience with Site Reliability Engineering, System Design, and Distributed Computing.
- Experience in various programming languages (NodeJS, Java, Python, Ruby, Go).
- Experience with containerization technologies such as Docker and Kubernetes.
- Knowledge of infrastructure-as-code tools like Terraform and Pulumi.
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Experience with lower-level implementation details of relational databases.
- Experience working with Git and GitHub.
- Experience with continuous integration and deployment systems.
- Strong problem-solving and troubleshooting skills.
- Excellent communication and collaboration abilities.
Responsibilities
- Design, implement, and maintain highly available and scalable infrastructure solutions for our projects, products, and customers.
- Monitor and analyze system performance, identifying and resolving bottlenecks and issues.
- Automate infrastructure deployment and configuration management processes.
- Continuously improve system reliability, security, and efficiency through proactive monitoring, capacity planning, and performance tuning.
- Troubleshoot and resolve complex infrastructure and application issues in production and test environments.
- Collaborate with software engineering teams to design and implement systems that are resilient, scalable, and secure.
- Participate in on-call rotation and respond to production incidents.
- Document system configurations, troubleshooting procedures, and operational guidelines.
View Full Description & ApplyYou'll be redirected to the employer's site