ApplySite Reliability Engineer (SRE)
Posted 3 months agoViewed
View full description
Requirements:
- Approximately 3 years of experience in SRE, DevOps, or Infrastructure Engineer roles.
- Strong experience with at least one major cloud provider such as AWS, GCP, or Azure.
- Hands-on experience with Kubernetes and containerization tools.
- Proficient in scripting languages such as Python, Bash, or similar.
- Familiarity with Infrastructure as Code (IaC) tools like Terraform, Pulumi, or AWS CloudFormation.
- Strong understanding of networking concepts and HA architecture.
- Experience with CI/CD tools.
- Experience with modern monitoring and observability tools.
- Strong analytical skills for troubleshooting complex issues.
Responsibilities:
- Ensure high availability (HA) and scalability of critical infrastructure components.
- Identify and eliminate single points of failure across the cloud environment.
- Manage and optimize cloud-based workloads.
- Automate provisioning, scaling, and maintenance tasks using IaC tools.
- Manage Kubernetes clusters and related operations.
- Implement monitoring solutions and participate in incident response.
- Develop automation scripts to reduce manual efforts and advocate for configuration automation.
Apply