ApplySite Reliability Engineer (SRE)
Posted 3 months agoViewed
View full description
Requirements:
- Minimum 3+ years of experience in Site Reliability Engineering, DevOps, or a related role.
- Proficiency in the ELK stack (Elasticsearch, Logstash, Kibana) for log monitoring.
- Experience with the TICK stack (Telegraf, InfluxDB, Chronograf, Kapacitor) for metrics monitoring.
- Strong scripting skills in languages such as Python, Bash, or Ruby.
- Understanding of Operating System: Ubuntu(OpenStack) - Must have, Debian and Redhat etc.,
- DevOps Platforms Gitlab - Good to have, Or similar
- Solid understanding of Grafana and Prometheus.
- Having worked with ServiceNow or something similar.
- Experience with configuration management tools like Ansible, Puppet, or Chef.
- Familiarity with containerization and orchestration tools like Docker and Kubernetes.
- Understanding of cloud platforms (Any of AWS, Azure, or GCP) and their services.
- Bachelor’s degree in computer science, Information Technology, or a related field.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration abilities.
Responsibilities:
Ensuring the reliability, availability, and performance of our customer’s platforms and services, bridging the gap between development and operations.
Apply