United States
GenAI and data analytics
Requirements:
- 3+ years of experience in a DevOps or SRE role, ideally within a cloud-native environment (AWS preferred).
- Deep understanding of cloud services, particularly AWS and Azure.
- Strong experience with infrastructure-as-code tools such as Terraform or CloudFormation.
- Proficient in automation with Rundeck, Jenkins, and scripting languages (Python, Bash, etc.).
- Experience with monitoring and observability tools such as Datadog, Prometheus, and Grafana, and with centralized logging systems (ELK, Splunk, etc.).
- Experience setting up and managing CI/CD pipelines with tools such as Jenkins or GitLab CI.
- Understanding of best practices around security, compliance, and audits.
- Strong troubleshooting skills with a proactive approach to issue resolution and process optimization.
- Excellent communication skills with the ability to work in cross-functional teams.
Responsibilities:
- Design, deploy, and manage cloud infrastructure using tools such as Terraform, CloudFormation, and Docker.
- Implement and manage monitoring systems (e.g., Prometheus, CloudWatch) to ensure service health, define SLIs and SLOs, and create a robust incident response plan.
- Build automation tools for system health monitoring, scaling, and performance optimization using Rundeck, Jenkins, and other automation frameworks.
- Design, improve, and maintain CI/CD pipelines to ensure fast, reliable, and consistent delivery of applications.
- Ensure the infrastructure meets compliance standards and assist with audits and security reviews.
- Provide operational support for production environments, troubleshoot issues, and implement solutions for system reliability and scalability.
AWS, Docker, Python, Bash, Jenkins, Kubernetes, Azure, Grafana, Prometheus, Communication Skills, Collaboration, CI/CD, DevOps, Terraform
Posted 2024-10-25