Senior Cloud & DevOps Engineer
New
United States, Mexico TZContractSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years
- Required Skills
- AWSDockerPythonBashKubernetesCI/CDTerraform
Requirements
- 5+ years of experience in a DevOps, Platform Engineering, or Site Reliability Engineering role.
- Advanced expertise in Amazon ECS – task definitions, services, capacity providers, Fargate & EC2 launch types.
- Advanced expertise in Amazon EKS – cluster provisioning, node groups, autoscaling, RBAC, and networking (VPC CNI, CoreDNS).
- Deep knowledge of Docker and container best practices (multi-stage builds, image optimization, security scanning).
- Strong experience with Kubernetes concepts: Deployments, StatefulSets, DaemonSets, Ingress, ConfigMaps, Secrets, HPA/VPA.
- Proficiency in Infrastructure as Code (Terraform preferred).
- Solid understanding of AWS networking (VPC, subnets, security groups, ALB/NLB, Route 53).
- Experience with CI/CD tools such as GitHub Actions, Jenkins, GitLab CI, or AWS CodePipeline.
- Strong scripting skills in Bash, Python, or similar languages.
- Familiarity with GitOps workflows (ArgoCD, Flux).
Responsibilities
- Design, deploy, and manage containerized workloads using Amazon ECS (Elastic Container Service) and Amazon EKS (Elastic Kubernetes Service).
- Build and maintain CI/CD pipelines to automate software delivery workflows.
- Develop and manage Docker container images, registries (ECR), and container lifecycle best practices.
- Implement Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or CDK.
- Monitor, troubleshoot, and optimize cloud infrastructure performance, availability, and cost.
- Enforce security best practices across containerized environments (IAM roles, network policies, secrets management).
- Collaborate with software engineers to containerize applications and migrate workloads to ECS/EKS.
- Manage Kubernetes cluster configurations, namespaces, Helm charts, and service mesh integrations.
- Define and maintain observability standards using tools like CloudWatch, Prometheus, Grafana, or Datadog.
- Participate in on-call rotations and incident response processes.
View Full Description & ApplyYou'll be redirected to the employer's site