Senior Cloud Platform Engineer

New
United StatesFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
5+ years
Required Skills
AWSNode.jsPythonDynamoDBKubernetesGoTerraformGitHub ActionsDatadogAWS LambdaCloudFormation

Requirements

  • 5+ years of experience with Infrastructure-as-Code tools such as Terraform, Terragrunt, Atlantis, or CDK in AWS environments.
  • 4+ years of experience working with containerized workloads using Kubernetes (EKS) and/or ECS at scale.
  • 4+ years of experience building or managing CI/CD systems in distributed engineering environments, ideally using GitHub Actions.
  • Strong expertise in Kubernetes and GitOps tooling such as ArgoCD; experience with service mesh tools (e.g., Istio) is a plus.
  • Deep knowledge of AWS services including EC2, ECS, EKS, Lambda, Fargate, IAM, Route53, CloudFront, and DynamoDB.
  • Proficiency in at least one programming language such as Python, Go, or Node.js.
  • Experience implementing observability and monitoring systems using tools like Datadog, CloudWatch, or similar platforms.
  • Strong background in incident response practices, including on-call operations and post-incident reviews.
  • Ability to work independently in distributed teams, make technical decisions, and communicate clearly across stakeholders.
  • Strong documentation, collaboration, and knowledge-sharing mindset with a focus on mentorship and continuous learning.

Responsibilities

  • Design, build, and maintain cloud infrastructure using Infrastructure-as-Code tools such as Terraform and Terragrunt, ensuring scalable, secure, and cost-efficient AWS environments.
  • Support and modernize legacy infrastructure built with CloudFormation and CDK, including services such as ECS, Lambda, and EMR.
  • Architect and promote Kubernetes-based deployments using EKS and GitOps practices with tools such as ArgoCD.
  • Lead migration of deployment pipelines from legacy systems to modern CI/CD frameworks using GitHub Actions and related tooling.
  • Develop and maintain centralized CI/CD platforms and reusable deployment frameworks supporting large-scale engineering teams.
  • Implement and enhance observability solutions using tools such as Datadog or CloudWatch for monitoring, alerting, and system analytics.
  • Contribute to SRE practices, including incident response, on-call support, blameless post-mortems, and reliability improvements.
  • Collaborate across engineering teams to influence AWS architecture decisions and promote best practices in scalability, security, and performance.
  • Document systems, architectural decisions, and processes while actively participating in knowledge sharing and technical discussions.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now