Senior DevOps Engineer

Remote, United States, United KingdomFull-TimeSenior
Salary136000 - 237000 USD per year
Apply NowOpens the employer's application page

Job Details

Required Skills
AWSDockerSoftware DevelopmentKubernetesCI/CDLinuxTerraformScriptingCloudFormationMLOps

Requirements

  • Proficiency in Infrastructure as Code (IaC) tools and practices, such as Terraform or CloudFormation
  • Experience with Software Development, including scripting and programming to support automation and management of DevOps workflows
  • Strong understanding of Continuous Integration and deployment pipelines
  • Expertise in System Administration, including network configuration and troubleshooting
  • Extensive experience with Linux systems, including performance optimization and security management
  • Excellent problem-solving skills and ability to collaborate across teams
  • Familiarity with containerization technologies such as Docker and Kubernetes is a plus
  • Experience with ML OPs (bonus)
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent professional experience (bonus)

Responsibilities

  • Partner with offshore and onshore engineering teams to design, implement, and scale cloud-native infrastructure supporting a new customer portal and ongoing platform refactoring efforts
  • Architect, build, and maintain Kubernetes-based environments that power production systems, ensuring scalability, resilience, and security
  • Lead Infrastructure as Code initiatives (primarily Terraform) to automate provisioning, configuration, and environment consistency across AWS
  • Design, implement, and optimize CI/CD pipelines to improve deployment velocity, reliability, and developer experience
  • Integrate and operationalize MLOps practices, enabling efficient deployment, monitoring, and lifecycle management of machine learning workflows
  • Embed DevSecOps best practices across the platform, incorporating security controls, compliance requirements, and monitoring into the development lifecycle
  • Drive automation initiatives that reduce manual processes and increase system reliability and repeatability
  • Collaborate closely with Platform, Engineering, and cross-functional stakeholders to gather requirements, troubleshoot issues, and continuously improve system architecture
  • Monitor system performance, identify bottlenecks, and proactively implement improvements to optimize availability and cost efficiency
  • Support incident response and root cause analysis efforts, driving long-term fixes and ensuring lessons learned translate into system improvements
View Full Description & ApplyYou'll be redirected to the employer's site
136000 - 237000 USD per year
Apply Now