Senior DevOps Engineer

Remote, United States, United KingdomFull-TimeSenior

Salary136000 - 237000 USD per year

Apply NowOpens the employer's application page

Job Details

Required Skills: AWSDockerSoftware DevelopmentKubernetesCI/CDLinuxTerraformScriptingCloudFormationMLOps

Proficiency in Infrastructure as Code (IaC) tools and practices, such as Terraform or CloudFormation
Experience with Software Development, including scripting and programming to support automation and management of DevOps workflows
Strong understanding of Continuous Integration and deployment pipelines
Expertise in System Administration, including network configuration and troubleshooting
Extensive experience with Linux systems, including performance optimization and security management
Excellent problem-solving skills and ability to collaborate across teams
Familiarity with containerization technologies such as Docker and Kubernetes is a plus
Experience with ML OPs (bonus)
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent professional experience (bonus)

Partner with offshore and onshore engineering teams to design, implement, and scale cloud-native infrastructure supporting a new customer portal and ongoing platform refactoring efforts
Architect, build, and maintain Kubernetes-based environments that power production systems, ensuring scalability, resilience, and security
Lead Infrastructure as Code initiatives (primarily Terraform) to automate provisioning, configuration, and environment consistency across AWS
Design, implement, and optimize CI/CD pipelines to improve deployment velocity, reliability, and developer experience
Integrate and operationalize MLOps practices, enabling efficient deployment, monitoring, and lifecycle management of machine learning workflows
Embed DevSecOps best practices across the platform, incorporating security controls, compliance requirements, and monitoring into the development lifecycle
Drive automation initiatives that reduce manual processes and increase system reliability and repeatability
Collaborate closely with Platform, Engineering, and cross-functional stakeholders to gather requirements, troubleshoot issues, and continuously improve system architecture
Monitor system performance, identify bottlenecks, and proactively implement improvements to optimize availability and cost efficiency
Support incident response and root cause analysis efforts, driving long-term fixes and ensuring lessons learned translate into system improvements

View Full Description & ApplyYou'll be redirected to the employer's site