Senior DevOps Engineer
S
Slingshot AerospaceAerospace
Remote, United States, United KingdomFull-TimeSenior
Salary136000 - 237000 USD per year
Apply NowOpens the employer's application page
Job Details
- Required Skills
- AWSDockerSoftware DevelopmentKubernetesCI/CDLinuxTerraformScriptingCloudFormationMLOps
Requirements
- Proficiency in Infrastructure as Code (IaC) tools and practices, such as Terraform or CloudFormation
- Experience with Software Development, including scripting and programming to support automation and management of DevOps workflows
- Strong understanding of Continuous Integration and deployment pipelines
- Expertise in System Administration, including network configuration and troubleshooting
- Extensive experience with Linux systems, including performance optimization and security management
- Excellent problem-solving skills and ability to collaborate across teams
- Familiarity with containerization technologies such as Docker and Kubernetes is a plus
- Experience with ML OPs (bonus)
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent professional experience (bonus)
Responsibilities
- Partner with offshore and onshore engineering teams to design, implement, and scale cloud-native infrastructure supporting a new customer portal and ongoing platform refactoring efforts
- Architect, build, and maintain Kubernetes-based environments that power production systems, ensuring scalability, resilience, and security
- Lead Infrastructure as Code initiatives (primarily Terraform) to automate provisioning, configuration, and environment consistency across AWS
- Design, implement, and optimize CI/CD pipelines to improve deployment velocity, reliability, and developer experience
- Integrate and operationalize MLOps practices, enabling efficient deployment, monitoring, and lifecycle management of machine learning workflows
- Embed DevSecOps best practices across the platform, incorporating security controls, compliance requirements, and monitoring into the development lifecycle
- Drive automation initiatives that reduce manual processes and increase system reliability and repeatability
- Collaborate closely with Platform, Engineering, and cross-functional stakeholders to gather requirements, troubleshoot issues, and continuously improve system architecture
- Monitor system performance, identify bottlenecks, and proactively implement improvements to optimize availability and cost efficiency
- Support incident response and root cause analysis efforts, driving long-term fixes and ensuring lessons learned translate into system improvements
View Full Description & ApplyYou'll be redirected to the employer's site