Senior / Staff DevOps Engineer
New
Fully remote work environment with flexibility to work from anywhere within the United States.Full-TimeSenior
Salary200,000 - 225,000 USD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- 6+ years
- Required Skills
- DockerPythonGCPKubernetesGoCI/CDTerraform
Requirements
- 6+ years of experience in DevOps, Site Reliability Engineering, Platform Engineering, or Infrastructure Engineering within a software development environment.
- Deep expertise with major cloud platforms, preferably Google Cloud Platform (GCP).
- Extensive experience with infrastructure-as-code tools such as Terraform, Pulumi, CloudFormation, or similar technologies.
- Strong hands-on experience with containerization and orchestration technologies including Docker, Kubernetes, ECS, or equivalent platforms.
- Proven ability to build, manage, and optimize CI/CD pipelines using tools such as GitHub Actions, CircleCI, or comparable solutions.
- Experience implementing and maintaining observability platforms, monitoring frameworks, and operational dashboards.
- Strong scripting and programming skills in languages such as Python, Go, Bash, TypeScript, or similar.
- Advanced knowledge of cloud security best practices, including IAM, least-privilege access controls, encryption, secrets management, and vulnerability management.
- Hands-on experience supporting compliance frameworks such as SOC 2, ISO 27001, GDPR, HIPAA, or related standards.
- Demonstrated proficiency using AI-powered development and productivity tools, including GitHub Copilot, Cursor, Claude Code, or equivalent technologies.
- Experience leading production incident response, participating in on-call rotations, and driving operational improvements based on postmortem analysis.
- Excellent communication, documentation, and collaboration skills.
- Prior experience working within startup environments.
Responsibilities
- Design, build, and maintain secure, scalable, and cost-efficient cloud infrastructure with a strong focus on automation, reliability, and operational excellence.
- Lead the development and continuous improvement of infrastructure-as-code frameworks, deployment pipelines, and platform automation capabilities.
- Own and optimize CI/CD processes, ensuring rapid, reliable, and secure software delivery across the development lifecycle.
- Establish and enhance observability practices through monitoring, logging, tracing, alerting, and dashboarding solutions that proactively identify and resolve issues.
- Drive infrastructure security initiatives, including identity and access management, encryption, secrets management, network security, and vulnerability remediation.
- Partner with cross-functional teams to maintain compliance with industry standards and regulatory frameworks while automating controls and audit readiness processes.
- Lead incident response efforts, participate in on-call operations, facilitate post-incident reviews, and implement long-term improvements that enhance system reliability.
- Define service reliability objectives, support capacity planning, and perform performance optimization to ensure platform scalability and availability.
- Leverage AI-powered engineering tools to accelerate infrastructure development, automate operational workflows, improve troubleshooting, and enhance team productivity.
- Mentor engineers, establish platform engineering best practices, and contribute to a culture of continuous improvement, shared accountability, and technical excellence.
View Full Description & ApplyYou'll be redirected to the employer's site