Senior Platform Engineer
United States | Full-Time | Senior
Salary: 130,000 - 160,000 USD per year
Job Details
- Required Skills: AWS, Kubernetes, CI/CD, DevOps, Terraform, Helm
Requirements
- Extensive hands-on experience managing Kubernetes at scale
- Experience with ArgoCD
- Experience with Helm
- Strong expertise in AWS cloud services (networking, compute, storage, security architecture)
- Advanced proficiency with Terraform and Infrastructure as Code
- Deep experience building and supporting distributed SaaS or cloud-native platforms
- Strong understanding of CI/CD pipelines
- Strong understanding of GitOps workflows
- Strong understanding of DevOps principles
- Strong understanding of internal developer platforms
- Experience implementing or working with AIOps practices (ML-driven observability or automated remediation)
- Solid knowledge of cloud security, IAM principles, and platform hardening best practices
- Ability to design scalable, reliable systems and define operational metrics (SLIs/SLOs)
- Strong communication skills
- AWS Certification (highly desirable)
- CKA/CKAD Certification (highly desirable)
- Solution Architect Certification (highly desirable)
- Proven track record of mentoring engineers
- Strong problem-solving mindset with a focus on automation, reliability, and continuous improvement
Responsibilities
- Lead the design and architecture of scalable, secure, and cloud-native platform systems aligned with business and technical requirements
- Build and optimize AWS-based infrastructure using Infrastructure as Code and modern automation practices
- Manage and evolve Kubernetes environments at scale, including cluster lifecycle, deployments, and GitOps workflows
- Define and maintain platform reliability standards, including SLIs, SLOs, monitoring, and performance benchmarks
- Integrate AIOps capabilities such as intelligent monitoring, anomaly detection, predictive scaling, and automated remediation
- Improve platform performance, reliability, and developer experience through continuous optimization initiatives
- Develop and maintain robust incident response, support, and disaster recovery processes with a focus on continuous improvement
- Collaborate with engineering teams to establish reusable patterns, DevOps practices, and secure CI/CD pipelines
- Mentor engineers, contribute to code reviews, and foster a culture of technical excellence and knowledge sharing
- Evaluate emerging technologies and drive adoption of tools that enhance platform scalability, security, and efficiency