Job Details
- Required Skills
- DockerPythonBashGCPKubernetesCI/CDTerraform
Requirements
- Strong experience in DevOps, Site Reliability Engineering, or Platform Engineering roles supporting cloud-native environments.
- Deep hands-on expertise with Google Cloud Platform (GCP), including compute, networking, IAM, storage, and security services.
- Proven experience designing and maintaining CI/CD pipelines for multiple application and deployment environments.
- Strong proficiency with Infrastructure as Code tools, especially Terraform.
- Solid experience with Kubernetes (preferably GKE), Docker, and containerized application deployments.
- Strong scripting skills in Python, Bash, PowerShell, or similar languages for automation.
- Experience with observability tools including logging, monitoring, dashboards, and alerting systems.
- Strong understanding of cloud security principles, including IAM, secrets management, and compliance controls.
- Experience with Git-based workflows, artifact management, and release automation strategies.
- Excellent communication and collaboration skills, with the ability to influence technical decisions across teams.
Responsibilities
- Design, build, and maintain scalable GCP infrastructure using Infrastructure as Code (Terraform) and cloud-native automation practices.
- Develop and optimize CI/CD pipelines using tools such as Cloud Build, GitHub Actions, Jenkins, and related deployment systems.
- Manage and enhance GCP delivery workflows, including artifact management, deployment triggers, service accounts, and release approvals.
- Collaborate with engineering teams to improve build, release, and deployment processes across cloud-native and microservices applications.
- Implement observability solutions using Google Cloud Operations Suite, including logging, monitoring, alerting, and telemetry.
- Strengthen platform security through secrets management, IAM best practices, vulnerability scanning, and policy enforcement.
- Manage Kubernetes-based environments (GKE), including container orchestration, Helm deployments, and cluster optimization.
- Drive reliability engineering practices such as incident response, root cause analysis, SLO definition, and automated remediation.
- Develop reusable infrastructure modules, templates, and platform patterns to improve consistency and developer productivity.
- Provide technical leadership and mentorship on GCP architecture, DevOps best practices, and platform engineering standards.
View Full Description & ApplyYou'll be redirected to the employer's site