Senior DevOps Engineer

New
E
100% Remote - IndiaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
5+ years
Required Skills
PostgreSQLGCPKubernetesGrafanaPrometheusCI/CDTerraformNetworking

Requirements

  • 5+ years of hands-on experience in DevOps, SRE, or Cloud Infrastructure Engineering.
  • Strong experience with GCP services (e.g., GKE, IAM, Cloud Run, Cloud SQL, Cloud Functions, Pub/Sub).
  • Proven expertise in deploying and managing Kubernetes environments in production.
  • Proficiency in automating deployments, infrastructure configuration, and container lifecycle management.
  • Deep understanding of networking fundamentals, including DNS, load balancing, NAT, VPNs, TLS/SSL, and routing policies.
  • Demonstrated experience implementing CI/CD pipelines using GitHub Actions, ArgoCD, Jenkins, or similar.
  • Solid knowledge of PostgreSQL and experience managing databases at scale.
  • Familiarity with monitoring, logging, and alerting systems.
  • Practical knowledge of cloud security principles, vulnerability management, IAM policies, and secrets handling.

Responsibilities

  • Design and manage cloud infrastructure on Google Cloud Platform (GCP) with a focus on security, scalability, and cost-efficiency.
  • Architect and maintain Kubernetes clusters, enabling robust, production-grade container orchestration.
  • Develop and maintain fully automated CI/CD pipelines to support reliable software delivery across environments.
  • Implement infrastructure-as-code (IaC) using Terraform or equivalent tools for reproducible and auditable deployments.
  • Configure and manage PostgreSQL databases, ensuring high availability, performance tuning, and backup automation.
  • Define and enforce networking configurations (VPC, subnets, firewall rules, routing, ingress/egress control, DNS).
  • Apply and monitor security best practices across infrastructure, including IAM policies, secrets management, TLS/SSL, and threat prevention.
  • Monitor systems using tools like Prometheus, Grafana, and Stackdriver; build alerts and dashboards to ensure observability and uptime.
  • Participate in incident response, root cause analysis, and postmortems.
  • Continuously evaluate, optimize, and improve operational processes, deployment speed, and infrastructure resilience.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now