Senior DevOps Engineer
New
E
Eltropy Inc.FinTech
100% Remote - IndiaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years
- Required Skills
- PostgreSQLGCPKubernetesGrafanaPrometheusCI/CDTerraformNetworking
Requirements
- 5+ years of hands-on experience in DevOps, SRE, or Cloud Infrastructure Engineering.
- Strong experience with GCP services (e.g., GKE, IAM, Cloud Run, Cloud SQL, Cloud Functions, Pub/Sub).
- Proven expertise in deploying and managing Kubernetes environments in production.
- Proficiency in automating deployments, infrastructure configuration, and container lifecycle management.
- Deep understanding of networking fundamentals, including DNS, load balancing, NAT, VPNs, TLS/SSL, and routing policies.
- Demonstrated experience implementing CI/CD pipelines using GitHub Actions, ArgoCD, Jenkins, or similar.
- Solid knowledge of PostgreSQL and experience managing databases at scale.
- Familiarity with monitoring, logging, and alerting systems.
- Practical knowledge of cloud security principles, vulnerability management, IAM policies, and secrets handling.
Responsibilities
- Design and manage cloud infrastructure on Google Cloud Platform (GCP) with a focus on security, scalability, and cost-efficiency.
- Architect and maintain Kubernetes clusters, enabling robust, production-grade container orchestration.
- Develop and maintain fully automated CI/CD pipelines to support reliable software delivery across environments.
- Implement infrastructure-as-code (IaC) using Terraform or equivalent tools for reproducible and auditable deployments.
- Configure and manage PostgreSQL databases, ensuring high availability, performance tuning, and backup automation.
- Define and enforce networking configurations (VPC, subnets, firewall rules, routing, ingress/egress control, DNS).
- Apply and monitor security best practices across infrastructure, including IAM policies, secrets management, TLS/SSL, and threat prevention.
- Monitor systems using tools like Prometheus, Grafana, and Stackdriver; build alerts and dashboards to ensure observability and uptime.
- Participate in incident response, root cause analysis, and postmortems.
- Continuously evaluate, optimize, and improve operational processes, deployment speed, and infrastructure resilience.
View Full Description & ApplyYou'll be redirected to the employer's site