Job Details
- Experience: 5+ years
- Required Skills: GCP, Kafka, Kubernetes, Terraform, Helm
Requirements
- 5+ years of professional experience in DevOps, SRE, or similar infrastructure-focused engineering roles
- Strong expertise with GCP
- Additional experience with AWS is a plus
- Hands-on experience with Kubernetes
- Hands-on experience with Terraform
- Hands-on experience with Helm
- Hands-on experience with related IaC tooling
- In-depth knowledge of databases at scale
- In-depth knowledge of queuing/messaging systems
- In-depth knowledge of streaming platforms
- Comfortable and experienced operating in environments with near-zero downtime requirements
- Familiarity with financial and/or trading systems
- Strong automation mindset with experience in CI/CD tooling and pipeline development
- Excellent problem-solving, communication, and collaboration skills
- Security-focused, with practical experience embedding operational security best practices
Responsibilities
- Build and operate cloud infrastructure primarily on Google Cloud Platform (GCP)
- Manage Kubernetes clusters, using tools like Helm and Terraform for infrastructure as code and deployment automation
- Collaborate with engineering teams to design, deploy, and support scalable back-end services
- Implement CI/CD pipelines and automation for code delivery, environment provisioning, and rollbacks
- Monitor, troubleshoot, and optimize systems to deliver high performance and reliability under load
- Ensure near-zero downtime deployments with seamless rollback strategies
- Support and tune databases at scale, as well as messaging and streaming systems such as Redis Streams and Kafka (or similar)
- Drive proactive operational excellence: monitoring, alerting, incident response, and performance tuning
- Introduce and champion best practices for secure, resilient infrastructure and operational tooling