Senior Site Reliability Engineer - Network Monitoring

India, Remote | Full-Time | Senior
Salary not disclosed

Job Details

Experience
6+ years
Required Skills
Python, Bash, Kubernetes, Linux, Terraform, Helm

Requirements

  • 6+ years in an SRE, platform engineering, or equivalent DevOps / infrastructure role
  • Deep, hands-on Kubernetes experience: cluster administration, RBAC, networking
  • Proficiency with at least one infrastructure-as-code tool: Pulumi, Helm, Kustomize, or Terraform
  • Very strong scripting and automation skills in Python and Bash
  • Very strong Linux background
  • Experience with AI-assisted development tooling (Claude Code, GitHub Copilot, Codex, or equivalent)
  • Solid network troubleshooting fundamentals

Responsibilities

  • Deploying and managing Kubernetes clusters and workloads hosted on OpenStack
  • Operating and improving GitOps delivery pipelines using Rancher Fleet
  • Developing new and maintaining existing Python tools and libraries within our Pulumi-based infrastructure-as-code environment
  • Supporting internal customers when they experience issues with our services or tooling
  • Directly shaping how we monitor the global GoDaddy network and how we deliver reliable, automated services to our internal customers
  • Maintaining and extending devcontainer-based working environments with scripting, AI rule governance, and custom CLI tools