Senior Site Reliability Engineer

New
India (Remote)Full-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
DockerPythonJavaKubernetesCI/CDLinuxDevOpsDatadog

Requirements

  • Expertise in at least one technology stack designing, coding, testing, and delivering software
  • Proficiency in one or more technology domains
  • Working knowledge of infrastructure components
  • Excellent debugging and trouble shooting skills
  • Prior experience in DevOps and/or application development teams
  • Hands on experience using Java, Python, or scripting languages
  • Hands on experience of Kubernetes, Docker, Docker Swarm style deployments
  • Exposure on DataDog monitoring
  • Hands on experience of Continuous Delivery tools
  • Hands on experience in Unix: Linux and Solaris
  • Exposure to Orchestration and configuration management tools
  • Experience with infrastructure components utilized in data warehousing or big data environments

Responsibilities

  • Design, code, test and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Engage with development team throughout the life cycle to help develop software for reliability and scale
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Participate in the 24×7 support coverage as needed
  • Mentor and guide junior developers
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now