Apply

Staff Site Reliability Engineer

Posted 2 months agoViewed

View full description

💎 Seniority level: Staff

📍 Location: Brazil

🔍 Industry: Corporate wellness

🏢 Company: Wellhub

🗣️ Languages: English, Portuguese

🪄 Skills: AWSPythonKubernetesRubyGrafanaPrometheusCI/CD

Requirements:
  • Proven technical experience with AWS cloud services and Kubernetes.
  • Deep knowledge of Kubernetes and related ecosystem.
  • Solid knowledge of observability systems.
  • Experience with operator-managed Infrastructure as Code, preferably crossplane or Kubernetes Operators.
  • Ability to write software for production environments.
  • Excellent analytical and problem-solving skills.
  • Collaboration and learning-driven mindset.
  • CNCF Kubernetes Certifications (e.g. CKA, CKS, or CKAD).
  • AWS Certifications.
  • Excellent communication skills in both English and Portuguese.
Responsibilities:
  • Help to build a global, secure, scalable, and cost-effective Cloud platform using Kubernetes in AWS.
  • Develop and evolve Kubernetes operators and cloud-native automation.
  • Build tools for engineering teams to manage their cloud resources autonomously.
  • Ensure security and compliance by delivering secure products and implementing DevSecOps.
  • Improve observability, reliability, and cost awareness.
  • Support other engineering teams in product and tools usage.
  • Build and maintain CI/CD tools and services.
  • Maintain highly available and reliable Kubernetes clusters.
  • Contribute to product documentation.
  • Participate in defining standards, guidelines and best practices.
Apply