Senior Site Reliability Engineer

Posted 7 months ago

💎 Seniority level: Senior, at least 4 years of experience in maintaining Cloud infrastructure with modern technologies, at least 1 year of experience in maintaining Web3-related infrastructure.

📍 Location: Switzerland

🔍 Industry: Web3 and blockchain infrastructure

🏢 Company: Gelato | 👥 101-250 | 💰 $240,025,345 over 3 years ago | Logistics, E-Commerce, Printing

🗣️ Languages: English

⏳ Experience: At least 4 years of experience in maintaining Cloud infrastructure with modern technologies, at least 1 year of experience in maintaining Web3-related infrastructure.

🪄 Skills: AWS, Docker, PHP, Python, Software Development, Ethereum, GCP, Git, Kubernetes, TypeScript, Azure, Go, Grafana, Prometheus, Rust, CI/CD, Microservices, Networking

Requirements:
  • At least 4 years of experience in maintaining Cloud infrastructure with modern technologies
  • At least 1 year of experience in maintaining Web3-related infrastructure
  • GitOps principles at heart
  • Ability to lead and positively influence peers in the decision-making process
  • Ability to maintain high performance and accuracy in rapidly changing and evolving work settings
  • Experience in operating infrastructure on at least one major Cloud provider (GCP, AWS, Azure)
  • Experience with Docker and containerized applications
  • Experience with Unix-based systems
  • Experience in operating and optimizing Kubernetes clusters
  • Experience with Git, Helm, Terraform, Kubectl and similar
  • Experience in networking, CDN, Gateways and deployment strategies
  • Experience in operating highly available infrastructure
  • Understanding of microservice-based architecture and operations
  • Experience in advanced debugging, logging, monitoring and alerting using tools such as Prometheus, Grafana, Splunk, Datadog
  • Experience in implementing and maintaining cost-optimized solutions
  • Experience with at least one programming language (e.g. Go, Python, Rust, PHP, TypeScript) and demonstrated capabilities in software development
  • Understanding of Web3 technologies and related challenges, including Rollups-as-a-Service (RaaS)
  • Eager to learn and grow professionally
  • Fluent in English (spoken and written)
Responsibilities:
  • Maintain and operate Gelato infrastructure in a multi-cloud environment
  • Contribute to improving our incident management lifecycle for overall reliability
  • Contribute to improving our Postmortem philosophy
  • Contribute to improving our DevOps culture
  • Deploy and maintain Rollups-as-a-Service (RaaS) core components and related observability stacks
  • Evaluate and modernize our existing infrastructure and deployment strategies to align with the latest industry standards
  • Maintain and enhance our CI/CD pipeline and its governance
  • Join the on-call rotation to provide operational support and ensure service availability
  • Participate in and conduct regular team meetings
  • Provide insights and recommendations on system design and scalability, focusing on reliability, security and efficiency in a Web3 context
  • Be an active team member by always looking out for cost-effective, innovative solutions and by facilitating the adoption of industry standards

Related Jobs

📍 Europe

🧭 Full-Time

🔍 Software Development

🏢 Company: Sanity | 👥 51-200 | 💰 Corporate almost 3 years ago | Software Development

  • Proven experience with SRE/DevOps tools, processes, and culture.
  • Proficient in programming languages like Python, Go, and TypeScript.
  • 5+ years of experience participating in an SRE on-call rotation.
  • Hands-on experience with Kubernetes for orchestrating, scaling, and managing containerized applications in the cloud.
  • Strong database management skills, particularly with PostgreSQL.
  • Experience with infrastructure as code, using tools like Terraform.
  • Familiarity with observability tools like Prometheus and similar stacks.
  • Plan and implement a global platform for delivering our software as a service.
  • Diagnose and troubleshoot complex distributed systems.
  • Ensure observability and analyze the behavior of our stack.
  • Orchestration, deployment, monitoring, automation.
  • Participate in our on-call rotation.

PostgreSQL, Python, Cloud Computing, ElasticSearch, Kubernetes, TypeScript, Go, Prometheus, CI/CD, Linux, DevOps, Terraform, Microservices

Posted about 1 month ago