Apply

Senior Site Reliability Engineer - SRE - 12 months rolling contract

Posted about 1 month agoViewed

View full description

💎 Seniority level: Senior

📍 Location: Americas, UTC-8, UTC-5

🔍 Industry: Digital paper and learning solutions

🗣️ Languages: English

🪄 Skills: AWSPostgreSQLKotlinKubernetesMongoDBTypeScriptGoCI/CDLinuxTerraformMicroservicesNetworking

Requirements:
  • Strong experience working in an AWS-hosted environment.
  • Experience in supporting production workloads and firefighting.
  • Knowledge of SRE best practices and common issues.
  • Experience with system monitoring tools.
  • Understanding and experience with distributed databases.
  • Solid understanding of Linux and Networking fundamentals.
  • Background in back-end development, including API usage and creation.
  • Knowledge of Security for network and containers.
  • Understanding of container orchestration, especially Kubernetes.
  • Experience managing Relational and Non-relational databases, including backup and restore operations.
  • Familiarity with automation/configuration management tools, preferably CDK or Terraform.
Responsibilities:
  • Design, build, and maintain the Goodnotes infrastructure, adhering to Dickerson's Hierarchy of Reliability.
  • Design, refine, and execute new and existing playbooks.
  • Educate various teams in SRE best practices, assisting in design and capacity planning.
  • Serve as the go-to person for higher-level escalation for applications.
  • Improve SLAs, optimize latency and error rates.
  • Enhance system monitoring, health reporting, and logging.
  • Implement and maintain security practices.
Apply