Senior Site Reliability Engineer - SRE - 12-month rolling contract

Posted 2024-11-07

💎 Seniority level: Senior

📍 Location: America, UTC-8, UTC-5

🔍 Industry: Digital paper solutions and learning ecosystem

🏢 Company: Goodnotes

🗣️ Languages: English

🪄 Skills: Linux

Requirements:
  • Strong experience working in AWS-hosted environments.
  • Experience supporting production workloads and firefighting.
  • Knowledge of SRE best practices and common issues.
  • Proficient with system monitoring tools.
  • Understanding of and experience with distributed databases.
  • Background in Linux and networking fundamentals.
  • Experience in back-end development, including API usage and creation.
  • Knowledge of network and container security.
  • Understanding of container orchestration, especially Kubernetes.
  • Experience managing relational and non-relational databases, including backup and restore operations.
  • Familiarity with automation/configuration management tools, preferably CDK and/or Terraform.

Responsibilities:
  • Design, build, and maintain the Goodnotes infrastructure according to Dickerson’s Hierarchy of Reliability.
  • Refine and execute new and existing playbooks.
  • Educate teams on SRE best practices including design and capacity planning.
  • Act as a higher-level escalation point for applications.
  • Reduce latency and error rates, and improve SLAs.
  • Enhance system monitoring, health reporting, and logging.
  • Implement security practices and maintain information security.
  • Participate in the on-call rotation during Americas time zone hours.