Senior Site Reliability Engineer - SRE - 12 months rolling contract
Posted about 1 month ago
💎 Seniority level: Senior
📍 Location: Americas, UTC-8, UTC-5
🔍 Industry: Digital paper and learning solutions
🗣️ Languages: English
🪄 Skills: AWS, PostgreSQL, Kotlin, Kubernetes, MongoDB, TypeScript, Go, CI/CD, Linux, Terraform, Microservices, Networking
Requirements:
- Strong experience working in an AWS-hosted environment.
- Experience in supporting production workloads and firefighting.
- Knowledge of SRE best practices and common issues.
- Experience with system monitoring tools.
- Understanding of, and experience with, distributed databases.
- Solid understanding of Linux and networking fundamentals.
- Background in back-end development, including API usage and creation.
- Knowledge of network and container security.
- Understanding of container orchestration, especially Kubernetes.
- Experience managing relational and non-relational databases, including backup and restore operations.
- Familiarity with automation/configuration management tools, preferably CDK or Terraform.
Responsibilities:
- Design, build, and maintain the Goodnotes infrastructure, adhering to Dickerson's Hierarchy of Reliability.
- Design, refine, and execute new and existing playbooks.
- Educate various teams in SRE best practices, assisting in design and capacity planning.
- Serve as the go-to person for higher-level escalation for applications.
- Improve SLAs and optimize latency and error rates.
- Enhance system monitoring, health reporting, and logging.
- Implement and maintain security practices.