Apply

Site Reliability Engineer

Posted over 1 year agoViewed

View full description

💸 Salary: $140,000 usd to $165,000 usd

🔍 Industry: Education technology

🗣️ Languages: English

🪄 Skills: AWS

Requirements:
Expertise in building and maintaining infrastructure on aws, understanding of event-driven architecture, experience with aws lambda and kinesis, ability to analyze and synthesize data, strong attention to detail, excellent communication skills, ability to work iteratively and multitask
Responsibilities:
Own the infrastructure for recommender systems and application data services, ensure slas are met, build systems for end-to-end delivery of data and functionality, propose and implement features to improve products, evangelize data and ml capabilitiesApply

Related Jobs

Apply

📍 LATAM

🧭 Full-Time

💸 51850.0 - 116650.0 USD per year

🔍 Remote employment solutions

🏢 Company: Remote - Referral Board

  • Significant and demonstrated experience as a Senior Site Reliability Engineer.
  • Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
  • Knowledge of CI/CD tools, with a preference for GitLab CI.
  • Experience with a back-end programming language such as Elixir, Clojure, Java, Node.js, or Python.
  • Experience in a programming language used for developing SRE tooling, like Go or Python.
  • Experience running and configuring Linux systems in non-cloud environments.
  • Security knowledge from both defensive and offensive perspectives.
  • Excellent communication and interpersonal skills.
  • Managing and improving existing infrastructure.
  • Helping build the next generation of the platform using tools like Kubernetes, Terraform, and Docker.
  • Streamlining and automating deployment processes.
  • Working closely with the Security team to address potential threats and patches.
  • Supporting engineers and product teams to enhance scalability, stability, and reliability.

AWSPythonKubernetesGoLinuxTerraform

Posted 3 days ago
Apply
Apply

📍 LATAM

🧭 Full-Time

💸 51850.0 - 116650.0 USD per year

🔍 Remote Employment and Compliance Solutions

🏢 Company: Remote👥 1001-5000💰 $300,000,000 Series C almost 3 years ago🫂 Last layoff over 2 years agoHuman Resources Services

  • Significant and demonstrated experience as a Senior Site Reliability Engineer, which includes architecting, implementing, and maintaining a Platform for other teams.
  • Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
  • Knowledge of CI/CD tools (GitLab CI is preferred).
  • Experience with a back-end programming language (Elixir, Clojure, Java, Node.js, Python, etc.).
  • Experience with a programming language for SRE tooling (Go, Python).
  • Experience running and configuring Linux systems in a non-cloud environment.
  • Security knowledge from both defensive and offensive perspectives.
  • Excellent communication and interpersonal skills.
  • Managing and improving our existing infrastructure.
  • Helping build the next generation of our platform using tools like Kubernetes, Terraform, and Docker.
  • Streamlining and automating deployment processes.
  • Working closely with the Security team to address potential threats and patches.
  • Supporting engineers and product teams to improve overall scalability, stability, and reliability.

AWSPythonKubernetesGoCI/CDLinuxTerraform

Posted 4 days ago
Apply
Apply

📍 United Kingdom

🧭 Contract

🔍 Restaurant industry

NOT STATED
  • Partner with Engineering and Product Managers.
  • Learn and improve system availability.
  • Sharpen execution skills to provide an amazing experience for customers.

AWSDockerPythonSQLCI/CDDevOpsMicroservices

Posted 11 days ago
Apply
Apply

📍 Germany, Sweden

🧭 Full-Time

🔍 Cloud computing, Software development

🏢 Company: Divio👥 11-50💰 Series G over 21 years agoVideoDigital MediaVideo StreamingSemiconductorOptical Communication

  • Strong proficiency with TypeScript and modern JavaScript concepts.
  • Hands-on experience with AWS services including S3, CloudFront, Lambdas, CodePipeline, CodeBuild, and DynamoDB.
  • Familiarity with AWS CDK using TypeScript.
  • Understanding of TCP/IP network stack and web related protocols.
  • Experience integrating APIs with Storyblok.
  • Familiarity with Git and collaborative development workflows.
  • Good Linux system administration skills.
  • Familiarity with other IaaS providers like Azure, and backing services such as Postgres, Redis, RabbitMQ, and Elasticsearch.
  • Familiarity with configuration management software like Ansible and Terraform.
  • Excellent communication skills and customer-focused approach.
  • Design, develop, and maintain client-specific infrastructure and solutions with an emphasis on security, stability, performance, and cost-effectiveness.
  • Collaborate with customers in agile sprints to understand their challenges and goals.
  • Deliver optimal AWS- and TypeScript-based solutions and functionalities.
  • Contribute to long-term infrastructure strategies and standards, championing best practices.
  • Provide on-call support through monitoring and support duty rotation, proactively addressing potential issues.
  • Collaborate with internal and client teams, sharing knowledge with proactive communication.

AWSDynamoDBElasticSearchGitRabbitmqTypeScriptRedisLinuxTerraformAnsible

Posted 12 days ago
Apply
Apply

📍 Colombia, USA

🧭 Contractor

🔍 Software outsourcing

🏢 Company: Teravision Technologies👥 251-500💰 about 13 years agoAndroidiOSMobile AppsInformation TechnologySoftware

  • Proven experience managing the Kubernetes infrastructure.
  • Familiarity with CI/CD pipelines, particularly TeamCity and tools like SonarQube.
  • Hands-on experience with AWS services such as S3, Route 53, etc.
  • Strong understanding of backend systems and infrastructure management.
  • Excellent English communication skills and a Bachelor’s Degree in Computer Science or equivalent work experience.
  • Proven experience managing and maintaining Kubernetes (K8s) infrastructure, including updates, patching, and software configuration management.
  • Proficiency in troubleshooting, debugging, and ensuring system reliability in production environments.
  • Prior experience in an on-call role and knowledge of monitoring and alerting tools to support on-call responsibilities.

AWSKubernetesCI/CDTroubleshootingDebugging

Posted 13 days ago
Apply
Apply

📍 Nigeria

  • Extensive experience in administration.
  • System administration for cloud infrastructure (AWS primarily).
  • Knowledge of multi-cloud infrastructure.
  • Experience in process automation.
  • Experience in site reliability.
  • Ability to optimize performance of IT infrastructure.
  • Manage company's IT infrastructure.
  • Upgrade and install hardware and software.
  • Troubleshoot to resolve IT issues.
  • Maintain networks and servers.

AWSDockerCloud ComputingCI/CDLinuxDevOpsTerraformNetworkingTroubleshooting

Posted 14 days ago
Apply
Apply

📍 Portugal

🔍 IT infrastructure

  • Extensive experience in administration.
  • System administration for cloud infrastructure, primarily AWS.
  • Knowledge of multi-cloud infrastructure.
  • Experience in process automation.
  • Site reliability experience.
  • Ability to optimize IT infrastructure performance.
  • Manage the company's IT infrastructure.
  • Upgrade and install hardware and software.
  • Troubleshoot to resolve IT issues.
  • Maintain networks and servers.

AWSCloud ComputingLinuxDevOpsTerraform

Posted 14 days ago
Apply
Apply

📍 Pakistan

🔍 IT/Technology

  • Extensive experience in IT administration.
  • Experience in system administration for cloud infrastructure, primarily AWS.
  • Knowledge of multi-cloud infrastructure.
  • Experience in process automation.
  • Knowledge of site reliability principles.
  • Ability to optimize the performance of IT infrastructure.
  • Manage the company's IT infrastructure.
  • Upgrade and install hardware and software.
  • Troubleshoot and resolve IT issues.
  • Maintain the company's networks and servers.

AWSDockerCI/CDLinuxTerraformMicroservicesNetworkingTroubleshooting

Posted 14 days ago
Apply
Apply

📍 Vietnam

  • Possess extensive experience in IT administration.
  • Have expertise in system administration for cloud infrastructure, primarily AWS.
  • Knowledge of multi-cloud infrastructure is a plus.
  • Demonstrated ability in process automation.
  • Understanding of site reliability principles.
  • Skills to optimize the performance of IT infrastructure.
  • Manage the company's IT infrastructure effectively.
  • Upgrade and install necessary hardware and software.
  • Troubleshoot IT issues to ensure smooth operations.
  • Maintain networks and servers to guarantee reliability.

AWSLinuxDevOpsTerraform

Posted 14 days ago
Apply
Apply

📍 India

  • Extensive experience in IT administration.
  • Proficient in system administration for cloud infrastructure, primarily AWS.
  • Knowledge of multi-cloud infrastructure.
  • Skilled in process automation.
  • Experience in site reliability.
  • Ability to optimize performance of IT infrastructure.
  • Manage and oversee the company's IT infrastructure.
  • Upgrade and install necessary hardware and software.
  • Troubleshoot and resolve IT issues as they arise.
  • Maintain and monitor networks and servers.

AWSCloud ComputingCI/CDLinuxDevOpsTerraformNetworkingTroubleshooting

Posted 14 days ago
Apply