Site Reliability Engineer

Posted over 1 year ago

💸 Salary: $140,000 to $165,000 USD

🔍 Industry: Education technology

🗣️ Languages: English

🪄 Skills: AWS

Requirements:

Expertise in building and maintaining infrastructure on AWS.
Understanding of event-driven architecture.
Experience with AWS Lambda and Kinesis.
Ability to analyze and synthesize data.
Strong attention to detail.
Excellent communication skills.
Ability to work iteratively and multitask.

Responsibilities:

Own the infrastructure for recommender systems and application data services.
Ensure SLAs are met.
Build systems for end-to-end delivery of data and functionality.
Propose and implement features to improve products.
Evangelize data and ML capabilities.

Related Jobs

🔥 Senior Site Reliability Engineer I

Posted 3 days ago

📍 LATAM

🧭 Full-Time

💸 51,850 - 116,650 USD per year

🔍 Remote employment solutions

🏢 Company: Remote - Referral Board

🔧 Requirements

Significant and demonstrated experience as a Senior Site Reliability Engineer.
Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
Knowledge of CI/CD tools, with a preference for GitLab CI.
Experience with a back-end programming language such as Elixir, Clojure, Java, Node.js, or Python.
Experience in a programming language used for developing SRE tooling, like Go or Python.
Experience running and configuring Linux systems in non-cloud environments.
Security knowledge from both defensive and offensive perspectives.
Excellent communication and interpersonal skills.

💡 Responsibilities

Managing and improving existing infrastructure.
Helping build the next generation of the platform using tools like Kubernetes, Terraform, and Docker.
Streamlining and automating deployment processes.
Working closely with the Security team to address potential threats and patches.
Supporting engineers and product teams to enhance scalability, stability, and reliability.

AWS, Python, Kubernetes, Go, Linux, Terraform

🔥 Senior Site Reliability Engineer I

Posted 4 days ago

📍 LATAM

🧭 Full-Time

💸 51,850 - 116,650 USD per year

🔍 Remote Employment and Compliance Solutions

🏢 Company: Remote

👥 1001-5000

💰 $300,000,000 Series C almost 3 years ago

🫂 Last layoff over 2 years ago

Human Resources Services

🔧 Requirements

Significant and demonstrated experience as a Senior Site Reliability Engineer, which includes architecting, implementing, and maintaining a Platform for other teams.
Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
Knowledge of CI/CD tools (GitLab CI is preferred).
Experience with a back-end programming language (Elixir, Clojure, Java, Node.js, Python, etc.).
Experience with a programming language for SRE tooling (Go, Python).
Experience running and configuring Linux systems in a non-cloud environment.
Security knowledge from both defensive and offensive perspectives.
Excellent communication and interpersonal skills.

💡 Responsibilities

Managing and improving our existing infrastructure.
Helping build the next generation of our platform using tools like Kubernetes, Terraform, and Docker.
Streamlining and automating deployment processes.
Working closely with the Security team to address potential threats and patches.
Supporting engineers and product teams to improve overall scalability, stability, and reliability.

AWS, Python, Kubernetes, Go, CI/CD, Linux, Terraform

🔥 Senior Site Reliability Engineer [United Kingdom]

Posted 11 days ago

📍 United Kingdom

🧭 Contract

🔍 Restaurant industry

🔧 Requirements

NOT STATED

💡 Responsibilities

Partner with Engineering and Product Managers.
Learn and improve system availability.
Sharpen execution skills to provide an amazing experience for customers.

AWS, Docker, Python, SQL, CI/CD, DevOps, Microservices

🔥 Senior DevOps/Site Reliability Engineer - Client Infrastructure (full time)

Posted 12 days ago

📍 Germany, Sweden

🧭 Full-Time

🔍 Cloud computing, Software development

🏢 Company: Divio

👥 11-50

💰 Series G over 21 years ago

Video Digital Media Video Streaming Semiconductor Optical Communication

🔧 Requirements

Strong proficiency with TypeScript and modern JavaScript concepts.
Hands-on experience with AWS services including S3, CloudFront, Lambdas, CodePipeline, CodeBuild, and DynamoDB.
Familiarity with AWS CDK using TypeScript.
Understanding of TCP/IP network stack and web related protocols.
Experience integrating APIs with Storyblok.
Familiarity with Git and collaborative development workflows.
Good Linux system administration skills.
Familiarity with other IaaS providers like Azure, and backing services such as Postgres, Redis, RabbitMQ, and Elasticsearch.
Familiarity with configuration management software like Ansible and Terraform.
Excellent communication skills and customer-focused approach.

💡 Responsibilities

Design, develop, and maintain client-specific infrastructure and solutions with an emphasis on security, stability, performance, and cost-effectiveness.
Collaborate with customers in agile sprints to understand their challenges and goals.
Deliver optimal AWS- and TypeScript-based solutions and functionalities.
Contribute to long-term infrastructure strategies and standards, championing best practices.
Provide on-call support through monitoring and support duty rotation, proactively addressing potential issues.
Collaborate with internal and client teams, sharing knowledge with proactive communication.

AWS, DynamoDB, ElasticSearch, Git, RabbitMQ, TypeScript, Redis, Linux, Terraform, Ansible

🔥 Senior Site Reliability Engineer

Posted 13 days ago

📍 Colombia, USA

🧭 Contractor

🔍 Software outsourcing

🏢 Company: Teravision Technologies

👥 251-500

💰 about 13 years ago

Android iOS Mobile Apps Information Technology Software

🔧 Requirements

Proven experience managing Kubernetes infrastructure.
Familiarity with CI/CD pipelines, particularly TeamCity and tools like SonarQube.
Hands-on experience with AWS services such as S3, Route 53, etc.
Strong understanding of backend systems and infrastructure management.
Excellent English communication skills and a Bachelor’s Degree in Computer Science or equivalent work experience.

💡 Responsibilities

Proven experience managing and maintaining Kubernetes (K8s) infrastructure, including updates, patching, and software configuration management.
Proficiency in troubleshooting, debugging, and ensuring system reliability in production environments.
Prior experience in an on-call role and knowledge of monitoring and alerting tools to support on-call responsibilities.

AWS, Kubernetes, CI/CD, Troubleshooting, Debugging

🔥 DevOps/Site Reliability Engineer (Nigeria-Remote)

Posted 14 days ago

📍 Nigeria

🔧 Requirements

Extensive experience in IT administration.
System administration for cloud infrastructure (AWS primarily).
Knowledge of multi-cloud infrastructure.
Experience in process automation.
Experience in site reliability.
Ability to optimize performance of IT infrastructure.

💡 Responsibilities

Manage company's IT infrastructure.
Upgrade and install hardware and software.
Troubleshoot to resolve IT issues.
Maintain networks and servers.

AWS, Docker, Cloud Computing, CI/CD, Linux, DevOps, Terraform, Networking, Troubleshooting

🔥 DevOps/Site Reliability Engineer (Lisbon-Remote)

Posted 14 days ago

📍 Portugal

🔍 IT infrastructure

🔧 Requirements

Extensive experience in IT administration.
System administration for cloud infrastructure, primarily AWS.
Knowledge of multi-cloud infrastructure.
Experience in process automation.
Site reliability experience.
Ability to optimize IT infrastructure performance.

💡 Responsibilities

Manage the company's IT infrastructure.
Upgrade and install hardware and software.
Troubleshoot to resolve IT issues.
Maintain networks and servers.

AWS, Cloud Computing, Linux, DevOps, Terraform

🔥 DevOps/Site Reliability Engineer (Islamabad-Remote)

Posted 14 days ago

📍 Pakistan

🔍 IT/Technology

🔧 Requirements

Extensive experience in IT administration.
Experience in system administration for cloud infrastructure, primarily AWS.
Knowledge of multi-cloud infrastructure.
Experience in process automation.
Knowledge of site reliability principles.
Ability to optimize the performance of IT infrastructure.

💡 Responsibilities

Manage the company's IT infrastructure.
Upgrade and install hardware and software.
Troubleshoot and resolve IT issues.
Maintain the company's networks and servers.

AWS, Docker, CI/CD, Linux, Terraform, Microservices, Networking, Troubleshooting

🔥 DevOps/Site Reliability Engineer (Hanoi-Remote)

Posted 14 days ago

📍 Vietnam

🔧 Requirements

Possess extensive experience in IT administration.
Have expertise in system administration for cloud infrastructure, primarily AWS.
Knowledge of multi-cloud infrastructure is a plus.
Demonstrated ability in process automation.
Understanding of site reliability principles.
Skills to optimize the performance of IT infrastructure.

💡 Responsibilities

Manage the company's IT infrastructure effectively.
Upgrade and install necessary hardware and software.
Troubleshoot IT issues to ensure smooth operations.
Maintain networks and servers to guarantee reliability.

AWS, Linux, DevOps, Terraform

🔥 DevOps/Site Reliability Engineer (Delhi-Remote)

Posted 14 days ago

📍 India

🔧 Requirements

Extensive experience in IT administration.
Proficient in system administration for cloud infrastructure, primarily AWS.
Knowledge of multi-cloud infrastructure.
Skilled in process automation.
Experience in site reliability.
Ability to optimize performance of IT infrastructure.

💡 Responsibilities

Manage and oversee the company's IT infrastructure.
Upgrade and install necessary hardware and software.
Troubleshoot and resolve IT issues as they arise.
Maintain and monitor networks and servers.

AWS, Cloud Computing, CI/CD, Linux, DevOps, Terraform, Networking, Troubleshooting
