Senior Site Reliability Engineer

Posted about 1 month agoViewed

View full description

💎 Seniority level: Senior, Significant and demonstrated experience as a Senior Site Reliability Engineer

📍 Location: LATAM

💸 Salary: 51850.0 - 116650.0 USD per year

🔍 Industry: Remote employment solutions

🏢 Company: Remote - Referral Board

🗣️ Languages: English

⏳ Experience: Significant and demonstrated experience as a Senior Site Reliability Engineer

🪄 Skills: AWSPythonKubernetesGoLinuxTerraform

Requirements:

Significant and demonstrated experience as a Senior Site Reliability Engineer.
Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
Knowledge of CI/CD tools, with a preference for GitLab CI.
Experience with a back-end programming language such as Elixir, Clojure, Java, Node.js, or Python.
Experience in a programming language used for developing SRE tooling, like Go or Python.
Experience running and configuring Linux systems in non-cloud environments.
Security knowledge from both defensive and offensive perspectives.
Excellent communication and interpersonal skills.

Responsibilities:

Managing and improving existing infrastructure.
Helping build the next generation of the platform using tools like Kubernetes, Terraform, and Docker.
Streamlining and automating deployment processes.
Working closely with the Security team to address potential threats and patches.
Supporting engineers and product teams to enhance scalability, stability, and reliability.

Apply

Related Jobs

Apply

🔥 Senior Site Reliability Engineer - Americas

Posted 29 days ago

📍 Americas

🧭 Full-Time

💸 160000.0 - 180000.0 USD per year

🔍 Software Development

🏢 Company: Customer.io👥 251-500💰 Series A about 3 years agoDigital Media SaaS Product Search Software

🔧 Requirements

7+ years of professional experience as a Site Reliability Engineer, with proven experience leading large complex projects affecting production SaaS environments.
Professional experience with relational database systems, managing the servers and tuning performance, particularly MySQL.
Proven experience managing scale, reliability and performance challenges managing distributed applications on cloud infrastructure (Google Cloud Platform is advantageous), both managed and self-hosted solutions.
Proven ability to build cloud infrastructure using Terraform and develop operational tooling in various languages including Golang and Bash.
Deep knowledge of UNIX environments and modern collaborative development practices.
Excellent communication skills, both verbal and written, with a collaborative mindset to make informed, empathetic decisions.
Ability to work autonomously in your timezone, advancing tasks and projects with minimal guidance.
Demonstrated ability to influence product direction and contribute technical insights that help drive business value.
A strong focus on proactive identification and resolving issues in production environments.
A self-starter who thrives in both synchronous and asynchronous work environments.

💡 Responsibilities

Architect and maintain critical infrastructure to enable Customer.io to scale and handle real-time processing of billions of messages.
Strategically plan and implement infrastructure growth to meet evolving demands and repeatability.
Streamline and automate processes for efficiency and reliability, removing manual toil.
Participate in on-call rotations to swiftly address availability incidents and support technical engineers with customer-related issues.
Develop observability to ensure comprehensive monitoring and effective alerting of infrastructure and applications.
Troubleshoot and resolve production issues across various services and stack levels.
Contribute to a collaborative and supportive team environment, fostering individual, professional, and team growth.
Engage in continuous learning and knowledge sharing through code reviews, pair programming, and team collaborations to refine best practices.

Backend DevelopmentSQLBashCloud ComputingGCPKubernetesMySQLREST APICI/CDLinuxDevOpsTerraformMicroservicesTroubleshootingSaaS

Posted 29 days ago

Apply

🔥 Senior Site Reliability Engineer

Posted about 1 month ago

📍 LATAM

🧭 Full-Time

💸 51850.0 - 116650.0 USD per year

🔍 Remote Employment and Compliance Solutions

🏢 Company: Remote👥 1001-5000💰 $300,000,000 Series C almost 3 years ago🫂 Last layoff over 2 years agoHuman Resources Services

🔧 Requirements

Significant and demonstrated experience as a Senior Site Reliability Engineer, which includes architecting, implementing, and maintaining a Platform for other teams.
Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
Knowledge of CI/CD tools (GitLab CI is preferred).
Experience with a back-end programming language (Elixir, Clojure, Java, Node.js, Python, etc.).
Experience with a programming language for SRE tooling (Go, Python).
Experience running and configuring Linux systems in a non-cloud environment.
Security knowledge from both defensive and offensive perspectives.
Excellent communication and interpersonal skills.

💡 Responsibilities

Managing and improving our existing infrastructure.
Helping build the next generation of our platform using tools like Kubernetes, Terraform, and Docker.
Streamlining and automating deployment processes.
Working closely with the Security team to address potential threats and patches.
Supporting engineers and product teams to improve overall scalability, stability, and reliability.

AWSPythonKubernetesGoCI/CDLinuxTerraform

Posted about 1 month ago

Apply

🔥 Senior Site Reliability Engineer

Posted about 1 month ago

📍 Argentina, Brazil

🧭 Full-Time

💸 65000.0 - 90000.0 USD per year

🔍 Cybersecurity

🏢 Company: SecurityScorecard👥 251-500💰 $180,000,000 Series E almost 4 years agoSecurity Risk Management Cyber Security Software

🔧 Requirements

Proven experience as an SRE, DevOps Engineer, or similar role
Strong background in CI/CD tools (Jenkins, GitHub Actions, etc.)
Experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes)
Proficiency with infrastructure as code tools (Terraform, Ansible)
Experience with automated testing frameworks (Selenium, JUnit)
Knowledge of scripting languages (Python, Bash)
Familiarity with monitoring and observability tools (Prometheus, Grafana)

💡 Responsibilities

Design, implement, and maintain CI/CD pipelines
Enhance infrastructure as code practices
Optimize deployment rollbacks and improve incident response
Develop automated testing strategies
Collaborate with developers for application reliability
Build monitoring and alerting solutions
Drive improvements in observability and metrics collection
Participate in on-call rotations

AWSDockerPythonBashJUNITKubernetesGrafanaPrometheusSeleniumCI/CDTerraform

Posted about 1 month ago

Apply

🔥 Senior Site Reliability Engineer

Posted about 2 months ago

📍 Worldwide

🧭 Contract

🔍 Software Development

🏢 Company: Teravision Technologies👥 251-500💰 over 13 years agoAndroid iOS Mobile Apps Information Technology Software

🔧 Requirements

Experience managing and maintaining Kubernetes (K8s) infrastructure, including updates, patching, and software configuration management.
Familiarity with CI/CD pipelines, particularly TeamCity, and integrating tools like SonarQube.
Hands-on experience with AWS services such as S3, Route 53, and others.
Strong understanding of backend systems and infrastructure management.
Proficiency in troubleshooting, debugging, and ensuring system reliability in production environments.
Prior experience in an on-call role.
Knowledge of monitoring and alerting tools to support on-call responsibilities.

💡 Responsibilities

NOT STATED

AWSKubernetesCI/CDTroubleshootingDebugging

Posted about 2 months ago

Apply