Apply

Senior Site Reliability Engineer

Posted about 1 month agoViewed

View full description

πŸ’Ž Seniority level: Senior, Significant and demonstrated experience as a Senior Site Reliability Engineer

πŸ“ Location: LATAM

πŸ’Έ Salary: 51850.0 - 116650.0 USD per year

πŸ” Industry: Remote employment solutions

🏒 Company: Remote - Referral Board

πŸ—£οΈ Languages: English

⏳ Experience: Significant and demonstrated experience as a Senior Site Reliability Engineer

πŸͺ„ Skills: AWSPythonKubernetesGoLinuxTerraform

Requirements:
  • Significant and demonstrated experience as a Senior Site Reliability Engineer.
  • Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
  • Knowledge of CI/CD tools, with a preference for GitLab CI.
  • Experience with a back-end programming language such as Elixir, Clojure, Java, Node.js, or Python.
  • Experience in a programming language used for developing SRE tooling, like Go or Python.
  • Experience running and configuring Linux systems in non-cloud environments.
  • Security knowledge from both defensive and offensive perspectives.
  • Excellent communication and interpersonal skills.
Responsibilities:
  • Managing and improving existing infrastructure.
  • Helping build the next generation of the platform using tools like Kubernetes, Terraform, and Docker.
  • Streamlining and automating deployment processes.
  • Working closely with the Security team to address potential threats and patches.
  • Supporting engineers and product teams to enhance scalability, stability, and reliability.
Apply

Related Jobs

Apply

πŸ“ Americas

🧭 Full-Time

πŸ’Έ 160000.0 - 180000.0 USD per year

πŸ” Software Development

🏒 Company: Customer.ioπŸ‘₯ 251-500πŸ’° Series A about 3 years agoDigital MediaSaaSProduct SearchSoftware

  • 7+ years of professional experience as a Site Reliability Engineer, with proven experience leading large complex projects affecting production SaaS environments.
  • Professional experience with relational database systems, managing the servers and tuning performance, particularly MySQL.
  • Proven experience managing scale, reliability and performance challenges managing distributed applications on cloud infrastructure (Google Cloud Platform is advantageous), both managed and self-hosted solutions.
  • Proven ability to build cloud infrastructure using Terraform and develop operational tooling in various languages including Golang and Bash.
  • Deep knowledge of UNIX environments and modern collaborative development practices.
  • Excellent communication skills, both verbal and written, with a collaborative mindset to make informed, empathetic decisions.
  • Ability to work autonomously in your timezone, advancing tasks and projects with minimal guidance.
  • Demonstrated ability to influence product direction and contribute technical insights that help drive business value.
  • A strong focus on proactive identification and resolving issues in production environments.
  • A self-starter who thrives in both synchronous and asynchronous work environments.
  • Architect and maintain critical infrastructure to enable Customer.io to scale and handle real-time processing of billions of messages.
  • Strategically plan and implement infrastructure growth to meet evolving demands and repeatability.
  • Streamline and automate processes for efficiency and reliability, removing manual toil.
  • Participate in on-call rotations to swiftly address availability incidents and support technical engineers with customer-related issues.
  • Develop observability to ensure comprehensive monitoring and effective alerting of infrastructure and applications.
  • Troubleshoot and resolve production issues across various services and stack levels.
  • Contribute to a collaborative and supportive team environment, fostering individual, professional, and team growth.
  • Engage in continuous learning and knowledge sharing through code reviews, pair programming, and team collaborations to refine best practices.

Backend DevelopmentSQLBashCloud ComputingGCPKubernetesMySQLREST APICI/CDLinuxDevOpsTerraformMicroservicesTroubleshootingSaaS

Posted 29 days ago
Apply
Apply

πŸ“ LATAM

🧭 Full-Time

πŸ’Έ 51850.0 - 116650.0 USD per year

πŸ” Remote Employment and Compliance Solutions

🏒 Company: RemoteπŸ‘₯ 1001-5000πŸ’° $300,000,000 Series C almost 3 years agoπŸ«‚ Last layoff over 2 years agoHuman Resources Services

  • Significant and demonstrated experience as a Senior Site Reliability Engineer, which includes architecting, implementing, and maintaining a Platform for other teams.
  • Solid knowledge and experience in Kubernetes, AWS (or similar Cloud Provider), and Terraform.
  • Knowledge of CI/CD tools (GitLab CI is preferred).
  • Experience with a back-end programming language (Elixir, Clojure, Java, Node.js, Python, etc.).
  • Experience with a programming language for SRE tooling (Go, Python).
  • Experience running and configuring Linux systems in a non-cloud environment.
  • Security knowledge from both defensive and offensive perspectives.
  • Excellent communication and interpersonal skills.
  • Managing and improving our existing infrastructure.
  • Helping build the next generation of our platform using tools like Kubernetes, Terraform, and Docker.
  • Streamlining and automating deployment processes.
  • Working closely with the Security team to address potential threats and patches.
  • Supporting engineers and product teams to improve overall scalability, stability, and reliability.

AWSPythonKubernetesGoCI/CDLinuxTerraform

Posted about 1 month ago
Apply
Apply

πŸ“ Argentina, Brazil

🧭 Full-Time

πŸ’Έ 65000.0 - 90000.0 USD per year

πŸ” Cybersecurity

🏒 Company: SecurityScorecardπŸ‘₯ 251-500πŸ’° $180,000,000 Series E almost 4 years agoSecurityRisk ManagementCyber SecuritySoftware

  • Proven experience as an SRE, DevOps Engineer, or similar role
  • Strong background in CI/CD tools (Jenkins, GitHub Actions, etc.)
  • Experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes)
  • Proficiency with infrastructure as code tools (Terraform, Ansible)
  • Experience with automated testing frameworks (Selenium, JUnit)
  • Knowledge of scripting languages (Python, Bash)
  • Familiarity with monitoring and observability tools (Prometheus, Grafana)
  • Design, implement, and maintain CI/CD pipelines
  • Enhance infrastructure as code practices
  • Optimize deployment rollbacks and improve incident response
  • Develop automated testing strategies
  • Collaborate with developers for application reliability
  • Build monitoring and alerting solutions
  • Drive improvements in observability and metrics collection
  • Participate in on-call rotations

AWSDockerPythonBashJUNITKubernetesGrafanaPrometheusSeleniumCI/CDTerraform

Posted about 1 month ago
Apply
Apply

πŸ“ Worldwide

🧭 Contract

πŸ” Software Development

🏒 Company: Teravision TechnologiesπŸ‘₯ 251-500πŸ’° over 13 years agoAndroidiOSMobile AppsInformation TechnologySoftware

  • Experience managing and maintaining Kubernetes (K8s) infrastructure, including updates, patching, and software configuration management.
  • Familiarity with CI/CD pipelines, particularly TeamCity, and integrating tools like SonarQube.
  • Hands-on experience with AWS services such as S3, Route 53, and others.
  • Strong understanding of backend systems and infrastructure management.
  • Proficiency in troubleshooting, debugging, and ensuring system reliability in production environments.
  • Prior experience in an on-call role.
  • Knowledge of monitoring and alerting tools to support on-call responsibilities.
NOT STATED

AWSKubernetesCI/CDTroubleshootingDebugging

Posted about 2 months ago
Apply