SRE Engineer Contractor

New
I
IPSYE-commerce Beauty
Fully remote from México or Colombia, PST time zoneContractMiddle
SalaryCompetitive salary (USD)
Apply NowOpens the employer's application page

Job Details

Languages
English
Required Skills
AWSPythonBashCI/CDTerraformDatadog

Requirements

  • Hands-on experience with observability and monitoring tools, ideally Datadog.
  • Proven experience participating in on-call and incident response.
  • Working understanding of SRE fundamentals: SLIs, SLOs, error budgets, and toil reduction.
  • Working knowledge of cloud infrastructure (AWS preferred).
  • Proficiency in scripting and automation (e.g., Python, Bash).
  • Familiarity with CI/CD pipelines and infrastructure-as-code (e.g., Terraform).
  • Comfort and curiosity with AI tools (e.g., Claude, Cursor) for debugging and documentation.
  • Experience with modern distributed/microservice and API-gateway architectures.
  • Strong, calm communication skills for incident channels and RCA documentation.
  • Ability to work effectively across a distributed, multi-time-zone team.
  • Submit a resume/CV in English.

Responsibilities

  • Build and maintain observability across our platform in Datadog: dashboards, monitors, APM, log pipelines, and meaningful, low-noise alerting.
  • Define and track SLIs, SLOs, and error budgets for specific services.
  • Participate in the on-call rotation and serve as an SRE Partner during incidents.
  • Drive incident response per our framework, keeping clear, real-time documentation of status, findings, and decisions.
  • Contribute actively to blameless post-incident reviews (RCAs).
  • Automate toil by building scripts, tooling, and self-healing mechanisms.
  • Leverage AI tools (e.g., Claude, Cursor) to accelerate debugging, runbook maintenance, and RCA drafting.
  • Maintain and improve SRE runbooks and triage workflows.
View Full Description & ApplyYou'll be redirected to the employer's site
Competitive salary (USD)
Apply Now