SRE Engineer Contractor
New
I
IPSYE-commerce Beauty
Fully remote from México or Colombia, PST time zoneContractMiddle
SalaryCompetitive salary (USD)
Apply NowOpens the employer's application page
Job Details
- Languages
- English
- Required Skills
- AWSPythonBashCI/CDTerraformDatadog
Requirements
- Hands-on experience with observability and monitoring tools, ideally Datadog.
- Proven experience participating in on-call and incident response.
- Working understanding of SRE fundamentals: SLIs, SLOs, error budgets, and toil reduction.
- Working knowledge of cloud infrastructure (AWS preferred).
- Proficiency in scripting and automation (e.g., Python, Bash).
- Familiarity with CI/CD pipelines and infrastructure-as-code (e.g., Terraform).
- Comfort and curiosity with AI tools (e.g., Claude, Cursor) for debugging and documentation.
- Experience with modern distributed/microservice and API-gateway architectures.
- Strong, calm communication skills for incident channels and RCA documentation.
- Ability to work effectively across a distributed, multi-time-zone team.
- Submit a resume/CV in English.
Responsibilities
- Build and maintain observability across our platform in Datadog: dashboards, monitors, APM, log pipelines, and meaningful, low-noise alerting.
- Define and track SLIs, SLOs, and error budgets for specific services.
- Participate in the on-call rotation and serve as an SRE Partner during incidents.
- Drive incident response per our framework, keeping clear, real-time documentation of status, findings, and decisions.
- Contribute actively to blameless post-incident reviews (RCAs).
- Automate toil by building scripts, tooling, and self-healing mechanisms.
- Leverage AI tools (e.g., Claude, Cursor) to accelerate debugging, runbook maintenance, and RCA drafting.
- Maintain and improve SRE runbooks and triage workflows.
View Full Description & ApplyYou'll be redirected to the employer's site