Platform Engineer (Site Reliability Engineering)

New
B
BitsoCryptocurrency Fintech
Latin AmericaFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
PythonArtificial IntelligenceJavaKubernetesCI/CDDevOpsLLM

Requirements

  • Proven ability to operate confidently in high-pressure incident scenarios.
  • Hands-on experience with Kubernetes, including deploying and debugging at the pod level.
  • Solid understanding of CI/CD pipelines and modern DevOps practices.
  • Software development background with the ability to read, write, and debug code.
  • Strong automation mindset focused on identifying and eliminating repetitive toil.
  • Experience building or working with AI agents or LLM-based workflows.
  • Strong interpersonal and written communication skills.
  • Self-directed learner capable of contributing without a fully defined path.
  • Familiarity with fintech or crypto industry vocabulary.

Responsibilities

  • Own and execute on-call shifts including incident acknowledgement, resolution, and communication.
  • Lead incident postmortems for Sev1 and Sev2 events, facilitating sessions and tracking action items.
  • Identify incident patterns and propose systemic fixes like runbook improvements and platform hardening.
  • Build and extend internal automation and tooling, including AI-assisted workflows, to reduce manual toil.
  • Contribute to the observability ecosystem by improving dashboards and alert configurations.
  • Participate in change and maintenance management processes to reduce deployment risks.
  • Collaborate with engineering squads to surface platform risks and drive preventive actions.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now