Platform Engineer (Site Reliability Engineering)
Bitso | Cryptocurrency Fintech
Latin America | Full-Time | Middle
Salary not disclosed
Job Details
- Required Skills: Python, Artificial Intelligence, Java, Kubernetes, CI/CD, DevOps, LLM
Requirements
- Proven ability to operate confidently in high-pressure incident scenarios.
- Hands-on experience with Kubernetes, including deploying and debugging at the pod level.
- Solid understanding of CI/CD pipelines and modern DevOps practices.
- Software development background with the ability to read, write, and debug code.
- Strong automation mindset focused on identifying and eliminating repetitive toil.
- Experience building or working with AI agents or LLM-based workflows.
- Strong interpersonal and written communication skills.
- Self-directed learner capable of contributing without a fully defined path.
- Familiarity with fintech or crypto industry vocabulary.
Responsibilities
- Own and execute on-call shifts including incident acknowledgement, resolution, and communication.
- Lead incident postmortems for Sev1 and Sev2 events, facilitating sessions and tracking action items.
- Identify incident patterns and propose systemic fixes like runbook improvements and platform hardening.
- Build and extend internal automation and tooling, including AI-assisted workflows, to reduce manual toil.
- Contribute to the observability ecosystem by improving dashboards and alert configurations.
- Participate in change and maintenance management processes to reduce deployment risks.
- Collaborate with engineering squads to surface platform risks and drive preventive actions.