Identify and address networking issues and security vulnerabilities, implementing robust solutions as needed. Drive reliability improvements for our applications, ensuring scalability and stability as we grow. Build and configure tools to enhance the security and resilience of our underlying infrastructure. Implement best practices for scaling, load balancing, and disaster recovery. Collaborate with engineering teams in technical and product discussions, providing guidance on technical direction and facilitating decision-making. Conduct post-incident reviews to analyse system failures, design and implement engineering solutions to prevent recurrence. Leverage our observability tools (OpenTelemetry, Prometheus, Loki, Grafana) to monitor and improve system performance.