- Manage monitoring, alerting, incident response, and performance tuning for production services.
- Maintain infrastructure as code and deployment workflows using Kubernetes and OpenTofu/Terraform on AWS and GCP.
- Provide senior-level code reviews for backend and infrastructure teams while supporting frontend initiatives.
- Collaborate with Customer Success to triage and resolve user-impacting technical issues.
- Contribute to high-level architecture and systems design discussions.
- Champion engineering quality through testing, observability, and operational standards.
- Own service lifecycle management, from development through production monitoring and incident response.
AWS, Node.js, PostgreSQL