Continuously monitor alerting channels (PagerDuty, Datadog, CloudWatch, Prometheus/Grafana), validate alerts, filter out false positives, and provide first-line support for site operations and infrastructure issues.
Serve as the communication hub during incidents: provide regular status updates to stakeholders, escalate verified incidents to the appropriate on-call teams, and maintain incident bridges with proper handoffs.
Execute documented runbooks and standard operating procedures for common issues, including infrastructure access requests, basic troubleshooting, and deployment support.
Perform initial investigation of security alerts, monitor application performance, and process routine change requests, configuration updates, and maintenance tasks for operational teams.
Create and maintain operational runbooks, update documentation based on incident learnings, and contribute to post-incident reviews to drive continuous improvement.
Assist with monitoring configuration, including adding new monitors, adjusting alert thresholds, and tuning alerting systems to reduce noise and improve signal quality.
Work independently during off-hours shifts in a remote, global team environment while collaborating closely with teammates and knowing when to escalate complex issues.