Design, develop, and maintain robust monitoring and alerting tools for mission-critical systems Automate production testing to ensure reliability and scalability Proactively identify, troubleshoot, and resolve performance and software issues Track, update, and drive resolution of technical problems across teams Suggest and implement architectural enhancements and process improvements Research, evaluate, and recommend new technologies and vendor solutions Safeguard critical systems by applying best-in-class security practices and solutions