Manage, monitor, and maintain Ubuntu and Red Hat Linux servers
Perform system upgrades, kernel updates, patch management, and performance tuning
Implement and enforce security policies, user access controls, and backup/recovery strategies
Troubleshoot hardware, OS, and network-related issues
Maintain configuration management and deployment pipelines using Ansible, Puppet, or similar tools
Monitor system health, resource utilization, and AI/ML workloads
Collaborate with DevOps, Cloud, and Software teams for environment provisioning and infrastructure scaling
Participate in capacity planning, disaster recovery, and incident response activities
Maintain detailed documentation, SOPs, and audit reports