3+ years managing large-scale production environments. Comfortable with 24/7 on-call responsibilities and incident response. Strong Linux/Unix system administration skills. Understanding of TCP/IP, DNS, load balancing, and network security. Experience with SQL and NoSQL databases in production environments. Proficiency in at least two of: Python, Shell, PHP, Java, or similar languages. Experience with AWS, GCP, or Azure infrastructure and services. Hands-on experience with Docker, Kubernetes, and container orchestration. Experience with Prometheus, Grafana, ELK stack, or similar tools. Proficiency with Terraform, CloudFormation, or similar tools. Expert-level Git usage and collaborative development practices. Experience defining and maintaining service level objectives. Understanding of error budget concepts and implementation. Track record of identifying and eliminating repetitive manual work. Experience with performance testing and capacity management. Experience with microservices architecture and distributed systems. Knowledge of security best practices and compliance frameworks.