Apply📍 Ireland
🔍 Enterprise Technology Management
- Extensive experience with container orchestration and managing production clusters, focusing on deployment, scaling, and troubleshooting within Kubernetes environments.
- Proficiency in tools like Ansible, Helm, and Kustomize for automating infrastructure provisioning, configuration, and deployment.
- Experience with Prometheus, Grafana, or similar to proactively track system health, detect anomalies, and optimize performance across the platform.
- Deep knowledge of the AWS ecosystem, including EC2, S3, IAM, VPC, and other essential services for building and managing scalable infrastructure.
- Hands-on experience with Terraform to provision and manage cloud resources, ensuring version control, repeatability, and efficiency in infrastructure deployment.
- Familiarity with message queuing systems like RabbitMQ and Kafka, as well as managed queuing services such as AmazonMQ.
- Strong background in managing MySQL databases and leveraging Amazon RDS for high availability, performance tuning, and secure database management in cloud environments.
- Understanding of network design and security protocols to protect systems, enforce compliance, and meet industry-standard audit requirements.
- Experience ensuring high uptime agreements for critical systems, implementing strategies for fault tolerance, disaster recovery, and proactive monitoring to maintain service availability and minimize downtime.
- Proven ability to work effectively with cross-functional teams from multiple departments to achieve project goals and execute project plans in an orderly and efficient manner.
- Ability to develop and maintain code in one or more high-level programming languages such as Python, Go, or JavaScript.
- Gather and analyze metrics from our platform and applications to continually improve our performance tuning and fault finding.
- Partner with our world-class engineering teams to improve services through rigorous testing and release procedures.
- Create sustainable systems and services through automation and uplifts while working closely with engineering professionals within the company to enable projects to be completed efficiently.
- Develop, monitor, and manage the entire system landscape by balancing feature development speed and reliability with well-defined service level objectives, ensuring minimal downtime and maximum availability.
- Participate in the development and implementation of practices, procedures, and technology to ensure our system landscapes are operating within our Security, Compliance, and Availability commitments.
- Plan, prepare, and execute system upgrades.
- Mentor and train other engineers throughout the company and seek to continually improve processes company-wide.
AWSPythonAmazon RDSJavascriptKafkaKubernetesMySQLRabbitmqGoGrafanaPrometheusCollaborationCI/CDJavaScript
Posted 2024-10-05
Apply