ApplySenior DevOps Engineer (AWS) - Europe (Remote, f/m/d)
Posted 4 months agoViewed
View full description
Requirements:
- 5+ years experience working as Cloud Engineer, Site Reliability Engineer, DevOps Engineer or similar role
- Firm grasp on Kubernetes with Helm and container composition using Docker
- Excellent knowledge and working experience with AWS
- Ability to program (structured and OO) with multiple high level languages, such as TypeScript, Rust, C/C++, C#, Python, GoLang or Java
- Extensive experience with Application Performance Monitoring, Tracing and Logging software such as DataDog, Kibana, Graphana, HoneyComb, Harvest etc.
- Good experience with event driven architecture using distributed message brokers such as Apache Kafka, AWS SQS/SNS, RabbitMQ or Redis Streams
- Good knowledge of dynamic resource management frameworks (Kubernetes, Yarn, Helm, cdk8s, AWS CDK, Terraform)
- A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
- Basic knowledge of common tools and pipelines in the NodeJS/NPM ecosystem
- Problem-solving attitude and team spirit
- Preferred: AWS and/or Kubernetes certifications
Responsibilities:
- Run the production environment by monitoring availability and taking a holistic view of system health
- Plan, guide and review the contributions of your team members
- Build software and systems to manage platform infrastructure and applications
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
- Provide primary operational support and engineering for our digital advertisement SaaS platform
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Partner with development teams to improve services through rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through automation and uplifts
- Balance feature development speed and reliability with well-defined service level objectives
Apply