Senior Infrastructure Engineer

Posted 2024-10-20

Seniority level: Senior

Location: EMEA, PST, EST, GMT

Industry: Database and Software Solutions

Company: MongoDB

Skills: AWS, Git, Kubernetes, Microsoft Azure, OAuth, Go, Grafana, Prometheus, Collaboration, CI/CD, Linux

Requirements:
  • Pragmatic, detail-oriented, self-motivated, and understands the benefits of collaboration.
  • Provides guidance and coaching to entry- to mid-level engineers.
  • Takes a software-driven approach to solving problems and routinely uses git to track progress.
  • Familiar with software engineering principles, dependency injection, composition, and test-driven development.
  • Experience designing and implementing medium- to large-scale software projects (preferably with Go).
  • Familiar with standard authentication protocols (e.g., OAuth).
  • Familiar with the development of web services and/or Kubernetes controllers.
  • Experienced in deep technical analysis and in fixing applications, systems, and networks.
  • Strong Linux and TCP/IP networking skills.
  • Solid knowledge of cloud infrastructure.
  • Experience with configuration management tools and managing infrastructure through code.
  • Familiar with how to use CI/CD workflows and tooling to deploy production services.
  • Experience running containers in a production environment, preferably Kubernetes-based.
  • Experience with observability concepts and tooling: metrics, logging, traces, Prometheus, Grafana, and OpenTelemetry.
  • Has practical knowledge of delivering production-level services with SLIs/SLOs and understands how to measure, track, and adjust them.
Responsibilities:
  • Work with engineering teams across MongoDB to investigate gaps and limitations in existing development workflows and understand new infrastructure and platform requirements.
  • Design self-service platform services and developer tooling that focus on reliability and usability and provide the appropriate level of abstraction from cloud infrastructure.
  • Regularly write and review automation, configuration management, and application code.
  • Author and review functional specifications and scoping documents for large platform projects and services.
  • Own and operate much of the internal development platform that runs MongoDB.
  • Work on a distributed team that frequently interacts with remote engineers across multiple time zones.

Related Jobs

Location: Romania

Full-Time

Industry: Analytics engineering

Company: dbt Labs

Requirements:
  • Experience with AWS, Azure, or GCP, Terraform, Kubernetes, Python, and Bash.
  • Solid experience with declarative Infrastructure as Code, ideally with Terraform or a willingness to learn.
  • Experience working asynchronously in a fully-remote, distributed team.
  • Excellent communication and writing skills.
Responsibilities:

  • Design, operate, and support infrastructure systems with parity across tenancy models (single vs multi) and public clouds (AWS and Azure).
  • Work with engineering teams to consistently deploy their services to those environments.
  • Help create a great developer experience collaborating with Architecture, SRE, Release Engineering, and Security teams.
  • Participate in a balanced on-call rotation and help upgrade tooling to reduce toil.

Skills: AWS, Python, Bash, Kubernetes, Azure, Terraform

Posted 2024-11-14

Location: Georgia

Full-Time

Industry: Integration and automation software

Company: Workato

Requirements:
  • 7+ years of professional experience in hands-on engineering roles (DevOps/SRE), with a BS or MS in Computer Science (or equivalent).
  • 1+ year of experience with hosting AI models (MLflow, AWS SageMaker, Azure AI, Kubernetes).
  • 1+ year of experience with MLOps (MLflow, vector databases, Dagster).
  • Strong experience managing Kubernetes clusters and workloads using EKS.
  • Proficiency in Python; knowledge of Go, Ruby, or JavaScript is a plus.
  • Experience with CI/CD tools like GitHub Actions or GitLab CI.
  • Expertise in deploying Kubernetes-based services using Kustomize, Helm, and GitOps tools.
  • Hands-on experience with AWS architectures, networking fundamentals, and web services.
  • Experience using Infrastructure as Code tools like Terraform.
  • Knowledge of container technologies and best practices.
Responsibilities:

  • As a Senior Infrastructure Engineer, you will be responsible for deploying, scaling, and maintaining services for the ML/AI team.
  • You will work closely with ML Engineers and Data Scientists as part of a flexible team.
  • Your role will have a direct impact on the modernization and maturation of the platform, including infrastructure architecture decisions.

DevOps

Posted 2024-11-07

Location: USA, UK, Germany, France, Canada, India, Chile

Full-Time

Industry: Automation

Company: Make

Requirements:
  • At least 5 years of experience in managing and operating Linux/Unix-based infrastructure.
  • Knowledge of at least one cloud provider, ideally AWS.
  • Day-to-day experience with a container orchestration platform, preferably Kubernetes.
  • Proficiency in Infrastructure as Code practices and tools such as Terraform.
  • Hands-on experience with CI/CD tools and various deployment strategies.
  • Understanding of Service Level Indicators, Objectives, and Agreements.
  • Effective communication skills in English.
  • Openness to knowledge sharing and mentoring.
  • Experience with troubleshooting and debugging issues.
  • Working knowledge of programming/scripting languages like Python or Go.
Responsibilities:

  • Design, build, and maintain a scalable & resilient infrastructure on AWS.
  • Follow the Infrastructure as Code principle to keep changes versioned.
  • Build and manage cloud infrastructure using Terraform.
  • Continuously evolve & maintain Kubernetes clusters.
  • Implement and consult on observability and monitoring frameworks.
  • Share knowledge in technologies like Kubernetes, Docker, and more.
  • Contribute to service blueprints.
  • Actively test and evolve system reliability.
  • Cooperate with Security on infrastructure compliance.
  • Design and support continuous deployment tooling.
  • Be on-call for incidents affecting availability.
  • Debug production issues across services.

Skills: AWS, Docker, Node.js, PostgreSQL, Python, Elasticsearch, Kubernetes, RabbitMQ, Go, Redis, Communication Skills, CI/CD, Terraform

Posted 2024-11-07

Location: Dubai, London

Industry: Data Infrastructure

Company: Eqvilent

Requirements:
  • 3+ years in a similar role.
  • Proven experience with AWS or other cloud providers.
  • Experience with distributed systems (e.g., Apache Kafka, Apache Airflow, Apache Hadoop).
  • Proficiency with Terraform.
  • Extensive experience with Docker and Kubernetes, including cluster setup, node pools, and Helm charts.
  • Experience with CI/CD tools (e.g., GitLab CI, Jenkins).
  • Familiarity with observability tools such as Prometheus, Grafana, ELK stack.
  • Solid understanding of networking, security, and system architecture.
  • Strong scripting skills (e.g., Python, Bash).
  • Excellent problem-solving skills, communication, and collaboration abilities.
Responsibilities:

  • Design, implement, and maintain both cloud and on-premise compute and storage infrastructure.
  • Set up and manage Kubernetes clusters and implement Helm charts, ensuring high availability and performance.
  • Set up, maintain, and scale distributed systems (e.g., Apache Kafka, Apache Airflow), ensuring data integrity and security.
  • Automate code delivery processes and implement CI/CD, monitoring, logging, and alerting solutions.
  • Collaborate with development and operations teams, provide production support, and participate in on-call rotations.

Skills: AWS, Docker, Python, Apache Airflow, Apache Hadoop, Bash, Jenkins, Apache Kafka, Kubernetes, Grafana, Prometheus, Collaboration, CI/CD, Terraform

Posted 2024-10-21