Data Infrastructure Engineer

Posted 2024-07-11

📍 Location: U.S., EDT, PDT

💸 Salary: 101,000 - 198,000 USD per year

🔍 Industry: Data management software

🏢 Company: MongoDB 👥 1001-5000 💰 Post-IPO Equity on 2018-03-06 (Database, Open Source, Cloud Computing, SaaS, Software)

🗣️ Languages: English

🪄 Skills: SQL, Hadoop, Java, MongoDB, Go (Golang), Spark, CI/CD

Requirements:
  • Understanding of advanced SQL and the ability to review SQL statements.
  • Experience with distributed computing frameworks like Hadoop or Spark.
  • Familiarity with data pipelines and workflow management tools.
  • Ability to write maintainable code in at least one language, preferably Go.
  • Proficiency with CI/CD systems and infrastructure-as-code tools like Terraform.
  • Knowledge of cloud-native infrastructure for data services.
Responsibilities:
  • Empower MongoDB developers to efficiently build a high-quality data platform.
  • Maintain a stable, secure, scalable, and durable data platform infrastructure.
  • Respond promptly to user needs and resolve issues proactively.

Related Jobs

📍 United States, BC, ON, Canada

🧭 Full-Time

💸 $139,000 - $248,000 per year

🔍 Web development

Requirements:
  • 5+ years of experience as a Data Infrastructure Engineer or in related roles such as Platform Engineer, SRE, DevOps, or Backend Engineer.
  • Strong experience with provisioning and managing data infrastructure components like Kafka, Spark, and Airflow.
  • Proficiency with cloud services and environments (compute, storage, networking, identity management, infrastructure as code, etc.).
  • Experience with containerization technologies like Docker and Kubernetes.
  • Expertise in infrastructure as code tools like Terraform and Pulumi.
  • Solid understanding of networking concepts and configurations, including VPCs, load balancers, and endpoints.
  • Experience with monitoring and logging tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication and collaboration skills.

Responsibilities:
  • Provision and deploy infrastructure using Pulumi for Kafka, Spark, Airflow, Athena, and other critical systems on AWS.
  • Manage and maintain clusters, ensuring optimal performance and reliability, including implementing auto-scaling and right-sizing instances.
  • Configure and manage VPCs, load balancers, and VPC endpoints for secure communication between internal and external services.
  • Manage IAM roles, apply security patches, plan and execute version upgrades, and ensure compliance with regulations such as GDPR.
  • Design and implement high-availability solutions across multiple zones and regions, including backups, multi-region replication, and disaster recovery plans.
  • Oversee S3 data lake management, including file size management, compaction, encryption, and compression to maximize storage efficiency.
  • Implement caching strategies, indexing, and query optimization to ensure efficient data retrieval and processing.
  • Spearhead initiatives for optimizing performance, capacity planning, ensuring fault tolerance, and implementing failure recovery across all infrastructure components.
  • Implement monitoring and logging using tools like Datadog, CloudWatch, and OpenSearch.
  • Develop services, tools and automation to simplify infrastructure complexity for other engineering teams, enabling them to focus on building great products.
  • Participate in all engineering activities including incident response, interviewing, designing and reviewing technical specifications, code review, and releasing new functionality.
  • Mentor, coach, and inspire a team of engineers of various levels.

🪄 Skills: AWS, Docker, Kafka, Kubernetes, Airflow, Spark, Collaboration, Problem Solving, Terraform

Posted 2024-09-25

📍 United States, BC & ON, Canada

🧭 Full-Time

💸 164,000 - 263,500 USD per year

🔍 Web development

Requirements:
  • 8+ years of experience as a Data Infrastructure Engineer or in related roles.
  • Deep expertise in provisioning and managing data infrastructure components like Kafka, Spark, and Airflow.
  • Extensive experience with cloud services (compute, storage, networking, identity management, etc.).
  • Strong experience with containerization technologies (Docker and Kubernetes).
  • Advanced knowledge of infrastructure as code tools (Terraform and Pulumi).
  • Strong understanding of networking concepts (VPCs, load balancers).
  • Extensive experience with monitoring and logging tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent leadership, communication, and mentoring skills.

Responsibilities:
  • Oversee the provisioning and deployment of infrastructure using Pulumi for systems like Kafka, Spark, Airflow, and Athena on AWS.
  • Design and implement strategies for scaling Airflow, Kafka, and Spark clusters.
  • Lead efforts in optimizing performance and capacity planning with a focus on fault tolerance.
  • Manage secure communication between services through VPCs and load balancers.
  • Ensure compliance with regulations and manage IAM roles and security patches.
  • Architect high-availability solutions and oversee S3 data lake management.
  • Implement monitoring and logging tools and develop automation tools for engineers.
  • Participate in incident response, interviews, and code reviews while mentoring junior engineers.

🪄 Skills: AWS, Docker, Leadership, Kafka, Kubernetes, C (Programming language), Airflow, Spark, Mentoring, Terraform

Posted 2024-09-06