Data Infrastructure Engineer

Posted 2024-07-11

🔍 Industry: Data management software

🏢 Company: MongoDB

👥 1001-5000

💰 Post-IPO Equity on 2018-03-06

Database, Open Source, Cloud Computing, SaaS, Software

🗣️ Languages: English

🪄 Skills: SQL, Hadoop, MongoDB, Go (Golang), Spark

Requirements:
  • Advanced understanding of SQL.
  • Experience with distributed computing frameworks like Hadoop or Spark.
  • Experience with data pipelines and workflow management tools.
  • Ability to write maintainable code in at least one programming language.
  • Knowledge of CI/CD systems and infrastructure-as-code tools.
  • Understanding of cloud-native infrastructure for data services.
Responsibilities:
  • Empower MongoDB developers to build a high-quality data platform.
  • Stabilize and secure the fledgling data platform.
  • Respond to user needs and maintain the infrastructure.

Related Jobs

📍 UK

🔍 Collaborative Work Management

🏢 Company: Smartsheet

Requirements:
  • Solid experience with one or more programming languages, e.g., Golang, Python, or Node.js.
  • Experience with one or more cloud platforms.
  • Experience with Snowflake and Airbyte.
  • Experience with Terraform or other infrastructure-as-code tools.
  • Experience working with Big Data.

Responsibilities:
  • Participate in system architecture discussions.
  • Contribute to the full software development life cycle (SDLC) including defining, prioritizing, designing, implementing, testing, and deploying new platform features.
  • Develop automated solutions to handle Big Data infrastructure problems.
  • Work on existing and new CI/CD tooling and pipelines.
  • Participate in all phases of quality assurance and defect resolution.
  • Design and implement data warehouses and data lakes, managing data volumes with required security measures.
  • Develop and maintain data pipelines and ETL processes to ingest, transform, and load data.
  • Implement and monitor data quality checks to ensure data accuracy and reliability.
  • Perform other duties as assigned.

🪄 Skills: Python, ETL, Snowflake, Golang, CI/CD, Terraform

Posted 2024-10-18

📍 United States, BC, ON, Canada

🧭 Full-Time

💸 $139,000 - $248,000 per year

🔍 Web development

Requirements:
  • 5+ years of experience as a Data Infrastructure Engineer or in related roles like Platform Engineer, SRE, DevOps or Backend Engineer.
  • Strong experience with provisioning and managing data infrastructure components like Kafka, Spark, and Airflow.
  • Proficiency with cloud services and environments (compute, storage, networking, identity management, infrastructure as code, etc.).
  • Experience with containerization technologies like Docker and Kubernetes.
  • Expertise in infrastructure as code tools like Terraform and Pulumi.
  • Solid understanding of networking concepts and configurations, including VPCs, load balancers, and endpoints.
  • Experience with monitoring and logging tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication and collaboration skills.

Responsibilities:
  • Provision and deploy infrastructure using Pulumi for Kafka, Spark, Airflow, Athena, and other critical systems on AWS.
  • Manage and maintain clusters, ensuring optimal performance and reliability, including implementing auto-scaling and right-sizing instances.
  • Configure and manage VPCs, load balancers, and VPC endpoints for secure communication between internal and external services.
  • Manage IAM roles, apply security patches, plan and execute version upgrades, and ensure compliance with regulations such as GDPR.
  • Design and implement high-availability solutions across multiple zones and regions, including backups, multi-region replication, and disaster recovery plans.
  • Oversee S3 data lake management, including file size management, compaction, encryption, and compression to maximize storage efficiency.
  • Implement caching strategies, indexing, and query optimization to ensure efficient data retrieval and processing.
  • Spearhead initiatives for optimizing performance, capacity planning, ensuring fault tolerance, and implementing failure recovery across all infrastructure components.
  • Implement monitoring and logging using tools like Datadog, CloudWatch and OpenSearch.
  • Develop services, tools and automation to simplify infrastructure complexity for other engineering teams, enabling them to focus on building great products.
  • Participate in all engineering activities including incident response, interviewing, designing and reviewing technical specifications, code review, and releasing new functionality.
  • Mentor, coach, and inspire a team of engineers of various levels.

🪄 Skills: AWS, Docker, Kafka, Kubernetes, Airflow, Spark, Collaboration, Problem Solving, Terraform

Posted 2024-09-25

📍 United States, BC & ON, Canada

🧭 Full-Time

💸 $164,000 - $263,500 per year

🔍 Web development

Requirements:
  • 8+ years of experience as a Data Infrastructure Engineer or in related roles.
  • Deep expertise in provisioning and managing data infrastructure components like Kafka, Spark, and Airflow.
  • Extensive experience with cloud services (compute, storage, networking, identity management, etc.).
  • Strong experience with containerization technologies (Docker and Kubernetes).
  • Advanced knowledge of infrastructure as code tools (Terraform and Pulumi).
  • Strong understanding of networking concepts (VPCs, load balancers).
  • Extensive experience with monitoring and logging tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent leadership, communication, and mentoring skills.

Responsibilities:
  • Oversee the provisioning and deployment of infrastructure using Pulumi for systems like Kafka, Spark, Airflow, and Athena on AWS.
  • Design and implement strategies for scaling Airflow, Kafka, and Spark clusters.
  • Lead efforts in optimizing performance and capacity planning with a focus on fault tolerance.
  • Manage secure communication between services through VPCs and load balancers.
  • Ensure compliance with regulations and manage IAM roles and security patches.
  • Architect high-availability solutions and oversee S3 data lake management.
  • Implement monitoring and logging tools and develop automation tools for engineers.
  • Participate in incident response, interviews, and code reviews while mentoring junior engineers.

🪄 Skills: AWS, Docker, Leadership, Kafka, Kubernetes, C (Programming language), Airflow, Spark, Mentoring, Terraform

Posted 2024-09-06