Apply

Data Infrastructure Engineer

Posted 15 days agoViewed

View full description

💎 Seniority level: Senior, 5+ years

📍 Location: United States

💸 Salary: 130000.0 - 185000.0 USD per year

🔍 Industry: Software Development

🏢 Company: Datavant👥 1001-5000💰 $40,000,000 Series B over 4 years agoBiopharmaClinical TrialsData IntegrationHealth CareSoftware

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWSLeadershipPythonApache AirflowSnowflake

Requirements:
  • 5+ years experience in software development
  • 2+ years experience building and maintaining a data lake and/or data warehouse
  • Strong understanding of cloud architecture
  • 2+ years experience using a cloud provider
  • Experience writing Infrastructure-as-Code
  • Deep knowledge of operational database products
Responsibilities:
  • Collaborate with department and software engineers
  • Plan and delegate complex projects
  • Mentor early career developers
  • Facilitate technical discussions
  • Engage with stakeholders
  • Build and maintain data-related infrastructure
  • Write performant, reusable code and Infrastructure-as-Code
  • Review code to ensure technical quality
Apply

Related Jobs

Apply

📍 United States of America

🧭 Full-Time

💸 130295.0 - 260590.0 USD per year

🔍 Healthcare

  • 7+ years experience managing expansive data platforms like Splunk and Clickhouse.
  • 6+ years mastering high-volume data pipelines with tools such as Vector, Cribl, and Confluent.
  • Strong understanding of contemporary data modeling and architecture.
  • Proven collaboration skills across different teams.
  • Exceptional problem-solving abilities in a healthcare IT environment.
  • Excellent communication skills to convey technical data solutions to diverse audiences.
  • Experience with project management, CI/CD pipelines, and GitHub.
  • Proficiency in query languages like SPL2 and programming with Python or Java.
  • Architect and cultivate a scalable observability data platform using tools like Splunk and Clickhouse.
  • Innovate and refine enterprise data models to boost performance and reliability.
  • Support data management policies and the data lifecycle.
  • Enhance data integrity through robust governance processes.
  • Ensure compliance with regulations regarding data security.
  • Develop sophisticated data pipelines for data collection and processing.
  • Optimize data flows and long-term storage strategies.
  • Collaborate with various IT teams for a unified operational data view.
  • Drive enhancements in data platform architecture and security measures.

AWSPythonETLJavaKafkaClickhouseData engineeringCI/CDData modelingData management

Posted about 1 month ago
Apply
Apply

📍 United States, BC, ON, Canada

🧭 Full-Time

💸 $139,000 - $248,000 per year

🔍 Web development

  • 5+ years of experience as a Data Infrastructure Engineer or in related roles like Platform Engineer, SRE, DevOps or Backend Engineer.
  • Strong experience with provisioning and managing data infrastructure components like Kafka, Spark, and Airflow.
  • Proficiency with cloud services and environments (compute, storage, networking, identity management, infrastructure as code, etc.).
  • Experience with containerization technologies like Docker and Kubernetes.
  • Expertise in infrastructure as code tools like Terraform and Pulumi.
  • Solid understanding of networking concepts and configurations, including VPCs, load balancers, and endpoints.
  • Experience with monitoring and logging tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication and collaboration skills.
  • Provision and deploy infrastructure using Pulumi for Kafka, Spark, Airflow, Athena, and other critical systems on AWS.
  • Manage and maintain clusters, ensuring optimal performance and reliability, including implementing auto-scaling and right-sizing instances.
  • Configure and manage VPCs, load balancers, and VPC endpoints for secure communication between internal and external services.
  • Manage IAM roles, apply security patches, plan and execute version upgrades, and ensure compliance with regulations such as GDPR.
  • Design and implement high-availability solutions across multiple zones and regions, including backups, multi-region replication, and disaster recovery plans.
  • Oversee S3 data lake management, including file size management, compaction, encryption, and compression to maximize storage efficiency.
  • Implement caching strategies, indexing, and query optimization to ensure efficient data retrieval and processing.
  • Spearhead initiatives for optimizing performance, capacity planning, ensuring fault tolerance, and implementing failure recovery across all infrastructure components.
  • Implement monitoring and logging using tools like Datadog, CloudWatch and OpenSearch.
  • Develop services, tools and automation to simplify infrastructure complexity for other engineering teams, enabling them to focus on building great products.
  • Participate in all engineering activities including incident response, interviewing, designing and reviewing technical specifications, code review, and releasing new functionality.
  • Mentor, coach, and inspire a team of engineers of various levels.

AWSDockerKafkaKubernetesAirflowSparkCollaborationProblem SolvingTerraformNetworking

Posted 5 months ago
Apply