Data Infrastructure Engineer

Posted 15 days agoViewed

💎 Seniority level: Senior, 5+ years

📍 Location: United States

💸 Salary: 130000.0 - 185000.0 USD per year

🔍 Industry: Software Development

🏢 Company: Datavant👥 1001-5000💰 $40,000,000 Series B over 4 years agoBiopharma Clinical Trials Data Integration Health Care Software

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWSLeadershipPythonApache AirflowSnowflake

Requirements:

5+ years experience in software development
2+ years experience building and maintaining a data lake and/or data warehouse
Strong understanding of cloud architecture
2+ years experience using a cloud provider
Experience writing Infrastructure-as-Code
Deep knowledge of operational database products

Responsibilities:

Collaborate with department and software engineers
Plan and delegate complex projects
Mentor early career developers
Facilitate technical discussions
Engage with stakeholders
Build and maintain data-related infrastructure
Write performant, reusable code and Infrastructure-as-Code
Review code to ensure technical quality

Apply

Related Jobs

Apply

🔥 Staff Observability Data Infrastructure Engineer

Posted about 1 month ago

📍 United States of America

🧭 Full-Time

💸 130295.0 - 260590.0 USD per year

🔍 Healthcare

🔧 Requirements

7+ years experience managing expansive data platforms like Splunk and Clickhouse.
6+ years mastering high-volume data pipelines with tools such as Vector, Cribl, and Confluent.
Strong understanding of contemporary data modeling and architecture.
Proven collaboration skills across different teams.
Exceptional problem-solving abilities in a healthcare IT environment.
Excellent communication skills to convey technical data solutions to diverse audiences.
Experience with project management, CI/CD pipelines, and GitHub.
Proficiency in query languages like SPL2 and programming with Python or Java.

💡 Responsibilities

Architect and cultivate a scalable observability data platform using tools like Splunk and Clickhouse.
Innovate and refine enterprise data models to boost performance and reliability.
Support data management policies and the data lifecycle.
Enhance data integrity through robust governance processes.
Ensure compliance with regulations regarding data security.
Develop sophisticated data pipelines for data collection and processing.
Optimize data flows and long-term storage strategies.
Collaborate with various IT teams for a unified operational data view.
Drive enhancements in data platform architecture and security measures.

AWSPythonETLJavaKafkaClickhouseData engineeringCI/CDData modelingData management

Posted about 1 month ago

Apply

🔥 Senior Data Infrastructure Engineer

Posted 5 months ago

📍 United States, BC, ON, Canada

🧭 Full-Time

💸 $139,000 - $248,000 per year

🔍 Web development

🔧 Requirements

5+ years of experience as a Data Infrastructure Engineer or in related roles like Platform Engineer, SRE, DevOps or Backend Engineer.
Strong experience with provisioning and managing data infrastructure components like Kafka, Spark, and Airflow.
Proficiency with cloud services and environments (compute, storage, networking, identity management, infrastructure as code, etc.).
Experience with containerization technologies like Docker and Kubernetes.
Expertise in infrastructure as code tools like Terraform and Pulumi.
Solid understanding of networking concepts and configurations, including VPCs, load balancers, and endpoints.
Experience with monitoring and logging tools.
Strong problem-solving skills and attention to detail.
Excellent communication and collaboration skills.

💡 Responsibilities

Provision and deploy infrastructure using Pulumi for Kafka, Spark, Airflow, Athena, and other critical systems on AWS.
Manage and maintain clusters, ensuring optimal performance and reliability, including implementing auto-scaling and right-sizing instances.
Configure and manage VPCs, load balancers, and VPC endpoints for secure communication between internal and external services.
Manage IAM roles, apply security patches, plan and execute version upgrades, and ensure compliance with regulations such as GDPR.
Design and implement high-availability solutions across multiple zones and regions, including backups, multi-region replication, and disaster recovery plans.
Oversee S3 data lake management, including file size management, compaction, encryption, and compression to maximize storage efficiency.
Implement caching strategies, indexing, and query optimization to ensure efficient data retrieval and processing.
Spearhead initiatives for optimizing performance, capacity planning, ensuring fault tolerance, and implementing failure recovery across all infrastructure components.
Implement monitoring and logging using tools like Datadog, CloudWatch and OpenSearch.
Develop services, tools and automation to simplify infrastructure complexity for other engineering teams, enabling them to focus on building great products.
Participate in all engineering activities including incident response, interviewing, designing and reviewing technical specifications, code review, and releasing new functionality.
Mentor, coach, and inspire a team of engineers of various levels.

AWSDockerKafkaKubernetesAirflowSparkCollaborationProblem SolvingTerraformNetworking

Posted 5 months ago

Apply