Senior Data Platform Engineer II (Databricks)

Remote, United States | Full-Time | Senior

Salary not disclosed

Job Details

Experience: 6+ years of experience as an engineer building and optimizing highly scalable distributed data systems (e.g., Databricks, Spark, or Snowflake).
Required Skills: AWS, Docker, SQL, ETL, Kubernetes, Snowflake, Spark, CI/CD, Terraform, Data modeling, Databricks, EHR, HIPAA, Distributed Systems

Requirements

BS/BTech (or higher) in Computer Science, Engineering, or a related field, or equivalent experience.
6+ years of experience as an engineer building and optimizing highly scalable distributed data systems (e.g., Databricks, Spark, or Snowflake).
3+ years of experience working with SQL and data modeling on large multi-table data sets.
3+ years of experience acting as a trusted technical decision-maker in a team setting, solving for short-term and long-term business value.
3+ years of experience coaching other engineers.
Deep expertise in managing Databricks workspaces, including Unity Catalog for data governance, lineage, and fine-grained access control.
Advanced proficiency with Terraform (or similar) to automate the provisioning and scaling of Databricks clusters, cloud resources (AWS preferred), and networking.
In-depth knowledge of distributed systems, including partitioning, liquid clustering/Z-Ordering, sharding, and high-availability strategies for petabyte-scale data.
Proven track record in performance monitoring and query tuning for distributed workloads to ensure system reliability and cost-efficiency.
Experience designing and optimizing high-throughput ETL/ELT pipelines and ingestion systems (batch and streaming) using Spark.
Experience building robust CI/CD pipelines for data infrastructure and deploying services using containerization (Docker, Kubernetes).
Expertise in building systems that handle protected information, with specific experience in HIPAA and SOX compliance frameworks.
Experience navigating health-tech data complexities, such as Electronic Health Records (EHR), clinical data formats (HL7/FHIR), and claims data.

Responsibilities

Architect and manage the high-performance, distributed data environments that power our healthcare analytics.
Develop and implement scalable and performant solutions.
Partner, as a peer, with Engineering Managers, Product Managers, and stakeholders throughout Aledade to develop and execute technical roadmaps using Agile processes.
Mentor and coach more junior engineers, including through thorough pull request reviews, and be receptive to critical feedback on your own work.