Senior Data Engineer

New

Our team works in a hybrid model from the San Francisco Bay Area. We will prioritize candidates who are able to work 2 days per week from our office, and we will consider highly qualified remote candidates who can travel quarterly to the San Francisco office.Full-TimeSenior

Salary190,000 - 220,000 USD per year

Apply NowOpens the employer's application page

Job Details

Experience: 5+ years building data-intensive SaaS platforms (L5: 8+ years with technical leadership)
Required Skills: AWSPythonSQLAirflowSparkTerraformData modelingDatabricksHIPAA

Requirements

5+ years building data-intensive SaaS platforms (L5: 8+ years with technical leadership).
Deep, hands-on expertise with Spark and distributed data processing.
Strong SQL and data modeling / warehouse design (dimensional modeling, Delta / Lakehouse).
Proven track record scaling a product to an enterprise level.
Experience with orchestration (Airflow), IaC (Terraform), and CI/CD for data.
Experience with data-quality / testing frameworks such as dbt tests or Great Expectations.
Ability to quickly understand complex modeling workflows and the business need driving them.
Ships high-caliber, well-tested code with strong attention to detail.
Experience with healthcare data (claims, eligibility) and handling PHI / PII under HIPAA.
Thrives under minimal supervision in a rapidly changing, ambiguous start-up environment.

Responsibilities

Scale Arbital's healthcare data pipelines and lakehouse on AWS and Databricks, and own the underlying architecture.
Implement and scale actuarially sound healthcare financial calculations in Spark.
Build and maintain orchestration (Airflow) and CI/CD so enrichment and aggregation workflows are reliable, observable, and reproducible.
Own data quality, integrity, privacy, security, and HIPAA compliance through automated testing and quality-control procedures.
Collaborate with actuarial and delivery teams that primarily work in Python and R.
Partner with data scientists to deploy and monitor machine learning models in production.
Lead technical design reviews and contribute to platform-wide architecture decisions.
Establish data observability, lineage, and SLAs, and tune Spark/Databricks jobs for performance and cost.
Raise the engineering bar through code review, mentorship, and setting data-engineering standards across the team.

View Full Description & ApplyYou'll be redirected to the employer's site