Senior Data Engineer

New
Our team works in a hybrid model from the San Francisco Bay Area. We will prioritize candidates who are able to work 2 days per week from our office, and we will consider highly qualified remote candidates who can travel quarterly to the San Francisco office.Full-TimeSenior
Salary190,000 - 220,000 USD per year
Apply NowOpens the employer's application page

Job Details

Experience
5+ years building data-intensive SaaS platforms (L5: 8+ years with technical leadership)
Required Skills
AWSPythonSQLAirflowSparkTerraformData modelingDatabricksHIPAA

Requirements

  • 5+ years building data-intensive SaaS platforms (L5: 8+ years with technical leadership).
  • Deep, hands-on expertise with Spark and distributed data processing.
  • Strong SQL and data modeling / warehouse design (dimensional modeling, Delta / Lakehouse).
  • Proven track record scaling a product to an enterprise level.
  • Experience with orchestration (Airflow), IaC (Terraform), and CI/CD for data.
  • Experience with data-quality / testing frameworks such as dbt tests or Great Expectations.
  • Ability to quickly understand complex modeling workflows and the business need driving them.
  • Ships high-caliber, well-tested code with strong attention to detail.
  • Experience with healthcare data (claims, eligibility) and handling PHI / PII under HIPAA.
  • Thrives under minimal supervision in a rapidly changing, ambiguous start-up environment.

Responsibilities

  • Scale Arbital's healthcare data pipelines and lakehouse on AWS and Databricks, and own the underlying architecture.
  • Implement and scale actuarially sound healthcare financial calculations in Spark.
  • Build and maintain orchestration (Airflow) and CI/CD so enrichment and aggregation workflows are reliable, observable, and reproducible.
  • Own data quality, integrity, privacy, security, and HIPAA compliance through automated testing and quality-control procedures.
  • Collaborate with actuarial and delivery teams that primarily work in Python and R.
  • Partner with data scientists to deploy and monitor machine learning models in production.
  • Lead technical design reviews and contribute to platform-wide architecture decisions.
  • Establish data observability, lineage, and SLAs, and tune Spark/Databricks jobs for performance and cost.
  • Raise the engineering bar through code review, mentorship, and setting data-engineering standards across the team.
View Full Description & ApplyYou'll be redirected to the employer's site
190,000 - 220,000 USD per year
Apply Now