Senior Data Engineer - Databricks

New
Based in the United StatesFull-TimeSenior
SalaryCompetitive compensation package with performance-based incentives
Apply NowOpens the employer's application page

Job Details

Experience
4+ years of experience in data engineering
Required Skills
AWSPostgreSQLPythonSQLETLAzureSparkDatabricksPySpark

Requirements

  • 4+ years of experience in data engineering with strong exposure to production-grade data pipelines.
  • At least 2+ years of hands-on experience with Databricks and/or Apache Spark ecosystems across Azure and/or AWS.
  • Strong proficiency in Python, PySpark, and SQL for building scalable data transformations and workflows.
  • Deep experience with Delta Lake, including schema evolution, ACID transactions, and incremental/streaming processing.
  • Proven expertise in performance tuning and optimizing Databricks workloads in production environments.
  • Solid understanding of PostgreSQL and relational database design, including query optimization and integration patterns.
  • Experience working with legacy ETL systems (e.g., SSIS, Informatica, or custom pipelines) and modernization efforts.
  • Familiarity with multi-tenant architectures, data isolation strategies, and enterprise-grade data governance.
  • Strong collaboration skills with cross-functional teams including data science, product, and infrastructure.
  • Knowledge of CI/CD practices, version control, and automated testing for data engineering workflows.

Responsibilities

  • Own the design, development, and production support of Databricks-based data pipelines powering ERP and CRM data ingestion, transformation, and delivery at scale.
  • Monitor and optimize production data workflows, ensuring SLA compliance, system reliability, and rapid incident resolution across distributed environments.
  • Lead performance tuning and cost optimization initiatives across Databricks workloads, improving throughput and reducing compute costs.
  • Drive migration of legacy ETL/ELT systems to modern Databricks architecture, ensuring seamless and automated transitions.
  • Architect and maintain Delta Lake structures, including schema design, partitioning strategies, and incremental processing frameworks.
  • Implement robust data quality, observability, and monitoring systems to ensure integrity across pipelines and ML inputs.
  • Support multi-tenant data onboarding, ensuring secure, isolated, and validated pipelines for enterprise customers.
  • Collaborate cross-functionally with ML, product, and enterprise architecture teams to align data infrastructure with platform needs.
  • Maintain operational readiness through documentation, CI/CD practices, runbooks, and participation in on-call support rotations.
View Full Description & ApplyYou'll be redirected to the employer's site
Competitive compensation package with performance-based incentives
Apply Now