Senior Data Engineer - Databricks

New

Based in the United StatesFull-TimeSenior

SalaryCompetitive compensation package with performance-based incentives

Apply NowOpens the employer's application page

Job Details

4+ years of experience in data engineering with strong exposure to production-grade data pipelines.
At least 2+ years of hands-on experience with Databricks and/or Apache Spark ecosystems across Azure and/or AWS.
Strong proficiency in Python, PySpark, and SQL for building scalable data transformations and workflows.
Deep experience with Delta Lake, including schema evolution, ACID transactions, and incremental/streaming processing.
Proven expertise in performance tuning and optimizing Databricks workloads in production environments.
Solid understanding of PostgreSQL and relational database design, including query optimization and integration patterns.
Experience working with legacy ETL systems (e.g., SSIS, Informatica, or custom pipelines) and modernization efforts.
Familiarity with multi-tenant architectures, data isolation strategies, and enterprise-grade data governance.
Strong collaboration skills with cross-functional teams including data science, product, and infrastructure.
Knowledge of CI/CD practices, version control, and automated testing for data engineering workflows.

Own the design, development, and production support of Databricks-based data pipelines powering ERP and CRM data ingestion, transformation, and delivery at scale.
Monitor and optimize production data workflows, ensuring SLA compliance, system reliability, and rapid incident resolution across distributed environments.
Lead performance tuning and cost optimization initiatives across Databricks workloads, improving throughput and reducing compute costs.
Drive migration of legacy ETL/ELT systems to modern Databricks architecture, ensuring seamless and automated transitions.
Architect and maintain Delta Lake structures, including schema design, partitioning strategies, and incremental processing frameworks.
Implement robust data quality, observability, and monitoring systems to ensure integrity across pipelines and ML inputs.
Support multi-tenant data onboarding, ensuring secure, isolated, and validated pipelines for enterprise customers.
Collaborate cross-functionally with ML, product, and enterprise architecture teams to align data infrastructure with platform needs.
Maintain operational readiness through documentation, CI/CD practices, runbooks, and participation in on-call support rotations.

View Full Description & ApplyYou'll be redirected to the employer's site