Senior/Lead Data Engineer

MexicoFull-TimeLead
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
PythonSQLAzureSparkCI/CDdbtDatabricksPySpark

Requirements

  • Strong expertise in SQL and dimensional data modeling, including medallion architecture, SCD patterns, and dataset grain management.
  • Extensive experience building production-grade data pipelines using Python, with strong testing practices (pytest), typing, and code quality tools.
  • Deep hands-on experience with Spark/PySpark and performance troubleshooting using tools such as Spark UI.
  • Strong experience with dbt, including model development, testing frameworks, and data documentation practices.
  • Solid knowledge of Databricks, Delta Lake (MERGE, OPTIMIZE, Z-ORDER, time travel), and modern lakehouse architectures.
  • Experience working with cloud data platforms and tools such as GitHub, CI/CD workflows, and secret management systems.
  • Proven ability to operate in complex, distributed systems and make architectural trade-offs between cost, scalability, and performance.
  • Strong communication skills with the ability to document technical decisions and translate engineering work into business impact.
  • Leadership experience mentoring engineers, setting technical direction, and contributing to engineering best practices.
  • Familiarity with Azure ecosystem tools (ADF, ADLS) and modern observability or AI-assisted engineering tools is a plus.

Responsibilities

  • Design, build, and maintain scalable, idempotent end-to-end data pipelines using modern data stack principles to support analytics and AI workloads.
  • Develop robust data models (star and snowflake schemas) and write high-quality, grain-aware SQL to build scalable and reliable data marts.
  • Build production-grade Python systems with strong engineering discipline, including testing, type safety, and modular design.
  • Develop and manage dbt models across layered architectures (staging, intermediate, marts), ensuring strong testing and documentation standards.
  • Implement and enforce data quality frameworks, including validation checks, schema enforcement, and anomaly detection across pipelines.
  • Develop and deploy Databricks-based workloads (including Asset Bundles) and operate within secure, governed cloud environments.
  • Provide technical leadership by defining architecture standards, reviewing code, mentoring engineers, and guiding cross-team engineering decisions.
  • Ensure reliability and observability of data systems, while proactively identifying and resolving performance or pipeline issues.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now