Senior/Lead Data Engineer

New

TruelogicSoftware, AI

ColombiaFull-TimeLead

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Expertise in SQL and dimensional modeling methodologies, including medallion architecture, SCDs, and grain management.
Proven ability to design idempotent pipelines utilizing incremental, checkpoint, and replaceWhere strategies.
Extensive experience with production-grade Python engineering, including type hints, pytest, and ruff.
Strong capability to diagnose and resolve failing Spark / PySpark jobs utilizing tools like Spark UI.
Deep understanding of Delta Lake features such as MERGE, OPTIMIZE, Z-ORDER, and time travel.
Hands-on expertise with dbt, including models, tests, and exposures.
Experience authoring and deploying jobs using Databricks Asset Bundles (DAB) and operating within a Unity Catalog environment.
Strong adherence to disciplined Git workflows, conventional commits, and strict documentation practices.
Experience provisioning and utilizing Service Principals, GitHub environment secrets, and secret management tools like Azure Key Vault or Databricks secret scopes.
Strong written technical communication skills for PR descriptions and runbooks.
Experience leading technical initiatives, establishing architectural standards, and contributing to interview rubrics is preferred.
Experience reading or modifying Azure Data Factory (ADF) pipelines and familiarity with Azure Data Lake storage is highly preferred.

Design and build robust, idempotent data pipelines from scratch utilizing a modern data stack.
Design star and snowflake schemas, writing precise, grain-aware SQL to construct scalable data marts.
Write production-grade, unit-tested Python code at the module level, adhering to strong engineering disciplines such as type hinting and testing.
Build and test dbt models across staging, intermediate, and mart layers while managing overall project structure.
Author and deploy jobs using Databricks Asset Bundles (DAB) following documented architectural patterns.
Implement rigorous data quality checks at source, intermediate, and destination layers to prevent silent drops of nulls or duplicates.
Maintain data governance through comprehensive dbt tests and strict documentation-at-merge-time discipline.
Operate securely within a multi-repository architecture, utilizing service principals and ensuring zero personal credentials in production deployments.
Own data pipelines end-to-end, making key technical design decisions and mentoring mid-level engineers through substantive code reviews.
Define overarching technical direction across core data systems and act as a technical leader to unblock the team.

View Full Description & ApplyYou'll be redirected to the employer's site