Data Engineer

New
Remote, Latin AmericaFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
PythonSQLETLGitData modelingAzure DevOpsPySpark

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or equivalent experience.
  • Proven experience in ETL data engineering with expertise in Python (PySpark).
  • Proficiency in Azure Synapse Analytics (Notebooks, Pipelines, Linked Services, Azure Key Vault).
  • Ability to write complex SQL queries and optimize performance using SparkSQL and MS SQL.
  • Experience with version control systems, specifically Git and Azure DevOps.
  • Strong problem-solving and analytical skills.
  • Excellent communication skills for cross-functional collaboration.
  • Familiarity with data integration best practices.
  • Knowledge of Data Fabric concepts (data lakes, delta lakes, etc.) is a plus.
  • Experience with Power BI, Tableau, or Agile methodologies is a plus.

Responsibilities

  • Develop and maintain ETL data engineering processes using Python (PySpark) within Azure Synapse Analytics Notebooks and Pipelines.
  • Design and build effective data storage structures in a Massively Parallel Processing (MPP) SQL Pool using star schemas, facts, and dimensions.
  • Extract data from various sources including REST APIs, SQL database tables, and CSV files.
  • Optimize ETL data notebooks/pipelines for scalability and performance.
  • Implement data quality checks and validation processes to maintain accuracy and consistency.
  • Collaborate with data architects to create data models and schemas.
  • Monitor ETL jobs, diagnose issues, and ensure pipeline reliability.
  • Maintain comprehensive documentation of processes and data flows.
  • Ensure data security and compliance with data governance and privacy standards.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now