Data Engineer – AI/BI

New
Remote-friendly work arrangement based in IndiaFull-Time
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
PythonSQLMLFlowSparkScalaDatabricks

Requirements

  • Strong hands-on experience with Databricks, Delta Lake, and Apache Spark in building scalable data engineering solutions.
  • Proficiency in SQL and Python (or Scala) for data transformation, pipeline development, and workflow automation.
  • Experience implementing Medallion Architecture and working with large-scale cloud data platforms.
  • Proven ability to build and manage ELT pipelines, data ingestion frameworks, and orchestration workflows.
  • Experience integrating enterprise systems such as Salesforce, PeopleSoft, or similar platforms.
  • Understanding of data governance, metadata management, lineage tracking, and data quality best practices.
  • Familiarity with cloud environments such as Azure or AWS, including storage and security concepts.
  • Experience with BI/AI enablement tools such as Databricks AI/BI, dashboards, or semantic modeling is highly desirable.
  • Strong analytical thinking, problem-solving skills, and ability to work in a fast-paced engineering environment.

Responsibilities

  • Design, build, and optimize end-to-end data pipelines on Databricks using Apache Spark and Delta Lake, following Medallion Architecture (bronze, silver, gold) principles.
  • Develop and operationalize AI/BI solutions including dashboards, semantic models, and Databricks Genie experiences for self-service analytics.
  • Integrate enterprise data sources such as PeopleSoft, Salesforce, and D2L using APIs, JDBC, and other ingestion frameworks.
  • Implement scalable ELT workflows with orchestration, monitoring, schema evolution, and automated error handling.
  • Establish data quality frameworks, validation rules, and observability mechanisms to ensure accuracy, reliability, and performance.
  • Implement data governance, lineage, and metadata management using tools such as Unity Catalog or equivalent solutions.
  • Support AI/ML teams by delivering curated datasets, feature stores, and ML-ready data pipelines using tools like MLflow.
  • Ensure data security, privacy, and compliance through access controls, encryption, masking, and governance best practices.
  • Maintain technical documentation, architecture diagrams, and enablement materials for platform adoption and scalability.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now