Data Engineer

New
N
NscaleAI Infrastructure
Location: USFull-Time
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
GraphQLPythonGitPandasSparkCI/CDRESTful APIsPySpark

Requirements

  • Deep, hands-on experience building in Palantir Foundry, including ontology modelling, pipeline development, API integration, and large-scale data platform design.
  • Strong proficiency in Python, with experience applying data engineering libraries and frameworks (e.g. Spark, PySpark, Dask, pandas).
  • Familiarity with API-driven data integration, including REST, GraphQL, and Foundry Action APIs.
  • Practical experience working in Git-based development workflows, including code reviews, version control, and CI/CD pipelines.
  • Comfort working in ambiguous, early-stage environments where requirements evolve quickly.
  • Strong communication skills with the ability to explain data concepts to technical and non-technical stakeholders.
  • A bias toward ownership, pragmatism, and shipping useful solutions.

Responsibilities

  • Design and build scalable, reliable data pipelines that ingest data from infrastructure, platform services, and business systems.
  • Define data models and schemas that support operational workflows and use cases across the business, monitoring, and analytics.
  • Clean, transform and structure the data to create a digital twin of Nscale.
  • Implement permissioning and manage access and security of the Foundry implementation.
  • Create trusted datasets and metrics that power workflows and processes, internal tools, and customer-facing insights.
  • Enable self-serve analytics by establishing clear data contracts, documentation, and semantic layers.
  • Implement data quality checks, monitoring, and alerting to ensure data correctness and availability.
  • Establish best practices around data versioning, access control, and governance.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now