Data Engineer
Nscale
AI Infrastructure
Location: US
Full-Time
Salary not disclosed
Job Details
Required Skills
- GraphQL, Python, Git, Pandas, Spark, CI/CD, RESTful APIs, PySpark
Requirements
- Deep, hands-on experience building in Palantir Foundry, including ontology modelling, pipeline development, API integration, and large-scale data platform design.
- Strong proficiency in Python, with experience applying data engineering libraries and frameworks (e.g. Spark, PySpark, Dask, pandas).
- Familiarity with API-driven data integration, including REST, GraphQL, and Foundry Action APIs.
- Practical experience working in Git-based development workflows, including code reviews, version control, and CI/CD pipelines.
- Comfort working in ambiguous, early-stage environments where requirements evolve quickly.
- Strong communication skills with the ability to explain data concepts to technical and non-technical stakeholders.
- A bias toward ownership, pragmatism, and shipping useful solutions.
Responsibilities
- Design and build scalable, reliable data pipelines that ingest data from infrastructure, platform services, and business systems.
- Define data models and schemas that support operational workflows, monitoring, and analytics use cases across the business.
- Clean, transform, and structure data to create a digital twin of Nscale.
- Set up permissioning and manage access and security for the Foundry implementation.
- Create trusted datasets and metrics that power workflows and processes, internal tools, and customer-facing insights.
- Enable self-serve analytics by establishing clear data contracts, documentation, and semantic layers.
- Implement data quality checks, monitoring, and alerting to ensure data correctness and availability.
- Establish best practices around data versioning, access control, and governance.