GenAI Data Engineer
New
T
Tiger Analytics Inc.Analytics Consulting
Hartford, Connecticut, United States. Toronto, Ontario, CanadaFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- PythonSQLGCPSnowflakeBigQueryNLP
Requirements
- Expertise in data pipeline design and implementation.
- Proficiency in Snowflake (Directory Tables, Scoped URLs, Snowpark).
- Strong experience with GCP (BigQuery, Cloud Storage, Dataflow).
- Experience integrating AI/ML tools like OCR, NLP, and Document AI.
- Advanced SQL tuning capabilities.
- Expertise in Python for data processing.
- Experience working with Dataiku.
- Capability to manage petabyte-scale data environments.
Responsibilities
- Design and implement robust data pipelines that ingest, process, and store unstructured data formats at scale within Snowflake and GCP.
- Leverage Snowflake’s unstructured data capabilities (Directory Tables, Scoped URLs, Snowpark) to make dark data queryable.
- Build and maintain cloud-native ETL/ELT processes using BigQuery, Cloud Storage, and Dataflow.
- Integrate AI tools (OCR, NLP entities, Document AI) into the engineering flow to transform unstructured blobs into structured insights.
- Tune complex SQL queries and Python-based processing jobs to handle petabyte-scale environments efficiently.
View Full Description & ApplyYou'll be redirected to the employer's site