5+ years of experience in data engineering or backend systems, with senior or staff-level contributions. Deep Python proficiency, with production experience in ETL, data validation, and orchestration frameworks (e.g., Airflow, Dagster, dbt). Strong experience with data warehouse design, including star/snowflake schemas, denormalization strategies, and performance optimization. Strong understanding of data privacy and security practices, especially in healthcare (HIPAA, de-identification, audit logging, etc.). Proven experience managing complex integrations with EMRs or clinical systems. Familiarity with LLM and ML development tools (e.g., TensorFlow, PyTorch, LangChain, transformers, vector DBs). Experience deploying or supporting predictive models in production environments. Expertise in Snowflake or similar cloud data platforms (e.g., BigQuery, Redshift). Strong grasp of data modeling, provenance, and semantics for analytical and AI purposes. Experience working with AWS services such as S3, Lambda, Batch, Event Bridge, Cloud Front, EC2, etc.