Design and maintain scalable data pipelines and ETL/ELT processes Build document processing pipelines for text and image extraction Architect AWS-based data solutions using S3, Glue, Redshift, RDS, ECS, etc Optimize SQL queries and develop Python-based data processing workflows Troubleshoot data pipeline issues and implement solutions Ensure data pipeline performance, scalability, and security