5+ years of professional experience in the role
Coding in Python and using it in data processing solutions, along with related data technologies such as Pandas and PySpark
Working with relational and non-relational data stores (such as HBase, Cassandra, or MongoDB; S3, blob storage)
Data streams (Kafka, Kinesis, Flume) and message queuing (SQS, SNS, RabbitMQ, etc.)
Data/stream processing (Spark, Flink, Hadoop)
ETL solutions: Talend, Informatica, SQL Server Integration Services (SSIS)
Data warehouses (Snowflake, Redshift, Hive)
Implementation of data warehouse solutions that provide near real-time data to a variety of client systems
Experience designing and implementing data applications and services on the public cloud (AWS, GCP, or Azure) using PaaS platforms
Familiarity with data privacy regulations and best practices