5+ years of software development and data engineering experience with demonstrated ownership of production grade data infrastructure Bachelor's degree in Computer Science or a related field, or equivalent practical experience Deep expertise scaling Spark in production (Databricks, EMR, etc) Strong understanding of distributed computing and modern data modeling for scalable systems Proficient in Python with experience implementing software engineering best practices Hands-on experience with both relational (MySQL / PostgreSQL) and NoSQL (MongoDB, DynamoDB, Cassandra) databases Strong communicator with experience influencing cross functional stakeholders