5+ years as a data engineer or data scientist 3+ years of experience with software-defined storage 3+ years using Amazon AWS, including S3 and S3 API 3+ years building and maintaining workflows in git 1+ years developing ML pipelines Solid foundation of Linux skills Strong oral and written communication skills