Design and maintain infrastructure supporting scalable real time data pipelines to handle huge datasets Develop and support tooling enabling implementation of custom ML algorithms in a low latency environment Work on infrastructure for running training, inference, monitoring, and deployment on thousands of ML tasks concurrently