- Maintain, extend, and improve existing data/ML workflows, and implement new ones to handle high-velocity data.
- Provide interfaces and systems that enable ML engineers and researchers to build datasets on demand.
- Influence data storage and processing strategies.
- Collaborate with the ML team, as well as frontend and backend teams, to build out our data platform.
- Reduce time-to-deployment for dashboards and ML models.
- Establish best practices and develop pipelines and software that enable ML engineers and researchers to efficiently build and use datasets.
- Work with large datasets under performance constraints comparable to those at the largest companies.
- Iterate quickly, with a focus on shipping early and often, ensuring that new products or features can be deployed to millions of users.