Senior Staff Machine Learning Scientist, Assets

WebflowDigital Experience Platform

Remote-first (United States; BC & ON, Canada)Full-TimeStaff

Salary194000 - 324000 USD per year

Apply NowOpens the employer's application page

Job Details

Deep theoretical and practical expertise in computer vision, with strong foundations in areas such as representation learning, visual recognition, image or video generation, multimodal learning, and large-scale model training
Advanced degree in Computer Science, Electrical Engineering, Statistics, Applied Mathematics, or a related field; PhD strongly preferred
8+ years of relevant industry and/or research experience, or equivalent impact, including a track record of driving major research initiatives in computer vision or multimodal machine learning
Strong experience with state-of-the-art vision and multimodal model architectures, including transformers, diffusion models, contrastive learning approaches, and foundation models
Proven ability to formulate research problems clearly, design rigorous experiments, and translate findings into meaningful model or product advances
Strong coding and prototyping skills in Python, with the ability to write clean, scalable, and well-documented research code
Proficiency in modern deep learning frameworks such as PyTorch and TensorFlow
Demonstrated technical leadership, including mentoring scientists or engineers and influencing research direction across a team or organization
Strong communication skills, with the ability to present complex research clearly to both technical and cross-functional audiences

Lead and drive ambitious research initiatives that advance the state of the art in computer vision, multimodal understanding, and visual generation
Develop novel models, algorithms, and training methodologies for challenging vision problems such as image understanding, video understanding, visual search, scene representation, segmentation, detection, generation, and multimodal reasoning
Translate cutting-edge research into practical model improvements that can shape product direction and unlock new user experiences
Design, implement, train, and optimize large-scale vision and multimodal foundation models across diverse datasets and tasks
Partner closely with applied scientists, ML engineers, and product teams to move research from exploration to production-ready systems
Set technical direction for high-impact research areas, identifying promising bets and influencing longer-term strategy for vision and multimodal AI
Mentor other scientists and engineers, raise the quality bar for research, and help build a strong scientific culture across the team
Stay at the forefront of research in computer vision, multimodal learning, generative modeling, and foundation models, and apply emerging techniques to real-world problems

View Full Description & ApplyYou'll be redirected to the employer's site