Senior Staff Machine Learning Scientist, Assets

Webflow | Digital Experience Platform
Remote-first (United States; BC & ON, Canada) | Full-Time | Staff
Salary: 194,000 - 324,000 USD per year

Job Details

Experience
8+ years
Required Skills
Python, PyTorch, TensorFlow, Computer Vision

Requirements

  • Deep theoretical and practical expertise in computer vision, with strong foundations in areas such as representation learning, visual recognition, image or video generation, multimodal learning, and large-scale model training
  • Advanced degree in Computer Science, Electrical Engineering, Statistics, Applied Mathematics, or a related field; PhD strongly preferred
  • 8+ years of relevant industry and/or research experience, or equivalent impact, including a track record of driving major research initiatives in computer vision or multimodal machine learning
  • Strong experience with state-of-the-art vision and multimodal model architectures, including transformers, diffusion models, contrastive learning approaches, and foundation models
  • Proven ability to formulate research problems clearly, design rigorous experiments, and translate findings into meaningful model or product advances
  • Strong coding and prototyping skills in Python, with the ability to write clean, scalable, and well-documented research code
  • Proficiency in modern deep learning frameworks such as PyTorch and TensorFlow
  • Demonstrated technical leadership, including mentoring scientists or engineers and influencing research direction across a team or organization
  • Strong communication skills, with the ability to present complex research clearly to both technical and cross-functional audiences

Responsibilities

  • Lead and drive ambitious research initiatives that advance the state of the art in computer vision, multimodal understanding, and visual generation
  • Develop novel models, algorithms, and training methodologies for challenging vision problems such as image understanding, video understanding, visual search, scene representation, segmentation, detection, generation, and multimodal reasoning
  • Translate cutting-edge research into practical model improvements that can shape product direction and unlock new user experiences
  • Design, implement, train, and optimize large-scale vision and multimodal foundation models across diverse datasets and tasks
  • Partner closely with applied scientists, ML engineers, and product teams to move research from exploration to production-ready systems
  • Set technical direction for high-impact research areas, identifying promising bets and influencing longer-term strategy for vision and multimodal AI
  • Mentor other scientists and engineers, raise the quality bar for research, and help build a strong scientific culture across the team
  • Stay at the forefront of research in computer vision, multimodal learning, generative modeling, and foundation models, and apply emerging techniques to real-world problems