Senior / Staff Machine Learning Infrastructure Engineer

New
Remote US & CanadaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
3-5 years
Required Skills
AWSDockerPythonGCPKubeflowKubernetesPyTorchAzureTensorflowMLOps

Requirements

  • 3-5 years of experience supporting machine learning training platforms.
  • Bachelor’s degree in Computer Science, Data Science or a related field.
  • Strong understanding of machine learning principles and model lifecycle management.
  • Proficiency in programming languages such as Python, with hands-on experience in machine learning frameworks like TensorFlow or PyTorch.
  • Experience with cloud platforms like AWS, Azure, or Google Cloud and their respective machine learning services.
  • Experience managing technology such as JupyterHub and Kubeflow.
  • Familiarity with containerization and orchestration tools such as Kubernetes and Docker.
  • Strong problem-solving skills and ability to troubleshoot complex issues.
  • Experience with monitoring tools and practices for model performance in production.
  • Ability to work collaboratively in cross-functional teams.

Responsibilities

  • Design, develop, and implement the machine learning platform for the continuous deployment and integration of machine learning models.
  • Collaborate with data scientists and engineers to understand model requirements and optimize pipeline processes.
  • Automate the training, testing and deployment processes for machine learning models.
  • Continuously monitor and maintain model pipelines, ensuring optimal performance, accuracy and reliability.
  • Optimize machine learning pipelines for scalability, efficiency and cost-effectiveness.
  • Ensure compliance with security and data privacy standards in all MLOps activities.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now