Senior / Staff Machine Learning Infrastructure Engineer
New
Remote US & CanadaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 3-5 years
- Required Skills
- AWSDockerPythonGCPKubeflowKubernetesPyTorchAzureTensorflowMLOps
Requirements
- 3-5 years of experience supporting machine learning training platforms.
- Bachelor’s degree in Computer Science, Data Science or a related field.
- Strong understanding of machine learning principles and model lifecycle management.
- Proficiency in programming languages such as Python, with hands-on experience in machine learning frameworks like TensorFlow or PyTorch.
- Experience with cloud platforms like AWS, Azure, or Google Cloud and their respective machine learning services.
- Experience managing technology such as JupyterHub and Kubeflow.
- Familiarity with containerization and orchestration tools such as Kubernetes and Docker.
- Strong problem-solving skills and ability to troubleshoot complex issues.
- Experience with monitoring tools and practices for model performance in production.
- Ability to work collaboratively in cross-functional teams.
Responsibilities
- Design, develop, and implement the machine learning platform for the continuous deployment and integration of machine learning models.
- Collaborate with data scientists and engineers to understand model requirements and optimize pipeline processes.
- Automate the training, testing and deployment processes for machine learning models.
- Continuously monitor and maintain model pipelines, ensuring optimal performance, accuracy and reliability.
- Optimize machine learning pipelines for scalability, efficiency and cost-effectiveness.
- Ensure compliance with security and data privacy standards in all MLOps activities.
View Full Description & ApplyYou'll be redirected to the employer's site