Principal MLOps Engineer

Posted 2024-08-09

💎 Seniority level: Principal

🔍 Industry: Technology / Multicloud solutions

🏢 Company: Rackspace | 👥 1001-5000 | 💰 $ Private on 2017-09-11 | 🫂 on 2023-03-27 | IaaS, Big Data, Cloud Computing, Cloud Infrastructure

🗣️ Languages: English

⏳ Experience: 8+ years with a Master's degree or 10+ years with a Bachelor's degree

🪄 Skills: Leadership, Python, Apache Hadoop, GCP, Hadoop, Java, Keras, Machine Learning, C++, Algorithms, Data Structures, Spark, TensorFlow, Communication Skills, C (Programming language)

Requirements:
  • Proven track record in designing and implementing cost-effective and scalable ML inference systems.
  • Hands-on experience with deep learning frameworks like TensorFlow, Keras, or Spark MLlib.
  • Strong foundation in machine learning algorithms, natural language processing, and statistical modeling.
  • Knowledge of fundamental computer science concepts, including algorithms, distributed systems, and database management.
  • Experience in the Apache Hadoop ecosystem and public cloud services, specifically GCP and Vertex AI.
  • Demonstrated ability to tackle complex challenges and develop innovative solutions.
  • Proficiency in Java and understanding of model optimization techniques.
  • Technical degree: a Bachelor's in Computer Science with at least 10 years of experience, or a Master's with at least 8 years, preferably with a specialization in Machine Learning.
Responsibilities:
  • Architect and optimize existing data infrastructure for machine learning and deep learning models.
  • Collaborate with cross-functional teams to convert business objectives into engineering solutions.
  • Manage the development and operation of high-performance inference systems for various models, including large language models.
  • Provide technical leadership and mentorship to the engineering team.

Related Jobs

πŸ“ U.S.

πŸ’Έ 140000 - 225000 USD per year

πŸ” Distributed Data Systems, Platforms at Scale, Complex Application Development

🏒 Company: Raft Company Website

  • 7+ years of relevant hands-on experience.
  • 5+ years of experience with Docker and Kubernetes.
  • 5+ years of experience supporting enterprise cloud applications or infrastructure (AWS, Azure, etc.).
  • Solid understanding of Helm Charts.
  • Practical experience with Machine Learning on Kubernetes.
  • Experience managing clusters with GPU machines.
  • Experience building and maintaining machine learning platforms and pipelines.
  • Practical programming and scripting skills (Python preferred).
  • Strong communication skills, analytical thinking, and creative problem-solving.
  • Familiarity with modern software development practices including scrum/agile, Git, and DevOps.
  • Ability to obtain a Security+ certification within the first 90 days.

  • Deploy ML infrastructure.
  • Build MLOps pipelines.
  • Contribute to the development of a full-lifecycle ML platform.
  • Collaborate with a team to enhance operators' awareness of critical events.
  • Process large volumes of real-time data efficiently.

AWS, Docker, Python, Software Development, Agile, Git, Java, Kafka, Kubernetes, Machine Learning, SCRUM, Azure, Communication Skills, Collaboration, CI/CD, DevOps, Attention to detail, Compliance

Posted 2024-11-23

πŸ“ Canada

πŸ” Machine Learning / Artificial Intelligence

  • Significant expertise in Machine Learning engineering.
  • Strong background in infrastructure related to ML.
  • Proven experience in building and scaling ML inference platforms in a production environment.

  • Architect, build, and optimize ML inference platform.
  • Focus on building Machine Learning inference systems.
  • Drive improvements to existing systems and processes.

Leadership, Python, Software Development, Artificial Intelligence, Machine Learning, Cross-functional Team Leadership, Communication Skills, Analytical Skills, Collaboration

Posted 2024-11-07

πŸ” Machine Learning

  • Significant expertise in Machine Learning engineering.
  • Experience in building and scaling ML inference platforms.
  • Strong communication skills.
  • Ability to independently solve complex challenges with innovative solutions.

  • Architect the ML inference platform to ensure scalability and efficiency.
  • Build and optimize the platform for production use.
  • Focus on creating robust ML inference systems.
Posted 2024-11-07

πŸ“ Canada

πŸ” Multicloud solutions and technology services

🏒 Company: RackspaceπŸ‘₯ 1001-5000πŸ’° $ Private on 2017-09-11πŸ«‚ on 2023-03-27IaaSBig DataCloud ComputingCloud Infrastructure

  • Proven track record in designing and implementing scalable ML inference systems.
  • Hands-on experience with deep learning frameworks such as TensorFlow, Keras, or Spark MLlib.
  • Solid foundation in machine learning algorithms, natural language processing, and statistical modeling.
  • Strong understanding of computer science concepts including algorithms and distributed systems.
  • Proficiency in and recent experience with Java are required.
  • Experience in the Apache Hadoop ecosystem (Oozie, Pig, Hive, MapReduce).
  • Expertise in public cloud services, particularly GCP and Vertex AI.
  • Understanding of LLM architectures and model optimization techniques.

  • Architect and optimize existing data infrastructure for machine learning and deep learning models.
  • Collaborate with cross-functional teams to translate business objectives into engineering solutions.
  • Own development and operation of high-performance inference systems for various models.
  • Provide technical leadership and mentorship to the engineering team.

Leadership, Python, Apache Hadoop, GCP, Hadoop, Java, Keras, Machine Learning, C++, C (Programming language), Algorithms, Data Structures, Spark, TensorFlow

Posted 2024-08-14