Principal MLOps Engineer

Posted 2024-11-07

💎 Seniority level: Principal, significant expertise

🔍 Industry: Machine Learning

🗣️ Languages: English

⏳ Experience: Significant expertise

Requirements:
  • Significant expertise in Machine Learning engineering.
  • Experience in building and scaling ML inference platforms.
  • Strong communication skills.
  • Ability to independently solve complex challenges with innovative solutions.
Responsibilities:
  • Architect the ML inference platform to ensure scalability and efficiency.
  • Build and optimize the platform for production use.
  • Focus on creating robust ML inference systems.

Related Jobs

📍 U.S.

💸 140000 - 225000 USD per year

🔍 Distributed Data Systems, Platforms at Scale, Complex Application Development

🏢 Company: Raft

Requirements:
  • 7+ years of relevant hands-on experience.
  • 5+ years experience with Docker and Kubernetes.
  • 5+ years experience supporting enterprise Cloud applications or infrastructure (AWS, Azure, etc.).
  • Solid understanding of Helm Charts.
  • Practical experience with Machine Learning on Kubernetes.
  • Experience managing clusters with GPU machines.
  • Experience building and maintaining machine learning platforms and pipelines.
  • Practical programming and scripting skills (Python preferred).
  • Strong communication skills, analytical thinking, and creative problem-solving.
  • Familiarity with modern software development practices including scrum/agile, Git, and DevOps.
  • Ability to obtain a Security+ certification within the first 90 days.

Responsibilities:
  • Deploy ML infrastructure.
  • Build MLOps pipelines.
  • Contribute to the development of a full-lifecycle ML platform.
  • Collaborate with a team to enhance operators' awareness of critical events.
  • Process large volumes of real-time data efficiently.

AWS, Docker, Python, Software Development, Agile, Git, Java, Kafka, Kubernetes, Machine Learning, SCRUM, Azure, Communication Skills, Collaboration, CI/CD, DevOps, Attention to detail, Compliance

Posted 2024-11-23

📍 Canada

🔍 Machine Learning / Artificial Intelligence

Requirements:
  • Significant expertise in Machine Learning engineering.
  • Strong background in infrastructure related to ML.
  • Proven experience in building and scaling ML inference platforms in a production environment.

Responsibilities:
  • Architect, build, and optimize the ML inference platform.
  • Focus on building Machine Learning inference systems.
  • Drive improvements to existing systems and processes.

Leadership, Python, Software Development, Artificial Intelligence, Machine Learning, Cross-functional Team Leadership, Communication Skills, Analytical Skills, Collaboration

Posted 2024-11-07

📍 Canada

🔍 Multicloud solutions and technology services

🏢 Company: Rackspace

👥 1001-5000

💰 $ Private on 2017-09-11

🫂 on 2023-03-27

IaaS, Big Data, Cloud Computing, Cloud Infrastructure

Requirements:
  • Proven track record in designing and implementing scalable ML inference systems.
  • Hands-on experience with deep learning frameworks such as TensorFlow, Keras, or Spark MLlib.
  • Solid foundation in machine learning algorithms, natural language processing, and statistical modeling.
  • Strong understanding of computer science concepts including algorithms and distributed systems.
  • Proficiency and recent experience in Java is required.
  • Experience in the Apache Hadoop ecosystem (Oozie, Pig, Hive, MapReduce).
  • Expertise in public cloud services, particularly GCP and Vertex AI.
  • Understanding of LLM architectures and model optimization techniques.

Responsibilities:
  • Architect and optimize existing data infrastructure for machine learning and deep learning models.
  • Collaborate with cross-functional teams to translate business objectives into engineering solutions.
  • Own development and operation of high-performance inference systems for various models.
  • Provide technical leadership and mentorship to the engineering team.

Leadership, Python, Apache Hadoop, GCP, Hadoop, Java, Keras, Machine Learning, C++, C (Programming language), Algorithms, Data Structures, Spark, TensorFlow

Posted 2024-08-14

🧭 Full-Time

🔍 Technology / Multicloud solutions

🏢 Company: Rackspace

👥 1001-5000

💰 $ Private on 2017-09-11

🫂 on 2023-03-27

IaaS, Big Data, Cloud Computing, Cloud Infrastructure

Requirements:
  • Proven track record in designing and implementing cost-effective and scalable ML inference systems.
  • Hands-on experience with deep learning frameworks like TensorFlow, Keras, or Spark MLlib.
  • Strong foundation in machine learning algorithms, natural language processing, and statistical modeling.
  • Knowledge of fundamental computer science concepts, including algorithms, distributed systems, and database management.
  • Experience in the Apache Hadoop ecosystem and public cloud services, specifically GCP and Vertex AI.
  • Demonstrated ability to tackle complex challenges and develop innovative solutions.
  • Proficiency in Java and understanding of model optimization techniques.
  • Technical degree: a Bachelor's in Computer Science with a minimum of 10 years of experience, or a Master's with 8 years, preferably with a specialization in Machine Learning.

Responsibilities:
  • Architect and optimize existing data infrastructure for machine learning and deep learning models.
  • Collaborate with cross-functional teams to convert business objectives into engineering solutions.
  • Manage the development and operation of high-performance inference systems for various models, including large language models.
  • Provide technical leadership and mentorship to the engineering team.

Leadership, Python, Apache Hadoop, GCP, Hadoop, Java, Keras, Machine Learning, C++, C (Programming language), Algorithms, Data Structures, Spark, TensorFlow, Communication Skills

Posted 2024-08-09
