Apply

ML Ops Engineer

Posted 23 days agoViewed

View full description

๐Ÿ’Ž Seniority level: Middle, 4+ years

๐Ÿ” Industry: Software Development

๐Ÿข Company: Intuition Machines, Inc.๐Ÿ‘ฅ 51-100InternetEducationInternet of ThingsMachine LearningSoftware

๐Ÿ—ฃ๏ธ Languages: English

โณ Experience: 4+ years

Requirements:
  • 4+ years of professional experience building and maintaining large-scale production systems with a focus on performance, scalability, and reliability.
  • Significant experience coding and developing in Python.
  • Experience with orchestration platforms: Kubernetes, containerization, and microservice design; event-driven model for workflow orchestration, monorepo-scale CI/CD, and testing.
  • Experience with database systems (SQL, NoSQL), distributed systems and architectures, and stream processing (preferably ClickHouse and Redpanda).
  • Familiarity with machine learning fundamentals such as data preparation, model training, model evaluation, and model inference.
  • Focus on clearly structured, maintainable, well-documented, and tested code, including unit, integration, and end-to-end tests.
  • Bachelor's degree or foreign equivalent in a technical or related field from an accredited college or university, or equivalent job experience.
Responsibilities:
  • Optimize and maintain high-reliability systems and pipelines that scale to millions of RPS on large volumes of data.
  • Take care of aspects such as observability, automation, reliability, scalability, continuous integration, and securityโ€”across multiple business domains, in a platform-oriented way.
  • Work with other ML developers from multiple teams to integrate their work within our infrastructure.
  • Participate in code reviews, take part in architecture and design sessions.
  • Proactively watch for recent developments in relevant technologies and assess their applicability.
  • Help maintain, extend, and improve existing data/ML workflows and implement new ones to handle high-volume data flows.
  • Reduce time-to-deployment for dashboards and ML models.
Apply

Related Jobs

Apply
๐Ÿ”ฅ Principal ML Ops Engineer
Posted about 1 month ago

๐Ÿ“ United States, Canada

๐Ÿงญ Full-Time

๐Ÿ” Healthcare Data Science

๐Ÿข Company: Sift Healthcare๐Ÿ‘ฅ 11-50๐Ÿ’ฐ $20,000,000 Series B 11 months agoArtificial Intelligence (AI)Information ServicesPredictive AnalyticsInsurTechHealth CareFinTechSoftware

  • Master's degree in Computer Science or related field
  • 5+ years as ML Ops Engineer or Data Scientist
  • Expertise with infrastructure-as-code frameworks
  • Strong understanding of CI/CD pipelines and containerization
  • Experience with cloud platforms
  • Familiarity with monitoring tools
  • Experience with healthcare data
  • Design and implement ML Ops infrastructure
  • Develop CI/CD pipelines for model serving
  • Maintain monitoring systems for model performance
  • Utilize containerization and orchestration tools
  • Collaborate with data teams for production readiness
  • Champion ML Ops best practices
  • Integrate new technologies into the platform
  • Provide technical guidance to the team

AWSDockerKubernetesMachine LearningCI/CDTerraform

Posted about 1 month ago
Apply
Apply
๐Ÿ”ฅ ML Ops Engineer
Posted 4 months ago

๐Ÿ“ US

๐Ÿงญ Full-Time

๐Ÿ’ธ 170000.0 - 200000.0 USD per year

๐Ÿ” Healthcare technology

๐Ÿข Company: Inspiren๐Ÿ‘ฅ 11-50๐Ÿ’ฐ $2,720,602 over 2 years agoMachine LearningAnalyticsInformation TechnologyHealth Care

  • Bachelor's or Master's degree in Computer Science, Data Science, Software Engineering, or related field.
  • 5+ years of hands-on experience in ML Ops and managing machine learning projects.
  • Expertise in Python and familiarity with ML frameworks like TensorFlow and PyTorch.
  • Experience with ML pipeline tools like MLflow, Kubeflow, and Airflow.
  • Proficiency in cloud platforms such as AWS, GCP, or Azure.
  • Hands-on experience with Docker, Kubernetes, and CI/CD pipelines.
  • Familiarity with data pipelines, ETL processes, and tools such as Apache Spark.
  • Expertise in monitoring model performance and implementing automated retraining.
  • Understanding of data security and privacy best practices in machine learning.
  • Well-versed in Agile/Scrum methodologies and MLOps best practices.
  • Excellent verbal and written communication skills.
  • Comfortable working in a fast-paced, dynamic environment.
  • Oversee the end-to-end development and deployment of machine learning models and infrastructure.
  • Work closely with data scientists, software engineers, product managers, and DevOps teams.
  • Stay current with industry trends and introduce new methodologies to enhance performance.
  • Conduct infrastructure reviews and drive rigorous design for scalability and efficiency.
  • Provide technical guidance and mentorship to team members.
  • Establish best practices for model validation, monitoring, and performance tracking.
  • Troubleshoot complex ML pipeline issues and act as Tier-2 engineering support.
  • Contribute to the long-term ML roadmap aligned with company goals.

AWSDockerPythonApache AirflowETLGCPKafkaKubeflowKubernetesMachine LearningMLFlowPyTorchSnowflakeAzureTensorflowCI/CD

Posted 4 months ago
Apply
Apply

๐Ÿ“ United Arab Emirates

๐Ÿ” Machine Learning Operations

๐Ÿข Company: Brixio๐Ÿ‘ฅ 11-50๐Ÿ’ฐ Seed about 9 years agoEnterprise SoftwareReal Estate

  • At least 3 years of experience in ML/Ops or a related field.
  • Proven leadership skills with the ability to guide and inspire a team.
  • Excellent communication skills to effectively liaise with clients and internal teams.
  • Strong experience in deploying Machine Learning models.
  • Proficiency in Python for scripting and automation.
  • Hands-on experience with Cloud platforms, particularly Azure and GCP.
  • Prior experience in addressing scalability and pricing concerns in ML operations.
  • Lead a team of ML/Ops engineers in deploying and managing Machine Learning models.
  • Build and enhance capabilities to seamlessly integrate ML models into existing pipelines.
  • Collaborate with cross-functional teams to understand project requirements and ensure successful deployment.
  • Provide ongoing support to accommodate surges in requirements, focusing on scalability and efficiency.
  • Utilize Python for scripting and automation tasks.
  • Work with Cloud platforms, particularly Azure and GCP to optimize machine learning workflows.

LeadershipPythonGCPMachine LearningAzureCommunication Skills

Posted 6 months ago
Apply
Apply

๐Ÿ“ Poland

๐Ÿ” Financial services

๐Ÿข Company: Capco๐Ÿ‘ฅ 101-250Electric VehicleProduct DesignMechanical EngineeringManufacturing

  • Experience with tools like TensorFlow, PyTorch, MLflow, Kubeflow or similar
  • Experience managing data pipelines (Airflow)
  • Knowledge of DevOps practices (CI/CD, monitoring, automation)
  • Proficiency in Python (with ML libraries) and Git.
  • Experience with model monitoring solutions (e.g. Prometheus, Grafana).
  • Experience with Apache Spark, Hadoop or other big data tools is a huge plus!
  • Great communication skills in English (both written and spoken)
  • Problem solver, team player with positive attitude!
  • University degree in computer science, mathematics, natural sciences, or similar field and relevant working experience
  • Design and develop pipelines for training and deploying ML models
  • Automate and monitor ML processes, including version control and model updates
  • Collaborate with Data Science and Data Engineering teams to integrate ML models into production environments
  • Build and maintain systems for monitoring ML models to track performance and stability
  • Optimize cloud infrastructure (GCP) for ML deployments
  • Troubleshoot issues related to performance and availability of ML models in production
  • Ensure CI/CD best practices for ML models

PythonGitKubeflowMachine LearningMLFlowPyTorchAirflowData engineeringData scienceTensorflowCommunication SkillsCI/CD

Posted 7 months ago
Apply

Related Articles

Posted about 1 month ago

Why remote work is such a nice opportunity?

Why is remote work so nice? Let's try to see!

Posted 7 months ago

Insights into the evolving landscape of remote work in 2024 reveal the importance of certifications and continuous learning. This article breaks down emerging trends, sought-after certifications, and provides practical solutions for enhancing your employability and expertise. What skills will be essential for remote job seekers, and how can you navigate this dynamic market to secure your dream role?

Posted 8 months ago

Explore the challenges and strategies of maintaining work-life balance while working remotely. Learn about unique aspects of remote work, associated challenges, historical context, and effective strategies to separate work and personal life.

Posted 8 months ago

Google is gearing up to expand its remote job listings, promising more opportunities across various departments and regions. Find out how this move can benefit job seekers and impact the market.

Posted 8 months ago

Learn about the importance of pre-onboarding preparation for remote employees, including checklist creation, documentation, tools and equipment setup, communication plans, and feedback strategies. Discover how proactive pre-onboarding can enhance job performance, increase retention rates, and foster a sense of belonging from day one.