Apply

Lead Machine Learning Engineer

Posted 12 days agoViewed

View full description

💎 Seniority level: Lead, 5+ years

📍 Location: USA

💸 Salary: 260000.0 USD per year

🔍 Industry: Software Development

🏢 Company: Wizard👥 11-50Customer ServiceManufacturing

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: DockerNode.jsPythonSQLFlaskGCPGitKubernetesMachine LearningPyTorchAirflowAlgorithmsData engineeringData StructuresFastAPIREST APIPandas

Requirements:
  • MS/Ph.D. in computer science, mathematics, or another quantitative discipline or exceptional work experience in ML engineering.
  • 5+ years ML Experience.
  • Experience developing machine learning-driven products at scale.
  • Experience solving problems using Machine Learning with PyTorch.
  • Exceptional understanding of designing ML systems in production at scale and applying a quantitative approach to identifying performance bottlenecks and cost/performance tradeoffs.
  • Practical experience with large data systems, data models, and batch and streaming data pipelines.
  • Ability to ramp up quickly on our tech stack, which includes GCP, Kubernetes, Airflow, Pandas, PyTorch, Python, and Node.js.
  • Extensive experience with model deployment, both static within applications and dynamic, using Flask, FastAPI, or similar frameworks.
Responsibilities:
  • Build and maintain a training pipeline for fine-tuning open-source LLMs.
  • Serve and Deploy the LLM on GPUs. It is important to know different configurations ahead of time to ensure we don’t OOM at runtime.
  • Formulate novel product issues to ML problems and propose and iterate on solutions quickly. We value the fast pace of development.
  • Triage customer issues to model/system shortcomings and communicate the complex architecture in a simple, comprehensible way to internal stakeholders.
  • Keep up with the fast-growing generative AI space and incorporate the latest developments into our system.
Apply

Related Jobs

Apply

📍 United States

🧭 Full-Time

💸 175500.0 - 277500.0 USD per year

🔍 Software Development

🏢 Company: Upwork👥 501-1000💰 over 8 years ago🫂 Last layoff almost 2 years agoMarketplaceFreelanceCopywritingPeer to Peer

NOT STATED
  • Design, implement, and optimize distributed systems and infrastructure components to support large-scale machine learning workflows, including data ingestion, feature engineering, model training, and serving.
  • Develop and maintain frameworks, libraries, and tools to streamline the end-to-end machine learning lifecycle, from data preparation and experimentation to model deployment and monitoring.
  • Architect and implement highly available, fault-tolerant, and secure systems that meet the performance and scalability requirements of production machine learning workloads.
  • Collaborate with machine learning researchers and data scientists to understand their requirements and translate them into scalable and efficient software solutions.
  • Stay current with the latest advancements in machine learning infrastructure, distributed computing, and cloud technologies, and integrate them into our platform to drive innovation.
  • Mentor junior engineers, conduct code reviews, and uphold engineering best practices to ensure the delivery of high-quality software solutions.

AWSDockerLeadershipPythonSQLApache AirflowCloud ComputingKubernetesMachine LearningMLFlowPyTorchAlgorithmsApache KafkaData engineeringData StructuresSparkTensorflowCI/CDRESTful APIsMentoringDevOpsTerraformMicroservicesData modelingSoftware Engineering

Posted 26 days ago
Apply
Apply

📍 US, Canada

🧭 Full-Time

🔍 Mobile industry

🏢 Company: Liftoff👥 501-1000💰 Private over 4 years agoAdvertising PlatformsBig DataMobile AdvertisingApp MarketingAd RetargetingMobileAd Network

  • 10+ years of industry experience in Machine Learning, Software Engineering and/or Infrastructure.
  • 5+ years of industry experience applying Machine Learning to large scale problems.
  • Very strong coding ability.
  • Good team communication and collaboration skills.
  • B.S. or higher in Computer Science (PhD is a plus).
  • Build state-of-the-art deep learning models to make accurate bidding decisions in millions of auctions per second.
  • Work with an experienced team of ML, Software, and Infrastructure Engineers.
  • Design, engineer & implement reliable, scalable, and cost-efficient systems.
  • Achieve core business objectives by enabling next generation ML models and technologies.
  • Utilize vendor-based products and open source technologies.

AWSPythonSoftware DevelopmentMachine LearningPyTorch

Posted 4 months ago
Apply