Apply

Machine Learning Engineer

Posted 4 days agoViewed

View full description

๐Ÿ’Ž Seniority level: Senior, 5+ years

๐Ÿ“ Location: United States

๐Ÿ’ธ Salary: 130000.0 - 200000.0 USD per year

๐Ÿ” Industry: Software Development

๐Ÿข Company: Sadaora

โณ Experience: 5+ years

๐Ÿช„ Skills: AWSDockerPythonETLGCPGitKubernetesMachine LearningMLFlowPyTorchAzureData engineeringTensorflowCI/CD

Requirements:
  • 5+ years of experience developing, deploying, and maintaining ML models in production environments.
  • Proficiency in Python and common ML libraries (e.g., Scikit-learn, TensorFlow, PyTorch, XGBoost).
  • Strong foundation in statistics, linear algebra, probability, and optimization.
  • Deep understanding of a range of ML techniques (regression, classification, clustering, NLP, deep learning).
  • Experience with cloud platforms such as AWS, GCP, or Azure.
  • Familiarity with containerization and orchestration tools (Docker, Kubernetes).
  • Solid understanding of software engineering principles, version control (Git), and CI/CD workflows.
Responsibilities:
  • Design, train, and evaluate machine learning models using best-in-class frameworks.
  • Architect scalable ML solutions and pipelines, from feature engineering to deployment.
  • Implement rigorous testing, validation, and monitoring processes to ensure model reliability in production.
  • Work closely with data engineers to shape the data architecture required for robust ML workflows.
  • Build efficient ETL pipelines to clean, preprocess, and transform large-scale datasets.
  • Partner with product managers, engineers, and business stakeholders to define ML use cases.
  • Collaborate with software engineers to integrate ML models into production-grade APIs and applications.
  • Translate complex ML concepts into business-relevant insights and recommendations.
  • Stay current with advancements in machine learning, AI, and related fields.
  • Experiment with new algorithms, architectures, and tools to continuously enhance our capabilities.
  • Contribute to a culture of experimentation, technical excellence, and intellectual curiosity.
Apply

Related Jobs

Apply

๐Ÿ“ United States, Canada

๐Ÿงญ Full-Time

๐Ÿ’ธ 127000.0 - 158700.0 USD per year

๐Ÿ” Remote Sensing

๐Ÿข Company: Planet๐Ÿ‘ฅ 501-1000๐Ÿ’ฐ $200,000,000 Post-IPO Equity over 3 years ago๐Ÿซ‚ Last layoff 9 months agoGeospatialRemote SensingBig DataAerospaceAnalyticsSoftware

  • Bachelor's or Master's degree in Computer Science or a related field
  • 4+ years of professional experience in software engineering of which 2+ years of this is experience in developing and designing Computer Vision and/or Machine Learning technologies and systems
  • Proficiency with Python and machine learning frameworks like TensorFlow or PyTorch
  • Proficiency with software engineering best practices such as version control, testing and continuous integration/continuous deployment (CI/CD)
  • Experience with containerization and container orchestration tools like Docker, Kubernetes, Flyte or Temporal
  • Experience implementing model versioning, monitoring and observability systems
  • Establish and maintain machine learning operations workflows for regular data generation
  • Run experiments to evaluate machine learning algorithms
  • ML operations to maintain production algorithms (monitoring, training, benchmarking, deploying, etc)
  • Develop and implement automated testing to ensure the reliability of deployed models
  • Contribute to full-stack development, from backend and APIs to DevOps tasks and occasional front-end work

DockerPythonData AnalysisImage ProcessingKubernetesMachine LearningPyTorchAlgorithmsTensorflowCI/CDRESTful APIsDevOpsSoftware Engineering

Posted 1 day ago
Apply
Apply

๐Ÿ“ United States

๐Ÿงญ Full-Time

๐Ÿ’ธ 185800.0 - 260100.0 USD per year

๐Ÿข Company: Reddit๐Ÿ‘ฅ 1001-5000๐Ÿ’ฐ $410,000,000 Series F over 3 years ago๐Ÿซ‚ Last layoff almost 2 years agoNewsContentSocial NetworkSocial Media

  • 2+ years of experience with industry-level deep learning models.
  • 2+ years of experience with mainstream ML frameworks (such as Tensorflow and Pytorch).
  • 3+ years of end-to-end experience of training, evaluating, testing, and deploying industry-level models.
  • 3+ years of experience of orchestrating complicated data generation pipelines on large-scale datasets.
  • Experience working with cross functional stakeholders across research, product & infrastructure to productize ML research
  • Knowledge of large scale search & recommender systems, or modern ads ranking/retrieval/targeting systems is preferred
  • Experience with deep learning, representation learning or transfer learning is preferred
  • Own end-to-end execution of ML-based targeting products like smart targeting expansion, keyword targeting, auto targeting, user lookalikes etc
  • Own offline & online experimentation of ML models for improving targeting products to drive advertiser outcomes
  • Research, implement, test, and launch new model architectures for retrieval using deep learning (GNNs, transformers, two tower models) with a focus on improving advertiser outcomes
  • Drive technical roadmaps and lead day to day project execution, and contribute meaningfully to team vision and strategy
  • Work on large scale data systems, backend services and product integration
  • Collaborate closely with multiple stakeholders cross product, engineering, research and marketing

AWSBackend DevelopmentPythonMachine LearningNumpyPyTorchData engineeringSparkTensorflow

Posted 3 days ago
Apply
Apply

๐Ÿ“ United States

๐Ÿงญ Full-Time

๐Ÿ” Software Development

๐Ÿข Company: Anomalo๐Ÿ‘ฅ 11-50๐Ÿ’ฐ $10,000,000 Series B 4 months agoData ManagementInformation TechnologySoftware

  • Strong Python expertise (ML engineering, data processing, and API development).
  • Experience deploying GenAI/LLM-based applications (e.g., chatbots, recommendation systems).
  • Familiarity with OpenAI, Anthropic Claude, and similar GenAI platforms.
  • Background in production-scale ML applications, particularly LLMs for enterprise use cases.
  • Develop and deploy GenAI-driven products, including customer-facing applications and internal tools for engineering teams.
  • Work with unstructured data and define product quality standards for a rapidly evolving space.
  • Collaborate with design partners and enterprise customers to understand how ML and LLMs can be applied effectively.
  • Build and refine retrieval-augmented generation (RAG) models and fine-tune LLMs for specific use cases.
  • Own projects from concept to production, ensuring scalability and performance in real-world environments.

AWSPythonMachine LearningAPI testing

Posted 5 days ago
Apply
Apply

๐Ÿ“ United States

๐Ÿงญ Full-Time

๐Ÿ’ธ 216700.0 - 303400.0 USD per year

๐Ÿ” Software Development

๐Ÿข Company: Reddit๐Ÿ‘ฅ 1001-5000๐Ÿ’ฐ $410,000,000 Series F over 3 years ago๐Ÿซ‚ Last layoff almost 2 years agoNewsContentSocial NetworkSocial Media

  • 5+ years of experience in machine learning engineering, with a strong focus on recommendation systems, representation learning, and deep learning.
  • Hands-on experience with Graph Neural Networks (GNNs), collaborative filtering, and large-scale embeddings.
  • Proficiency in Python and experience with ML frameworks such as PyTorch Geometric (PyG), Deep Graph Library (DGL), TensorFlow, or JAX.
  • Strong understanding of graph theory, network science, and representation learning techniques.
  • Experience building distributed training and inference systems using ML infrastructure components (data parallelism, model pruning, inference optimization, etc.).
  • Ability to work in a fast-paced environment, balancing innovation with high-quality production deployment.
  • Strong communication skills and the ability to collaborate cross-functionally with engineers, researchers, and product teams.
  • Design and implement scalable, high-performance machine learning models using Graph Neural Networks (GNNs), transformers, and knowledge graph approaches.
  • Develop and optimize large-scale embedding generation pipelines for Redditโ€™s recommendation systems.
  • Collaborate with ML infrastructure teams to enable efficient distributed training (multi-GPU, model/data parallelism) and low-latency serving.
  • Work closely with cross-functional teams (Ads, Feed Ranking, Content Understanding) to integrate embeddings into various personalization and ranking systems.
  • Drive feature engineering efforts, identifying and curating expressive raw data to enhance model effectiveness.
  • Monitor, evaluate, and improve model performance using A/B testing, offline metrics, and real-time feedback loops.
  • Stay up-to-date with the latest research in GNNs, transformers, and representation learning, bringing new ideas into production.
  • Participate in code reviews, mentor junior engineers, and contribute to technical decision-making.

PythonData AnalysisKerasMachine LearningMLFlowPyTorchAlgorithmsData StructuresTensorflowA/B testing

Posted 5 days ago
Apply
Apply

๐Ÿ“ United States

๐Ÿ’ธ 106000.0 - 120000.0 USD per year

๐Ÿ” Nonprofit

๐Ÿข Company: DataKind๐Ÿ‘ฅ 11-50๐Ÿ’ฐ $2,000,000 over 8 years agoArtificial Intelligence (AI)Big DataAnalyticsData Visualization

  • 3+ years of professional work experience in developing and deploying a machine learning product at scale
  • Foundational understanding of machine learning and statistical methods for predictive modeling
  • Expert in Python
  • Experience with cloud computing (GCP preferred)
  • Experience with databases (SQL, Postgres, PySpark, and/or other data query languages)
  • Experience with DataBricks or a similar data intelligence platform
  • Experience with data warehousing, orchestration, integration, and ETL tools
  • Experience with modern source code management and software repository systems (i.e. Git)
  • Experience documenting and implementing RESTful APIs
  • Design, build, test, and maintain machine learning pipeline architectures (70%)
  • Produce high-quality, reusable code for data ingestion, validation, and processing pipelines
  • Architect and implement end-to-end ML pipelines including training, retraining, and inference systems for schools using the SST
  • Design and build APIs to easily access, integrate, and manage data from different sources
  • Ensure data infrastructure is in compliance with data governance and security policies
  • Create comprehensive documentation for data infrastructure and ML pipelines, tailored for both technical and non-technical stakeholders
  • Advance internal analytics reporting and automation capabilities as needed
  • Provide direct data support to partners (15%)
  • Manage initial data lifecycle processes for new school onboarding including ingestion, transfer, audit, and validation
  • Collaborate with data platform partners on integration and data transfer pipelines
  • Provide technical guidance to partners on how to share data formatted in alignment with our data model and with appropriate data governance measures
  • Address partner concerns regarding data security and ensure their specific requirements are satisfied
  • Support data science initiatives through processing, cleaning, and analyzing data as needed
  • Collaborate and contribute across DataKind (15%)
  • Support other data team members through code reviews and knowledge sharing across products
  • Collaborate with the Product, Engineering, and Research teams to ensure seamless integration and alignment of work
  • Effectively communicate project status and manage expectations with internal teams and partner organizations
  • Maintain accurate and current project information in project management tools like Asana

PostgreSQLPythonSQLApache AirflowCloud ComputingETLGCPGitMachine LearningAPI testingData engineeringRESTful APIsData visualizationData modelingData management

Posted 7 days ago
Apply
Apply

๐Ÿ“ USA, Australia

๐Ÿงญ Fulltime

๐Ÿ” AI

๐Ÿข Company: Pluralis Research

  • Master's degree in Computer Science or related field, or equivalent experience.
  • Strong understanding of model parallelism techniques, distributed training architectures, and optimization methods.
  • Expert-level skills in PyTorch or similar frameworks, with experience scaling models across multiple devices.
  • Familiarity with networking concepts, distributed computing principles, and performance optimization.
  • Build and optimize systems for training large models across heterogeneous hardware connected by low-bandwidth networks.
  • Implement techniques to reduce communication overhead while maintaining model convergence in challenging network environments.
  • Design and develop robust training pipelines that can recover from node failures and network disruptions.
  • Create efficient systems for deploying sharded models in a protocol-locked environment.
  • Develop tools to track training progress, evaluate model quality, and identify bottlenecks in distributed environments.

PythonMachine LearningPyTorchAlgorithmsData StructuresNetworking

Posted 9 days ago
Apply
Apply

๐Ÿ“ USA

๐Ÿงญ Full-Time

๐Ÿ’ธ 260000.0 USD per year

๐Ÿ” Shopping, Generative AI

๐Ÿข Company: Wizard๐Ÿ‘ฅ 11-50Customer ServiceManufacturing

  • MS/Ph.D. in computer science, mathematics, or another quantitative discipline or exceptional work experience in ML engineering.
  • 5+ years ML Experience.
  • Experience developing machine learning-driven products at scale.
  • Experience solving problems using Machine Learning with PyTorch.
  • Exceptional understanding of designing ML systems in production at scale and applying a quantitative approach to identifying performance bottlenecks and cost/performance tradeoffs.
  • Practical experience with large data systems, data models, and batch and streaming data pipelines.
  • Ability to ramp up quickly on our tech stack, which includes GCP, Kubernetes, Airflow, Pandas, PyTorch, Python, and Node.js.
  • Extensive experience with model deployment, both static within applications and dynamic, using Flask, FastAPI, or similar frameworks.
  • Build and maintain a training pipeline for fine-tuning open-source LLMs. Testing different LLM architectures at different sizes and quantizations is important.
  • Serve and Deploy the LLM on GPUs. It is important to know different configurations ahead of time to ensure we donโ€™t OOM at runtime.
  • Formulate novel product issues to ML problems and propose and iterate on solutions quickly. We value the fast pace of development.
  • Triage customer issues to model/system shortcomings and communicate the complex architecture in a simple, comprehensible way to internal stakeholders.
  • Keep up with the fast-growing generative AI space and incorporate the latest developments into our system.

DockerNode.jsPythonSQLFlaskGCPGitKubernetesMachine LearningPyTorchAirflowAlgorithmsData engineeringData StructuresFastAPIREST APIPandas

Posted 10 days ago
Apply
Apply

๐Ÿ“ United States

๐Ÿงญ Full-Time

๐Ÿ’ธ 174000.0 - 242000.0 USD per year

๐Ÿ” Fintech

๐Ÿข Company: Mission Lane๐Ÿ‘ฅ 501-1000๐Ÿ’ฐ about 1 year agoCreditFinancial ServicesFinance

  • A good foundation in computer science (data structures, algorithms, design patterns) and apply solid fundamentals in your software engineering (test-driven development, code review, refactoring)
  • Developed and maintained production ML systems
  • Expertise with Python and cloud technology stacks like GCP or AWS
  • Excitement to architect machine-learning platforms
  • Interest in a wide range of ML solutions including established tools (e.g. Spark, Kubernetes, Airflow, MLFlow), emerging tools (like Chalk, BentoML, or DVC), and developing in-house tools
  • Partnered with business stakeholders to create, refine, and execute new ideas and helped improve data science workflows
  • Design, develop, and deploy machine learning systems to solve practical problems and support our data scientistsโ€™ and analysts' ability to create models and perform analysis.
  • Partner with business leaders and technical experts across the company to improve our feature extraction, model development process, data pipelines, and model serving.

AWSPythonDesign PatternsGCPKubernetesMachine LearningMLFlowAirflowAlgorithmsData StructuresREST APISparkSoftware Engineering

Posted 11 days ago
Apply
Apply

๐Ÿ“ United States

๐Ÿงญ Full-Time

๐Ÿ’ธ 216700.0 - 303400.0 USD per year

๐Ÿ” Software Development

๐Ÿข Company: Reddit๐Ÿ‘ฅ 1001-5000๐Ÿ’ฐ $410,000,000 Series F over 3 years ago๐Ÿซ‚ Last layoff almost 2 years agoNewsContentSocial NetworkSocial Media

  • 2+ years of experience with leading applied machine learning models with Tensorflow/Pytorch with large-scale ML systems
  • 5+ years of end-to-end experience of training, evaluating, testing, and deploying machine learning models
  • Experience with large scale data processing & pipeline orchestration tools like Spark, Dataflow, Kubeflow, Airflow, BigQuery
  • Experience working with nearest-neighbor search systems is a big plus
  • Experience building & improving MLOps tools and ML experimentation workflows
  • Experience working with cross functional stakeholders across research, product & infrastructure to productize ML research
  • Knowledge of large scale search & recommender systems, or modern ads ranking/retrieval/targeting systems is preferred
  • Experience with deep learning, representation learning or transfer learning is preferred
  • Tech lead experience in a product team is strongly preferred
  • Own end-to-end execution of ML-based targeting products like smart targeting expansion, keyword targeting, auto targeting, user lookalikes etc
  • Own offline & online experimentation of ML models for improving targeting products to drive advertiser outcomes
  • Research, implement, test, and launch new model architectures for retrieval using deep learning (GNNs, transformers, two tower models) with a focus on improving advertiser outcomes
  • Drive technical roadmaps and lead day to day project execution, and contribute meaningfully to team vision and strategy
  • Work on large scale data systems, backend services and product integration
  • Collaborate closely with multiple stakeholders cross product, engineering, research and marketing

KubeflowMachine LearningMLFlowPyTorchAirflowAlgorithmsApache KafkaData StructuresSparkTensorflow

Posted 12 days ago
Apply
Apply

๐Ÿ“ United States

๐Ÿ’ธ 119400.0 - 267500.0 USD per year

๐Ÿ” Software Development

  • 5+ years of professional experience in software engineering, data engineering, machine learning, or related field
  • Experience working in a team to write robust, maintainable, and extendable code in Python; containerized in Docker, and automated with CI/CD
  • Expertise with agile development and test driven development
  • Expertise with data structures, data modeling and software architecture
  • Experience with complex data sets, ETL pipelines, SQL, and general data engineering
  • Expertise with cloud architecture and technologies, in particular Amazon Web Services
  • Expertise with technologies like Spark, Kubernetes, Docker, Jenkins, Databricks, Terraform is also highly desirable
  • Develop robust advanced analytics and machine learning solutions
  • Own projects end-to-end - from conception to operationalization
  • Provide technical vision and guidance to teammates
  • Design and implement scalable applications that leverage prediction models and optimization programs
  • Contribute to core advanced analytics and machine learning platforms and tools

AWSDockerLeadershipPostgreSQLPythonSQLAgileETLKubernetesMachine LearningAlgorithmsAPI testingData engineeringData StructuresRedisSparkCommunication SkillsCI/CDTerraformData modelingSoftware Engineering

Posted 14 days ago
Apply