Machine Learning Engineer, ML Systems and Infrastructure

New
Canada; This role is fully remote-friendly, with team members distributed across the US and Canada.Full-TimeJunior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
At least 2 years
Required Skills
AWSPythonGCPAirflowAzureSparkCI/CDSoftware EngineeringDistributed Systems

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent industry experience
  • At least 2 years of industry experience in software engineering, machine learning infrastructure, distributed systems, data platforms, or related areas
  • Strong software engineering fundamentals, including coding, testing, debugging, and code quality
  • Proficiency in Python and experience building production-quality software
  • Experience with cloud platforms such as AWS, Azure, or GCP
  • Familiarity with containers, version control, CI/CD, and modern development workflows
  • Experience working with data-intensive systems, backend systems, or ML pipelines
  • Ability to work independently on well-defined problems with moderate ambiguity
  • Experience building data pipelines for large-scale structured and semi-structured technical datasets
  • Familiarity with data lineage, provenance, governance, and responsible data usage in ML systems
  • Familiarity with distributed data processing and orchestration systems such as Ray, Airflow, Spark, or similar platforms
  • Familiarity with model deployment, inference services, monitoring, and observability for production ML systems
  • Familiarity with ML-ready representations for geometry, graph, hierarchical, or multimodal data
  • Experience working with CAD, BIM, AEC, or other complex domain-specific data formats

Responsibilities

  • Build and maintain components of ML pipelines for data preparation, model training, evaluation, deployment, and monitoring
  • Develop reliable software and infrastructure that supports scalable machine learning workflows
  • Contribute to distributed data processing and training systems used by researchers and engineering teams
  • Support data ingestion, transformation, validation, and serving for large-scale structured and semi-structured technical datasets
  • Improve automation, testing, CI/CD, observability, and operational reliability for ML systems
  • Troubleshoot data, infrastructure, and performance issues in collaboration with senior engineers
  • Participate in design discussions and contribute ideas that improve system scalability, maintainability, and efficiency
  • Document technical decisions, workflows, and operational processes clearly
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now