- Deep expertise in model deployment and in scaling production ML serving systems
- Experience with versioning, rollouts, rollback strategies, and live experimentation
- A low-latency mindset for inference optimization (model graphs, quantization, caching, batching, feature retrieval)
- Systems fluency: robust, high-performance code in Go, Rust, C++, or Java, and bridging to Python
- Operational maturity: monitoring drift, tracking model lineage, ensuring observability
- Infrastructure intuition: reproducible and portable serving systems
- Applied ML understanding: reasoning about model performance and trade-offs