Infrastructure Engineer

Roboflow, Computer Vision
Roboflow is distributed across the US and Europe; daytime hours in the US
Full-Time, Middle
Salary: 165,000 - 200,000 USD per year

Job Details

Required Skills
AWS, Node.js, Python, GCP, Kubernetes, PyTorch, TensorFlow, CI/CD, Terraform, Helm

Requirements

  • Production experience with Kubernetes for containerized applications.
  • Infrastructure-as-Code (IaC) skills: Terraform, Helm, Bash, Python.
  • Experience operating, monitoring, and scaling large-scale applications in AWS or GCP.
  • Proficiency in Node.js and Python.
  • Hands-on experience with ML/Big Data infrastructure (GPUs, Docker, Kubernetes).
  • Familiarity with ML libraries like PyTorch or TensorFlow.
  • CI/CD automation experience with GitHub Actions or Spacelift.
  • Knowledge of cloud operations security practices.
  • Ability to leverage AI tools to accelerate development.

Responsibilities

  • Run and optimize a high-availability machine learning inference service.
  • Collaborate with customer security teams to ensure secure integration.
  • Develop creative IaC solutions to scale the platform cost-effectively.
  • Define SLOs/SLAs and participate in incident response.
  • Improve the observability and alerting stack.
  • Identify and act on cost-optimization opportunities.
  • Contribute code (Python, JavaScript) to product features.
  • Harden systems to meet SOC 2, HIPAA, and GDPR requirements.
  • Participate in on-call rotation.