Infrastructure Engineer
New
R
RoboflowComputer Vision
Roboflow is distributed across the US and Europe., Daytime hours in the USFull-TimeMiddle
Salary165,000 - 200,000 USD per year
Apply NowOpens the employer's application page
Job Details
- Required Skills
- AWSNode.jsPythonGCPKubernetesPyTorchTensorflowCI/CDTerraformHelm
Requirements
- Production experience with Kubernetes for containerized applications.
- Infrastructure-as-Code (IaC) skills: Terraform, Helm, bash, Python.
- Experience operating, monitoring, and scaling large-scale applications in AWS or GCP.
- Proficiency in Node.js and Python.
- Hands-on experience with ML/Big Data infrastructure (GPUs, Docker, Kubernetes).
- Familiarity with ML libraries like PyTorch or Tensorflow.
- CI/CD automation experience with GitHub Actions or Spacelift.
- Knowledge of cloud operations security practices.
- Ability to leverage AI tools to accelerate development.
Responsibilities
- Run and optimize high-availability machine learning inference service.
- Collaborate with customer security teams to ensure secure integration.
- Develop creative IaC solutions to scale platform cost-effectively.
- Define SLOs/SLAs and participate in incident response.
- Improve Observability and Alerting stack.
- Identify and act on cost-optimization opportunities.
- Contribute code (Python, JavaScript) to product features.
- Harden systems to meet SOC 2, HIPAA, and GDPR requirements.
- Participate in on-call rotation.
View Full Description & ApplyYou'll be redirected to the employer's site