Apply📍 United States, Canada
💸 114800.0 - 135000.0 USD per year
🔍 Software Development
🏢 Company: Abnormal👥 501-1000💰 $250,000,000 Series D 10 months agoArtificial Intelligence (AI)EmailInformation TechnologyCyber SecurityNetwork Security
- 4+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
- Proficiency with cloud providers (AWS preferred), Kubernetes, and Docker.
- Experience with infrastructure as code tools (Terraform, Ansible, or Pulumi).
- Strong scripting skills in Python, Bash, or similar.
- Familiarity with CI/CD systems such as GitHub Actions, Jenkins, or CircleCI.
- Understanding of networking, security, and identity management in cloud environments.
- Experience supporting ML workloads and GPU-based infrastructure.
- Ability to troubleshoot complex system issues in a distributed environment.
- Comfortable working across functional teams and communicating with technical and non-technical stakeholders.
- Architect and manage infrastructure that supports AI/ML pipelines, tools, and data platforms.
- Implement and maintain containerization (e.g., Docker) and orchestration (e.g., Kubernetes) environments.
- Develop CI/CD systems that integrate with ML workflows and ensure reproducible AI experiments.
- Collaborate with security and compliance teams to ensure infrastructure meets data protection standards.
- Automate provisioning and deployment using IaC tools like Terraform or Pulumi.
- Monitor and troubleshoot infrastructure issues with tools like Prometheus, Grafana, and ELK stack.
- Partner with AI and software engineers to optimize platform performance and resource utilization.
- Maintain clear, accessible documentation to scale platform knowledge across the org.
AWSDockerPythonBashCloud ComputingKubernetesMLFlowGrafanaPrometheusCI/CDLinuxDevOpsTerraformNetworkingAnsibleScripting
Posted 8 days ago
Apply