Apply📍 Germany, USA
🔍 Generative image and video models
🏢 Company: Black Forest Labs👥 20-100💰 $30,202,193 Seed 10 months agoArtificial Intelligence (AI)Media and EntertainmentGenerative AISoftware
- Strong proficiency in cloud platforms (AWS, Azure, or GCP) with focus on ML/AI services.
- Extensive experience with Kubernetes and Slurm cluster management.
- Expertise in Infrastructure as Code tools (e.g., Terraform, Ansible).
- Proven track record in managing and optimizing network-based cloud file systems and object storage.
- Experience with CI/CD tools and practices (e.g., CircleCI, GitHub Actions, ArgoCD).
- Strong understanding of security principles and best practices in cloud environments.
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Loki).
- Familiarity with ML workflows and GPU infrastructure management.
- Demonstrated ability to handle complex migrations and breaking changes in production environments.
- Design, deploy, and maintain cloud-based ML training (Slurm) and inference (Kubernetes) clusters.
- Implement and manage network-based cloud file systems and blob/S3 storage solutions.
- Develop and maintain Infrastructure as Code (IaC) for resource provisioning.
- Implement and optimize CI/CD pipelines for ML workflows.
- Design and implement custom autoscaling solutions for ML workloads.
- Ensure security best practices across the ML infrastructure.
- Provide developer-friendly tools and practices for efficient ML operations.
AWSGCPKubernetesAzureGrafanaPrometheusCI/CDTerraform
Posted 7 months ago
Apply