Senior/Staff AI Cloud Infra Engineer

Posted 6 months ago
100,000 - 120,000 USD per year
United States, Full-Time, Software Development
Company: Comfy
Location: United States, EST, PST
Languages: English
Seniority level: Relevant experience
Experience: Relevant experience
Skills:
AWS, Backend Development, Python, Artificial Intelligence, Cloud Computing, GCP, Kubernetes, Machine Learning, Azure, CI/CD, Linux, DevOps, Terraform, Software Engineering
Requirements:
Relevant experience as an AI Cloud Infra Engineer at a high-tech startup.
Experience participating in incident management processes.
Strong foundation in, and experience managing, cloud infrastructure (AWS, GCP, or Azure).
Solid understanding of container orchestration (Kubernetes preferred) and of CI/CD principles and tools.
Excellent communication skills.
Proven ability to learn fast and ship quality infrastructure code and configurations.
Responsibilities:
Develop and maintain the core Python platform for request routing, AI workload orchestration, GPU server capacity management, and observability.
Develop and maintain the infrastructure layer that manages GPU workers, using Terraform and cloud provider APIs.
Own and operate the underlying platform technologies (e.g., K8s, Prometheus, Datadog).
Architect and implement solutions that affect service performance and availability.
Collaborate with the core engineering team to design and build new infrastructure systems.
Contribute to the vision and foundation for future infrastructure development.
Shape technical direction and infrastructure best practices.