Apply

Senior SRE Engineer

Posted 4 months agoViewed

View full description

πŸ’Ž Seniority level: Senior, 5+ years

πŸ“ Location: USA

πŸ’Έ Salary: 170000 - 210000 USD per year

πŸ” Industry: Artificial Intelligence

πŸ—£οΈ Languages: English

⏳ Experience: 5+ years

πŸͺ„ Skills: AWSDockerPythonBashGCPKubernetesAzureGrafanaPrometheusCommunication SkillsTerraformScripting

Requirements:
  • Must have proven customer-facing and customer support experience.
  • 5+ years of experience in Linux system internals, scripting, and configuration management tools.
  • 5+ years of experience in running production systems over the cloud, using containerization technologies.
  • 5+ years of experience with cloud-based infrastructure services and related tools.
  • 5+ years of experience with monitoring applications.
  • 5+ years of experience using Helm charts to package, configure, and deploy Kubernetes applications.
  • Excellent communication skills, both verbal and written.
  • Passionate about troubleshooting and investigating in unfamiliar environments.
Responsibilities:
  • Develop and maintain all deployment options for Comet, including multi-cloud, on-premises, and bare-metal deployments.
  • Utilize Helm charts to package, configure, and deploy Kubernetes applications efficiently.
  • Quickly identify and resolve infrastructure bugs, ensuring high system availability and reliability.
  • Work closely with customers to understand their deployment needs and provide effective support.
  • Collaborate with cross-functional teams to ensure seamless integration and deployment of new features and updates.
Apply

Related Jobs

Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 180000.0 - 225000.0 USD per year

πŸ” Software Development

🏒 Company: PhantomπŸ‘₯ 51-100πŸ’° $109,000,000 Series B about 3 years agoCryptocurrencyEthereumBitcoinFinTech

  • 5+ years in a SRE or Software Engineer role.
  • Strong hands-on experience with Kubernetes (EKS) in production environments.
  • Proficiency with AWS infrastructure and services (EC2, S3, RDS, IAM).
  • Solid experience with Docker and Infrastructure-as-Code tools like Terraform or Pulumi.
  • Monitoring and observability experience using tools like Datadog or OpenTelemetry.
  • Manage and scale Kubernetes clusters on AWS EKS, ensuring reliability, performance, and security.
  • Implement and maintain Infrastructure-as-Code (Terraform/Pulumi) to automate infrastructure provisioning and management.
  • Monitor and optimize system performance, scalability, and resource utilization.
  • Configure and maintain crypto nodes across multiple blockchains to support our wallet’s operations.
  • Optimize and scale database infrastructure to handle terabytes of blockchain data efficiently.
  • Continuously improve system uptime, monitoring, and observability using tools like Datadog and OpenTelemetry.
  • Work closely with backend and product teams to support feature development and system scaling.

AWSDockerSQLAWS EKSBlockchainKubernetesCI/CDRESTful APIsLinuxDevOpsTerraform

Posted 10 days ago
Apply
Apply

πŸ“ US, Canada

🧭 Full-Time

πŸ” Cybersecurity

🏒 Company: Operant AIπŸ‘₯ 10-50

  • 3+ years of active hands-on SRE experience in a fast-paced engineering organization working with SaaS/cloud-native products.
  • Hands-on experience with Kubernetes, Golang, and Python.
  • Knowledge of major cloud providers like AWS, GCP, Azure and their automation toolchains.
  • Excellent communication skills and ability to work independently.
  • Build and lead the DevOps/SRE functions for the company's product including monitoring for availability, security, reliability, and scale.
  • Document and codify best practices around operational behavior.
  • Build infrastructure for incident management, CI/CD, and maintain security best practices.
  • Track SOC2 compliance and create on-call schedules.

AWSPythonCybersecurityGCPKubernetesAzureGoCommunication SkillsCI/CD

Posted 4 months ago
Apply