Senior Software Engineer, Storage

New
C
CriblTelemetry Infrastructure
Location: Remote - United StatesFull-TimeSenior
Salary175,000 - 205,000 USD per year
Apply NowOpens the employer's application page

Job Details

Required Skills
AWSNode.jsTypeScriptGoCI/CDLinuxTerraformDistributed Systems

Requirements

  • Significant experience building platform/infrastructure systems that manage, scale, and operate distributed services autonomously.
  • Strong software engineering skills in TypeScript/Node.js, Go, or similar languages.
  • Deep hands-on experience with infrastructure-as-code (Terraform) and AWS services (EC2, ECS/EKS, ASGs, DynamoDB, S3, CloudWatch).
  • Experience designing and implementing autoscaling systems, cluster orchestration, or automated provisioning for stateful workloads.
  • Track record operating data-intensive systems at scale (OLAP databases, NoSQL stores, or distributed storage platforms).
  • Strong platform engineering fundamentals: SLOs, error budgets, capacity planning, and incident response.
  • Strong understanding of Linux systems, networking, and performance profiling at the infrastructure level.

Responsibilities

  • Design and build autoscaling systems for storage clusters — automated provisioning, scale-up/scale-down policies, cluster rebalancing, and node lifecycle management.
  • Own the infrastructure-as-code stack (Terraform) that defines and deploys storage infrastructure end-to-end on AWS.
  • Build self-healing automation: health checks, automated failover, capacity rebalancing, and remediation controllers.
  • Develop the CI/CD pipelines and deployment tooling for storage services.
  • Design and implement observability for the entire storage platform — metrics, dashboards, SLOs, and capacity forecasting.
  • Own cluster management tooling: provisioning new tenants, managing cluster topology, coordinating upgrades and migrations.
  • Drive performance and cost optimization across the storage data path: ingest pipelines, compaction, partitioning, and query execution.
  • Partner with product engineering to define scalability limits and load test new features.
  • Contribute to incident response and lead blameless post-mortems.
View Full Description & ApplyYou'll be redirected to the employer's site
175,000 - 205,000 USD per year
Apply Now