Full-time
Software Development
- 5+ years owning infrastructure end-to-end, ideally in startup environments.
- Comfortable at every layer, from bare-metal servers and NVMe drives to container orchestration and cloud-native tools.
- Strong Linux fundamentals, and you know your way around networking, storage, and distributed systems.
- Can code well enough to automate, debug, and build tooling across a variety of languages.
- Communicate clearly and collaborate well, especially with engineers who aren't infra specialists.
- Designing and building a custom GPU cluster for deep learning workloads.
- Deciding how we manage and scale our infrastructure, both on-prem and in the cloud.
- Keeping systems running smoothly and securely, from data pipelines to distributed training jobs.
- Troubleshooting weird kernel errors, configuring systemd units, or debugging Kubernetes evictions.
- Making calls on when to script, when to automate, and when to just fix the thing.
Posted 14 days ago