Senior Site Reliability Engineer, Fleet Management
MongoDB
Database, Cloud Infrastructure
This role can be based out of our Austin, Boston, Los Angeles, New York City, Raleigh, or San Francisco offices, remotely in the United States region, or our European office in Dublin.
Full-Time
Senior
Salary: 127,000 - 249,000 USD per year
Job Details
- Experience
- 6+ years
- Required Skills
- AWS, Python, GCP, Kubernetes, Azure, Go, Linux, Terraform, Networking
Requirements
- 6+ years of experience in software development and operating distributed systems
- Proficiency in Go, Python, or a similar language
- Deep experience using and extending containerization technologies, preferably Kubernetes
- Solid understanding of Linux operating system internals and networking concepts (e.g., filesystems, TCP/IP, DNS, TLS)
- Strong operational ownership and track record of debugging complex production issues
- Experience with Kubernetes ecosystem tools and interfaces such as Helm, Kustomize, Gatekeeper, Kyverno, CRDs/Operators, CRI, and CSI
- Expertise in cloud infrastructure platforms such as AWS, GCP, or Azure
- Proficiency in provisioning infrastructure using tools like Terraform, Crossplane, and AWS Controllers for Kubernetes (ACK)
Responsibilities
- Contribute to developing and maintaining a scalable and secure runtime environment on top of Kubernetes that supports product needs across MongoDB
- Provide internal support for our Kubernetes ecosystem, partnering with engineering teams to help them solve domain-specific problems
- Participate in a 24/7 on-call rotation to resolve critical issues
- Prioritize blameless post-mortems and dedicate engineering time to systemic fixes