Staff SRE

Posted 3 months agoViewed
United StatesFull-TimeSoftware Development
Company:Blip Global
Location:United States, EST, PST
Languages:English
Seniority level:Staff, Solid experience
Experience:Solid experience
Skills:
LeadershipPythonCloud ComputingJavaKubernetesMicrosoft .NETGoGrafanaPrometheusCI/CDDevOpsTerraformMicroservicesAnsibleSoftware Engineering
Requirements:
Degree in Software Engineering, Computer Science, or related fields. Solid experience in technical leadership and SRE role. Practical experience in development (preferably .NET, but experience in other languages like Python, Go, or Java is valued). Solid knowledge in cloud computing (Azure, AWS, and/or GCP). Experience with Kubernetes in production environments. Experience with observability using Grafana, Prometheus, ELK, OpenTelemetry, or similar. Proficiency in IaC (Infrastructure as Code) and automation (Terraform, Ansible, Python, etc.). Experience with resilience and chaos engineering practices. Analytical profile oriented towards reliability metrics (SLIs, SLOs, SLAs, error budgets). Proactive mindset, ownership, collaboration, and strong communication skills.
Responsibilities:
Understand and influence platform architecture, identifying risks and proposing reliability improvements. Apply DevOps and SRE practices focusing on automation, resilience, and 'You Build It, You Run It' culture. Design and implement self-healing strategies, mitigating incidents and reducing operational toil. Conduct technical investigations and failure diagnostics with a focus on root cause analysis. Evolve and maintain CI/CD pipelines, increasing visibility, security, and delivery quality. Utilize observability (logs, metrics, traces, intelligent alerts) to anticipate incidents and identify improvement opportunities. Support and mentor the team in strategic technical decisions, acting as a reference in reliability and scalability.
About the Company
Blip Global
View Company Profile
Similar Jobs:
Posted 4 months ago
United StatesFull-TimeCredit and Lending
Staff Software Engineer - SRE, Backend (Reliability Engineering)
Company:Affirm
Posted 5 months ago
United StatesFull-TimeAI Risk Decisioning
Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)
Company:Oscilar
Posted about 1 month ago
United StatesFull-TimeCrypto
SRE / DevOps Engineer
Company:Kraken