Senior Site Reliability Engineer (Remote from Washington)

Posted about 3 hours agoViewed
WashingtonFull-TimeSoftware Development
Company:
Location:Washington
Languages:English
Seniority level:Senior, 5+ years
Experience:5+ years
Skills:
DockerPythonSoftware DevelopmentCloud ComputingKubernetesClickhouseGoGrafanaPrometheusCI/CDLinuxDevOpsTerraformMicroservicesNetworkingTroubleshootingSoftware EngineeringDebugging
Requirements:
5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role. Proficiency in Go and Python. Experience with Kubernetes, cloud systems, and distributed systems development. Familiarity with observability and monitoring tools (Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, Loki). Strong skills in debugging, optimizing code, and troubleshooting. Solid working knowledge of Linux and containerization technologies. Excellent collaboration, communication, and problem-solving abilities.
Responsibilities:
Design, build, and maintain resilient, high-performance systems. Enhance infrastructure and platform services for deployment, observability, and operational excellence. Develop automation tools to reduce manual tasks and mitigate risks. Monitor, troubleshoot, and optimize network, system, and service performance. Participate in incident response and conduct blameless postmortems. Contribute to open-source projects. Share on-call responsibilities.
Similar Jobs:
Posted 20 minutes ago
United StatesFull-TimeApplication Development
Senior Professional Application Developer
Company:HJ Staffing
Posted 26 minutes ago
United States, CanadaFull-TimeSaaS
Front End Engineer (Remote)
Company:Files.com
Posted about 1 hour ago
United States, CanadaFull-TimeSoftware Development
Full-Stack Software Engineer, Product Team