Apply📍 Americas
🧭 Full-Time
💸 160000.0 - 180000.0 USD per year
🔍 Software Development
- 7+ years of professional experience as a Site Reliability Engineer, with proven experience leading large complex projects affecting production SaaS environments.
- Professional experience with relational database systems, managing the servers and tuning performance, particularly MySQL.
- Proven experience managing scale, reliability and performance challenges managing distributed applications on cloud infrastructure (Google Cloud Platform is advantageous), both managed and self-hosted solutions.
- Proven ability to build cloud infrastructure using Terraform and develop operational tooling in various languages including Golang and Bash.
- Deep knowledge of UNIX environments and modern collaborative development practices.
- Excellent communication skills, both verbal and written, with a collaborative mindset to make informed, empathetic decisions.
- Ability to work autonomously in your timezone, advancing tasks and projects with minimal guidance.
- Demonstrated ability to influence product direction and contribute technical insights that help drive business value.
- A strong focus on proactive identification and resolving issues in production environments.
- A self-starter who thrives in both synchronous and asynchronous work environments.
- Architect and maintain critical infrastructure to enable Customer.io to scale and handle real-time processing of billions of messages.
- Strategically plan and implement infrastructure growth to meet evolving demands and repeatability.
- Streamline and automate processes for efficiency and reliability, removing manual toil.
- Participate in on-call rotations to swiftly address availability incidents and support technical engineers with customer-related issues.
- Develop observability to ensure comprehensive monitoring and effective alerting of infrastructure and applications.
- Troubleshoot and resolve production issues across various services and stack levels.
- Contribute to a collaborative and supportive team environment, fostering individual, professional, and team growth.
- Engage in continuous learning and knowledge sharing through code reviews, pair programming, and team collaborations to refine best practices.
Backend DevelopmentSQLBashCloud ComputingGCPKubernetesMySQLREST APICI/CDLinuxDevOpsTerraformMicroservicesTroubleshootingSaaS
Posted 28 days ago
Apply