Apply

Principal Site Reliability Engineer (Remote) KRWFH 1584

Posted 3 days agoViewed

View full description

💎 Seniority level: Principal, 7-10 years

🔍 Industry: Software Development

🏢 Company: Global InfoTek, Inc.

⏳ Experience: 7-10 years

Requirements:
  • Bachelor's degree in computer science, Mathematics, or equivalent technical degree; or equivalent industry experience.
  • Three-plus (3+) years of experience developing production software leveraging modern languages (including: Java, Python, Go, NodeJS, etc.)
  • One-plus (1+) years of experience developing containerized services deployed in production on orchestration platforms such as Kubernetes, Mesos, Swarm, etc.
  • Three-plus (3+) years of experience with agile and lean software development philosophies.
  • One-plus (1+) years of experience working with relational and/or non-relational databases e.g. PostgreSQL, MySQL, MongoDB, Elasticsearch etc.
  • Two-plus (2+) years of demonstrated experience with modern version control systems such as Git, Subversion, Mercurial, etc.
  • Five plus (5+) years, building and maintaining Kubernetes clusters across hybrid-cloud infrastructure
  • Eight-plus (8+) years of experience working in Operations, DevOps, or Site Reliability Engineering
  • Five-plus (5+) years in configuration / package management experience using tools like Terraform, Helm etc.
  • Five-plus (5+) years' experience with Cloud service monitoring like Prometheus, Grafana, FluentD, ElasticStack, Prometheus, SumoLogic, etc.
  • Exceptionally proficient (knowledge and work experience) in Linux system administration
  • Ability to assist with GitLab CI pipelines (build/promote artifacts and security scans)
  • Experience creating automation using APIs from Azure or Google Cloud
Responsibilities:
  • Build and maintain infrastructure as code on large scale multi-site deployments
  • Evaluate and assess new ways to scale platform capabilities
  • Automate workflows to help push the limit of the infrastructure and enable continuous delivery of capabilities onto a hybrid infrastructure
  • Troubleshoot issues until root causes are understood on high traffic production systems
  • Participate in design and code review processes
  • Interact with product owners to coordinate infrastructure changes
  • Be responsible for identifying bottlenecks and improving performance of the platform
Apply

Related Articles

Posted 14 days ago

Why remote work is such a nice opportunity?

Why is remote work so nice? Let's try to see!

Posted 7 months ago

Insights into the evolving landscape of remote work in 2024 reveal the importance of certifications and continuous learning. This article breaks down emerging trends, sought-after certifications, and provides practical solutions for enhancing your employability and expertise. What skills will be essential for remote job seekers, and how can you navigate this dynamic market to secure your dream role?

Posted 7 months ago

Explore the challenges and strategies of maintaining work-life balance while working remotely. Learn about unique aspects of remote work, associated challenges, historical context, and effective strategies to separate work and personal life.

Posted 7 months ago

Google is gearing up to expand its remote job listings, promising more opportunities across various departments and regions. Find out how this move can benefit job seekers and impact the market.

Posted 7 months ago

Learn about the importance of pre-onboarding preparation for remote employees, including checklist creation, documentation, tools and equipment setup, communication plans, and feedback strategies. Discover how proactive pre-onboarding can enhance job performance, increase retention rates, and foster a sense of belonging from day one.