Site Reliability Engineer

Serbia; Podgorica, Podgorica Municipality, Montenegro; Armenia; Georgia; Turkey
Full-Time
Middle
Salary not disclosed

Job Details

Languages
English (Intermediate/B1 or higher)
Required Skills
Python, SQL, Bash, Kubernetes, Microsoft SQL Server, Grafana, Prometheus

Requirements

  • Strong SQL skills (T-SQL preferred), including query optimization, performance tuning, and data integrity management.
  • Hands-on experience with Microsoft SQL Server, database design, migrations, and partitioning strategies.
  • Experience with monitoring and observability tools (Prometheus, Grafana, ELK).
  • Familiarity with cloud platforms (AWS, GCP, Azure).
  • Proficiency in Python and scripting (Bash/PowerShell).
  • Basic understanding of networking concepts (HTTP, DNS, CDN).
  • Experience with Apache Airflow, Docker, Kubernetes, Ansible/IaC, and CI/CD tools (GitLab, Jenkins).
  • Strong communication and collaboration skills.
  • English level: Intermediate (B1) or higher.

Responsibilities

  • Identify, analyze, and resolve issues in production and non-production systems.
  • Participate in incident response, root cause analysis, and follow-up actions.
  • Take part in an on-call rotation and support production incidents.
  • Develop and improve the observability system.
  • Collect and analyze metrics from operating systems, infrastructure, and applications.
  • Use monitoring data to support performance tuning, fault finding, and capacity planning.
  • Implement, maintain, and improve CI/CD processes.
  • Reduce manual work through automation and continuous improvement.
  • Partner with development teams to improve service reliability, testing, deployment, and release processes.
  • Create and maintain technical documentation, runbooks, and operational guides.