SysOps Engineer – Monitoring & Cloud Operations
India, Full-Time, Middle
Salary not disclosed
Job Details
- Required Skills: AWS, GCP, Azure, Grafana, Prometheus, Linux, Datadog
Requirements
- Bachelor’s degree in Computer Science, Engineering, Information Technology, or equivalent practical experience.
- Proven experience in SysOps, Cloud Operations, SRE, or Infrastructure Support roles.
- Strong hands-on experience with Linux and Windows system administration.
- Experience using monitoring and observability tools such as New Relic, Prometheus, Grafana, Datadog, or equivalent.
- Solid understanding of incident management, problem management, and root cause analysis.
- Experience working with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Strong knowledge of disaster recovery, backup strategies, and business continuity planning.
- Familiarity with infrastructure components such as virtual machines, compute instances, and physical servers.
- Understanding of web and system services such as Nginx, IIS, and systemd.
- Strong analytical and troubleshooting skills.
- Excellent communication and collaboration skills.
Responsibilities
- Monitor infrastructure and production systems using observability tools such as New Relic, Prometheus, Grafana, or similar platforms.
- Configure and maintain alerts, dashboards, and service-level monitoring.
- Lead incident management activities including troubleshooting, root cause analysis (RCA), and post-incident reporting.
- Ensure system uptime, performance, and SLA compliance across cloud and on-premises environments.
- Manage operating-system-level tasks on Linux and Windows.
- Oversee backup processes and validate restoration procedures.
- Execute and support disaster recovery (DR) plans.
- Collaborate with DataOps and infrastructure teams to ensure system resilience.
- Perform capacity planning, performance optimization, and infrastructure health assessments.
- Maintain operational documentation, including runbooks, monitoring guidelines, and incident playbooks.