Remote DevOps Engineer Jobs

Grafana
279 jobs found. to receive daily emails with new job openings that match your preferences.
279 jobs found.

Set alerts to receive daily emails with new job openings that match your preferences.

Apply

📍 United States, Canada

🧭 Full-Time

💸 168256.0 - 201907.0 USD per year

🔍 Software Development

🏢 Company: Grafana Labs👥 501-1000💰 $240,000,000 Series D almost 3 years agoSoftware Development

  • Experience with authentication and authorization systems at scale
  • Experience with Go
NOT STATED

Backend DevelopmentDockerPostgreSQLSQLCloud ComputingFrontend DevelopmentGitKubernetesLDAPMySQLReact.jsSQLiteTypeScriptGoGrafanaCI/CDRESTful APIsMicroservicesSoftware EngineeringDebugging

Posted 2 days ago
Apply
Apply

📍 United Kingdom

🏢 Company: careers_gm

  • Proficiency in at least one programming language (e.g., Python, Go, Java) and familiarity with multiple language ecosystems.
  • Solid understanding of operating systems, networking, distributed systems, databases, and storage architectures.
  • Deep understanding of how code runs on underlying hardware, including operating systems, algorithms, and data structures. Ability to optimize or troubleshoot code by understanding its execution and the impact on system resources.
  • Experience handling production incidents, including root cause analysis, mitigation, and working through complex system failures.
  • Strong communication skills, with an ability to explain technical concepts to both engineering and business stakeholders. Commitment to collaborative problem-solving and shared ownership of services.
  • Proven experience in automating manual processes, building deployment pipelines, or managing configuration systems
  • Develop tools and software to automate operational processes, improve system reliability, and reduce manual intervention.
  • Lead, Implement and improve monitoring and observability frameworks, enabling proactive detection and resolution of incidents.
  • Participate in an on-call rotation to diagnose, troubleshoot, and mitigate production incidents, ensuring minimal downtime and swift resolution.
  • Work alongside developers to ensure the quality, scalability, and reliability of our services. Practice shared ownership of services in production, fostering a "You build it, you run it" culture.
  • Manage Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) to manage reliability expectations effectively.
  • Strong understanding of common application reliability patterns, with hands-on experience implementing them.
  • Conduct deep-dive analyses of incidents and collaborate on post-incident reviews to derive learnings and prevent recurrence. Champion a culture of continuous improvement.
  • Evaluate system performance and advocate for optimisations that reduce infrastructure costs while maintaining service reliability.

AWSBackend DevelopmentDockerPostgreSQLPythonSQLCloud ComputingGCPJavaJava EEJenkinsKafkaKubernetesSpring BootSpring MVCZabbixAlgorithmsAzureData StructuresGoGrafanaJava SpringPrometheusRDBMSCI/CDRESTful APIsLinuxDevOpsTerraformMicroservicesNetworkingAnsibleScriptingDebugging

Posted 2 days ago
Apply
Apply

📍 Argentina, Colombia

🧭 Full-Time

💸 4000.0 USD per month

🏢 Company: Workana

  • More than 5 years of experience in complex integrations in production environments.
  • Experience in integration design with REST, GraphQL, Webhooks APIs.
  • Knowledge of asynchronous messaging (Kafka, RabbitMQ, Redis Streams).
  • Experience with Swagger/OpenAPI.
  • Familiarity with Erlang and PHP (it is not necessary to program, but yes to understand them).
  • Experience with Docker containers and Linux administration.
  • Experience with Kubernetes (desirable).
  • Knowledge of observability (Datadog, Prometheus, Grafana, ELK, etc.).
  • Experience resolving incidents in production environments.
  • Design and maintain integration architecture, ensuring scalability, resilience, and efficiency.
  • Develop bidirectional integration strategies between clients and the platform.
  • Define and manage integration interfaces: APIs, connectors, and adapters.
  • Design orchestration solutions: data flows, real-time synchronization, and event handling.
  • Manage and evolve the platform's APIs with a focus on versioning and stability.
  • Maintain and operate integrations with messaging/event technologies like Kafka, RabbitMQ or Redis Streams.
  • Ensure observability of integrations: metrics, logs, and traceability.
  • Work with Docker and Kubernetes environments.
  • Diagnose and resolve production issues.
  • Document integration flows, architectures, and best practices.
  • Collaborate with technical teams and clients to ensure a smooth integration experience.

DockerGraphQLErlangKubernetesOAuthRabbitmqApache KafkaAPI testingGrafanaPrometheusREST APIRedisCI/CDRESTful APIsLinuxTerraformDocumentationMicroservicesJSONAnsible

Posted 3 days ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 95000.0 - 160000.0 USD per year

🔍 Cybersecurity

🏢 Company: crowdstrikecareers

  • 5-7+ years of experience in Site Reliability Engineering (SRE), DevOps, or Cloud Infrastructure roles.
  • Experience managing Virtual Desktop Infrastructure (VDI) solutions such as Citrix, VMware Horizon, or AWS WorkSpaces.
  • Hands-on experience with AWS GovCloud (Azure/GCP is a plus).
  • Strong expertise in Infrastructure as Code (Terraform, CloudFormation).
  • Experience with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK, Datadog, Splunk).
  • Expertise in IAM and PAM solutions such as Okta, CyberArk, or AWS IAM.
  • Strong scripting and automation skills (Python, Bash, PowerShell).
  • Experience with CI/CD pipelines and DevOps workflows.
  • Familiarity with FedRAMP, NIST 800-53, DoD IL 4/5 compliance standards.
  • Hands-on experience with VDI management, performance tuning, and security hardening.
  • Architect, deploy, and maintain highly available, scalable, and secure systems in AWS GovCloud (Azure and GCP experience is a plus).
  • Automate infrastructure provisioning, scaling, and failover using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Implement SLOs, SLIs, and error budgets to drive reliability improvements.
  • Optimize cloud infrastructure for performance, cost-efficiency, and resilience while adhering to compliance requirements.
  • Manage and optimize Virtual Desktop Infrastructure (VDI) solutions to ensure seamless user experience, performance, and security.
  • Deploy and manage monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, Datadog, Splunk, ELK).
  • Implement automated self-healing mechanisms and proactive monitoring solutions.
  • Lead incident response, postmortems, and root cause analysis (RCA) to prevent future system disruptions.
  • Ensure 24/7 system uptime through on-call rotation and escalation handling.
  • Implement Identity and Access Management (IAM) best practices, including SSO, MFA, and RBAC across cloud environments.
  • Automate IAM governance and Privileged Access Management (PAM) to enforce the principle of least privilege.
  • Ensure audit readiness by maintaining accurate security configurations, logs, and compliance reports.
  • Work with security teams to align IAM and Zero Trust Architecture (ZTA) strategies with organizational policies.
  • Develop and maintain CI/CD pipelines for automated deployments and configuration management.
  • Use Python, Bash, or PowerShell to automate routine SRE workflows and security compliance checks.
  • Implement immutable infrastructure and support DevSecOps best practices.
  • Manage and optimize VDI environments, ensuring seamless DevOps integration for development and operational teams.
  • Contribute to chaos engineering and failure injection testing to enhance system resiliency.
  • Work closely with DevOps, IT Security, and Compliance teams to ensure system integrity and uptime.
  • Provide mentorship to junior engineers and contribute to knowledge-sharing initiatives.
  • Participate in architectural discussions and help drive improvements in cloud reliability and security posture.

AWSDockerPythonBashCloud ComputingCybersecurityGCPKubernetesAzureGrafanaPrometheusCI/CDLinuxDevOpsTerraformComplianceAnsibleScripting

Posted 3 days ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: BioRender👥 101-250💰 $15,319,133 Series A almost 2 years agoLife ScienceGraphic DesignSoftware

  • 10-12+ years of experience in the software/DevOps/SRE realm
  • Strong programming skills in 2 or more of these languages: javascript, typescript, python, Go
  • Ability to troubleshoot complex distributed systems at scale
  • Database Performance Monitoring and best practices
  • Comfortable innovating and establishing new practices, processes, and tooling
  • Strong analytical skills, system design, and architecture for cloud applications
  • CI/CD, configuration management, monitoring, and automation expertise
  • Advanced knowledge of observability and best practices (ELK, Datadog, OpenTelemetry, Prometheus, Grafana)
  • Deployment and orchestration via AWS ECS, k8s, CloudRun etc.
  • Understanding of Linux, virtualization, networking, VPCs, firewalls, security groups
  • Hands-on knowledge of AWS and resources provisioning via CLI/API/IaC
  • Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience.
  • Enhance platform resilience by constantly seeking ways to improve the reliability, scalability and release efficiency of the platform
  • Develop Robust Observability and Monitoring Solutions: Define, build, deploy, maintain, and extend advanced observability and monitoring tools to bolster system reliability and availability.
  • Define and Monitor Performance Metrics: Play a key role in formulating and tracking Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to establish precise benchmarks for system performance.
  • Solve Complex Issues and Conduct Root Cause Analysis: Swiftly respond to escalated incidents, troubleshoot intricate system and application problems, and conduct thorough root cause analyses to implement corrective measures.
  • Thought Leadership and Innovation: stay up to date with the latest industry trends and emerging technologies and iterate on best practices to increase the quality & velocity of development and deliverables.
  • Architect Scalable and Reliable Systems: Lead in the design and architecture of scalable, distributed, fault-tolerant systems that uphold performance and reliability standards.
  • Mentorship and Evangelism: Champion the adoption of new technologies, disseminate best practices, and advocate for architectural patterns. Mentor and guide fellow engineers in the organization.

AWSDockerLeadershipPythonSQLBashCloud ComputingJavascriptKubernetesSoftware ArchitectureTypeScriptGoGrafanaPrometheusREST APICommunication SkillsAnalytical SkillsCI/CDProblem SolvingAgile methodologiesRESTful APIsMentoringLinuxDevOpsTerraformWritten communicationMicroservicesNetworkingAdaptabilityTeamworkTroubleshootingActive listeningStrong work ethicStrong communication skillsAnsibleSoftware EngineeringDebugging

Posted 3 days ago
Apply
Apply

📍 Germany

💸 250000.0 - 260000.0 EUR per year

🔍 Infrastructure Technology Sales

🏢 Company: Grafana Labs👥 501-1000💰 $240,000,000 Series D almost 3 years agoSoftware Development

  • 5+ Years of Experience in Infrastructure Technology Sales
  • Demonstrated history of consistent goal achievement in a highly competitive environment (top 10% performer)
  • Energetic, upbeat, entrepreneurial, tenacious team player
  • Adaptable and with demonstrable experience in high velocity technology companies
  • Experience using Salesforce
  • Familiarity with open source technology is a significant advantage
  • You will need to be an excellent communicator in all channels (in person, online, in writing) and able to form strong working relationships both in person and virtually
  • Experience using Command of the Message and MEDD(P)ICC is ideal
  • Meet and exceed individual quarterly and annual sales goals
  • Outbound prospecting into net-new customers
  • Manage all aspects of the sales process (prospecting, sales meetings, product demos, proofs of concept, proposals, and negotiations)
  • Cultivate sales through outbound prospecting and inbound leads
  • Be able to understand and convey the value of both Grafana Cloud and Grafana Enterprise
  • Become an expert in managing your sales pipeline in Salesforce
  • Manage quote creation, order processing, and day-to-day customer requests

SalesforceGrafanaCommunication SkillsCustomer serviceRESTful APIsAdaptabilityAccount ManagementNegotiation skillsRelationship managementSales experienceLead GenerationCRMEnglish communicationSaaS

Posted 3 days ago
Apply
Apply

📍 United Kingdom

🔍 Blockchain

🏢 Company: IO Global

  • Proficiency in Python, Bash, Terraform, Nix for DevOps services.
  • Extensive experience with AWS, specifically with services like EKS and RDS.
  • Familiarity with Container orchestration (e.g. Kubernetes) is essential.
  • Hands-on experience with PostgreSQL and its deployment on RDS.
  • Knowledge of monitoring tools (e.g., Prometheus, Grafana, Loki).
  • Solid troubleshooting and performance tuning capabilities.
  • Exceptional communication skills and team collaboration ethic.
  • Experience with CI/CD (e.g. Github Actions, Hydra, Earthly).
  • Design, write, and deliver tools and software primarily using Python, Bash, Terraform or Nix to improve the availability, scalability, and efficiency of our services.
  • Engage in and refine the whole lifecycle of services, from inception and design, through deployment, operation, and continuous improvement.
  • Practice sustainable incident response and promote blameless postmortems.
  • Collaborate with the development teams to ensure that solutions are designed with customer experience, scalability, and performance in mind.
  • Analyze system performance and reliability, offering recommendations for enhancement.
  • Develop and uphold service-level objectives (SLOs), service-level indicators (SLIs), and error budgets for our services.
  • Participate in on-call rotations, responding to and mitigating service interruptions and technical challenges.

AWSPostgreSQLPythonAmazon RDSAWS EKSBashKubernetesGrafanaPrometheusCI/CDDevOpsTerraform

Posted 3 days ago
Apply
Apply

📍 UK, Sweden, Spain, Germany

🧭 Full-Time

💸 80571.0 - 100713.0 EUR per year

🔍 Software Development

  • Solid experience with at least one programming language.
  • Some experience with delivering projects from gathering requirements, brainstorming ideas all the way to shipping a product to the customer’s hands in a self-driven way
  • Some experience with developing software that runs in the Cloud or some experience with systems engineering
  • Some experience with being on-call and following the DevOps model
  • Experience writing clean, robust, and performant software that is easily maintained by others
  • Familiarity with observability systems, know when to use metrics, logs, traces, to debug a problem.
  • Take an active role in influencing our roadmap and your own career objectives
  • Work with your team to deliver new features, then use the results to iterate and improve.
  • Drive projects from initial idea all the way to operations once it is in the hands of customers
  • Embrace our open-source culture and contribute to other projects that may not directly fall within your team’s scope
  • Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability
  • Be a part of your team’s follow-the-sun on-call rotations and take ownership of the services you’re running
  • Support other team members, participate in design discussions and collaborate with the team
  • Learn new skills by gaining a deeper understanding of our cloud product and our customers and getting to know the codebase of a large distributed system

Backend DevelopmentSoftware DevelopmentCloud ComputingKubernetesApache KafkaGoGrafanaPrometheusCommunication SkillsCI/CDProblem SolvingCustomer serviceRESTful APIsLinuxDevOpsMicroservices

Posted 3 days ago
Apply
Apply

📍 UK, Sweden, Spain, Germany

🧭 Full-Time

💸 94208.0 - 117760.0 EUR per year

🔍 Software Development

  • Solid experience with at least one programming language. We use Go, but if you have familiarity with Python, C, C++, Rust or similar then that translates well
  • Some experience with delivering projects from gathering requirements, brainstorming ideas all the way to shipping a product to the customer’s hands in a self-driven way
  • Some experience with developing software that runs in the Cloud
  • or some experience with systems engineering
  • Some experience with being on-call and following the DevOps model
  • Experience writing clean, robust, and performant software that is easily maintained by others
  • Familiarity with observability systems, know when to use metrics, logs, traces, to debug a problem.
  • Take an active role in influencing our roadmap and your own career objectives
  • Work with your team to deliver new features, then use the results to iterate and improve.
  • Drive projects from initial idea all the way to operations once it is in the hands of customers
  • Embrace our open-source culture and contribute to other projects that may not directly fall within your team’s scope
  • Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability
  • Be a part of your team’s follow-the-sun on-call rotations and take ownership of the services you’re running
  • Support other team members, participate in design discussions and collaborate with the team
  • Learn new skills by gaining a deeper understanding of our cloud product and our customers and getting to know the codebase of a large distributed system

Backend DevelopmentDockerSoftware DevelopmentCloud ComputingKubernetesAlgorithmsData StructuresGoGrafanaPrometheusREST APICI/CDProblem SolvingLinuxDevOpsMicroservicesScripting

Posted 3 days ago
Apply
Apply

📍 Poland

🧭 Contract

🔍 Software Development

🏢 Company: Nearform

  • Proficient communication in English (oral and written)
  • Experience delivering at an enterprise level and a remote agile environment
  • Experience of one or more of the following cloud platforms, AWS/Azure/GCP
  • Experience with observability tooling (Grafana, Datadog, Prometheus etc)
  • Knowledge of observability, monitoring, logging, tracing and dashboard definition/Integrations
  • Experience working with containers and container orchestration
  • Experience with infrastructure as code technology
  • Experience with CI and building CD pipelines
  • Data storage experience with RDBMS and NoSQL technologies
  • Solid understanding of observability practices across the stack
  • Strong understanding of security best practices across CI/CD pipelines, cloud infrastructure, and microservices
  • Ability to clearly articulate technical concepts to both technical and non-technical audiences
  • Exceptional communication and collaboration skills to work with stakeholders, feeling comfortable in client-facing roles where you will drive technical discussions and advise on best practices and approaches
  • Develop infrastructure to support cloud-based applications
  • Create deployment architect and continuous delivery pipelines
  • Design high-availability approaches
  • Implement monitoring architecture (dashboards, alerts, escalations)

AWSDockerGCPJenkinsKubernetesAzureGrafanaPrometheusRDBMSNosqlCI/CDLinuxTerraformMicroservicesAnsibleScriptingEnglish communication

Posted 3 days ago
Apply
Shown 10 out of 279

Ready to Start Your Remote Journey?

Apply to 5 jobs per day for free, or get unlimited applications with a subscription starting at €5/week.

Why Remote DevOps Engineer Jobs Are Becoming More Popular

The remote work from home is increasingly in demand among computer and IT professionals for several reasons:

  • Flexibility in time and location.
  • Collaboration with international companies.
  • Higher salary levels.
  • Lack of ties to the office.

Remote work opens up new opportunities for specialists, allowing them to go beyond geographical limits and build a successful remote IT career. This employment model is transforming traditional work approaches, making it more convenient, efficient, and accessible for professionals worldwide.

Why do Job Seekers Choose Remoote.app?

Our platform offers convenient conditions for finding remote IT jobs from home:

  • localized search — filter job listings based on your country of residence;
  • AI-powered job processing — artificial intelligence analyzes thousands of listings, highlighting key details so you don’t have to read long descriptions;
  • advanced filters — sort vacancies by skills, experience, qualification level, and work model;
  • regular database updates — we monitor job relevance and remove outdated listings;
  • personalized notifications — get tailored job offers directly via email or Telegram;
  • resume builder — create a professional VC with ease using our customizable templates and AI-powered suggestions;
  • data security — modern encryption technologies ensure the protection of your personal information.

Join our platform and find your dream job today! We offer flexible pricing — up to 5 applications per day for free, with weekly, monthly, and yearly subscription plans for extended access.