Remote Jobs in the UK

Kubernetes
1,947 jobs found. to receive daily emails with new job openings that match your preferences.
1,947 jobs found.

Set alerts to receive daily emails with new job openings that match your preferences.

Apply
🔥 Senior DevOps
Posted 24 minutes ago

📍 Worldwide, Cyprus, Malta, USA, Armenia, Georgia, Kazakhstan, Montenegro, Poland, Latvia, Serbia, Spain, Portugal, UAE, Israel, Turkey, Thailand, Indonesia, Japan, Hong Kong, Australia

🧭 Full-Time

🔍 Software Development

🏢 Company: Social Discovery Group👥 501-1000Venture CapitalFinanceInformation Technology

  • Expertise in Kubernetes on-prem, GPU in Kubernetes (cloud or on-prem)
  • Proficiency in Python or Go
  • Strong understanding of SRE and DevOps principles
  • Develop and enhance DevOps practices within the company, focusing on automation of development and deployment processes
  • Design, implement, and maintain a high-load, fault-tolerant ML system based on Kubernetes on bare metal
  • Investigate and resolve incidents, ensuring system stability and performance

AWSPythonGCPKubernetesGoCI/CDDevOpsTerraformAnsible

Posted 24 minutes ago
Apply
Apply

📍 United Kingdom

🧭 Full-Time

🏢 Company: Nearform

  • Strong experience managing and automating cloud-based infrastructure on GCP.
  • Proficiency in infrastructure as code (IaC) using Terraform or CloudFormation.
  • Hands-on experience with containerization and orchestration tools, ideally Kubernetes (K8s).
  • Experience developing and maintaining CI/CD pipelines, ideally with GitHub Actions or Jenkins.
  • Scripting and automation expertise, preferably with Python.
  • Experience with observability tools and monitoring frameworks for system performance optimization.
  • Solid understanding of security best practices across cloud infrastructure, CI/CD pipelines, and microservices.
  • Excellent communication and collaboration skills.
  • Professional proficiency in English.
  • Developing and managing cloud-based infrastructure on GCP.
  • Creating and maintaining deployment architectures and continuous delivery pipelines.
  • Designing high-availability and fault-tolerant solutions for applications.
  • Implementing monitoring frameworks, including dashboards, alerts, and escalation processes.
  • Automating infrastructure provisioning and management using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
  • Managing containerized applications and orchestrating deployments using Kubernetes.
  • Ensuring security best practices are applied across CI/CD pipelines, cloud infrastructure, and microservices.
  • Optimizing system performance and scalability through observability and proactive monitoring.
  • Collaborating with development teams to streamline deployment workflows and improve DevOps processes.
  • Advising clients on best practices for cloud infrastructure, deployment automation, and system security.
  • Engaging in technical discussions with stakeholders and supporting project execution to ensure timely delivery.
  • Assisting with the analysis of client requirements.
  • Working with and supporting Technical Leaders in project execution and timely delivery.
  • Collaborating with client teams.

PythonAgileCloud ComputingGCPJenkinsKubernetesCI/CDRESTful APIsDevOpsTerraformMicroservicesExcellent communication skillsScriptingEnglish communication

Posted about 1 hour ago
Apply
Apply

📍 Brazil, the U.S., and Canada

🧭 Full-Time

🔍 Payments

  • Bachelor’s or Master’s degree in CS/Engineering/Data-Science or other technical disciplines.
  • Solid experience in DS/ML engineering.
  • Proficiency in programming languages such as Python, Scala, or Java.
  • Hands-on experience in implementing batch and real-time streaming pipelines, using SQL and NoSQL database solutions
  • Familiarity with monitoring tools for data pipelines, streaming systems, and model performance.
  • Experience in AWS cloud services (Sagemaker, EC2, EMR, ECS/EKS, RDS, etc.).
  • Experience with CI/CD pipelines, infrastructure-as-code tools (e.g., Terraform, CloudFormation), and MLOps platforms like MLflow.
  • Experience with Machine Learning modeling, notably tree-based and boosting models supervised learning for imbalanced target scenarios.
  • Experience with Online Inference, APIs, and services that respond under tight time constraints.
  • Proficiency in English.
  • Design the data-architecture flow for the efficient implementation of real-time model endpoints and/or batch solutions.
  • Engineer domain-specific features that can enhance model performance and robustness.
  • Build pipelines to deploy machine learning models in production with a focus on scalability and efficiency, and participate in and enforce the release management process for models and rules.
  • Implement systems to monitor model performance, endpoints/feature health, and other business metrics; Create model-retraining pipelines to boost performance, based on monitoring metrics; Model recalibration.
  • Design and implement scalable architectures to support real-time/batch solutions; Optimize algorithms and workflows for latency, throughput, and resource efficiency; Ensure systems adhere to company standards for reliability and security.
  • Conduct research and prototypes to explore novel approaches in ML engineering for addressing emerging risk/fraud patterns.
  • Partner with fraud analysts, risk managers, and product teams to translate business requirements into ML solutions.

AWSBackend DevelopmentDockerPythonSQLAmazon RDSAWS EKSFrontend DevelopmentJavaKafkaKubernetesMachine LearningMLFlowAirflowAlgorithmsData engineeringData scienceREST APINosqlPandasSparkCI/CDTerraformScalaData modelingEnglish communication

Posted about 1 hour ago
Apply
Apply

📍 CAN

💸 178000.0 - 228000.0 CAN per year

🔍 Software Development

🏢 Company: Affirm👥 1001-5000💰 Post-IPO Equity about 4 years ago🫂 Last layoff about 2 years agoLendingFinancial ServicesPaymentsFinTech

  • You have 8+ years of experience designing, developing and launching backend systems at scale using languages like Python or Kotlin.
  • You have an extensive track record of developing highly available distributed systems using technologies like AWS, MySQL, Spark and Kubernetes.
  • You have experience delivering major features, system components or deprecating existing functionality in a system through the definition of a technical and execution plan. You write high quality code that is easily understood and used by others.
  • You thrive in ambiguity, and are comfortable moving from low level language idioms all the way to the architecture of large systems to understand how they work.
  • Your growth and impact trajectory demonstrates that you have mastered gathering and iterating on feedback from your engineering and cross-functional peers.
  • You have strong verbal and written communication skills that support effective collaboration with our global engineering team.
  • This position requires either equivalent practical experience or a Bachelor’s degree in a related field.
  • You will be responsible for setting technical strategy for your team on a year-long time scale, and help your team tie it together with critical, business-impacting projects.
  • You will collaborate across teams in the product development lifecycle by collaborating with product management, design & analytics to ensure technical sustainability, risks and trade-offs are well understood and managed.
  • You will act as a force-multiplier for your team through your definition and advocacy of technical solutions and operational processes.
  • You take ownership of your team’s operations and availability by ensuring you have the right monitoring, triage rotations, playbooks, policies, testing and alerting in place to support “keep the lights on” & on-call efforts.
  • You will foster a culture of quality and ownership on your team by setting code review and design standards for your team, and advocating for them beyond your team through your writing and tech talks.
  • You will help develop talent on your team by providing feedback and guidance, and leading by example.

AWSBackend DevelopmentPythonKotlinKubernetesMySQLSparkSoftware Engineering

Posted about 2 hours ago
Apply
Apply

📍 Canada

💸 206000.0 - 256000.0 CAD per year

🏢 Company: Affirm👥 1001-5000💰 Post-IPO Equity about 4 years ago🫂 Last layoff about 2 years agoLendingFinancial ServicesPaymentsFinTech

  • 10+ years of experience in managing multiple diverse and inclusive teams and delivering large cross-functional technical programs.
  • Expertise in managing large-scale, geographically distributed compute and data processing systems.
  • Expertise in scaling technologies like Kubernetes, Redis, MySQL, and Kafka, in cloud providers like AWS.
  • Capable of mentorship, cross-functional program execution, and individual contribution.
  • Deep experience in cloud infrastructure and a passion for leading technical teams and contributing to Open Source solutions.
  • Develop frameworks, systems, and tools to create a culture of ownership and accountability for infrastructure costs.
  • Collaborate with Finance and Engineering leadership to define and meet ambitious financial targets, ensuring Affirm's scalable and efficient growth.
  • Lead technical decisions, projects, and roadmaps within the Infrastructure team, shaping Affirm’s strategy for managing our multi-million dollar annual spend.
  • Drive business and engineering metrics while promoting a culture of reliability, security, and productivity.
  • Lead a team of engineers with empathy while fostering a high-performance, ownership-driven & inclusive culture
  • Collaborate with tech leads, program managers, and product managers on tools, architecture, planning, and delivery of multiple concurrent projects.
  • Work across the engineering organization and with internal and external partners.
  • Provide leadership and growth opportunities to team members, mentor engineers, recruit, and represent Affirm hiring brands.
  • Guide, tutor, and aid in the professional growth of junior and senior engineers within the team.

AWSBackend DevelopmentLeadershipProject ManagementCloud ComputingKafkaKubernetesMySQLCross-functional Team LeadershipFinancial ManagementRedisCommunication SkillsCollaborationCI/CDProblem SolvingMentoringLinuxDevOpsWritten communicationExcellent communication skillsVerbal communicationTeam managementStakeholder managementStrategic thinkingSoftware EngineeringBudget management

Posted about 2 hours ago
Apply
Apply

📍 Estonia, Romania, Poland, Hungary, Portugal, Ukraine

🧭 Full-Time

🔍 Software Development

🏢 Company: trimblecareers

  • Strong proficiency in Python programming language
  • Minimum 6 months experience working with GenAI applications in production environment
  • Experience with cloud platforms (e.g Azure, AWS)
  • Knowledge of microservices architecture and containerization technologies (e.g., Docker, Kubernetes)
  • Experience with RESTful APIs and API design principles
  • Understanding of database management systems (e.g., NoSQL, PostgreSQL)
  • Proficiency with Git for version control
  • Version Control Systems (GitHub, managing code changes and collaborating with other team members, maintaining a history of code revisions)
  • Continuous Integration/Continuous Deployment (tools like GitHub Actions, integrating the automation into CI/CD pipelines)
  • Problem-Solving and Analytical Thinking (designing efficient automation solutions/frameworks, ability to identify and troubleshoot complex software defects)
  • Agile Methodologies (Scrum or Kanban, planning for iterative development cycles, manage frequent releases)
  • Risk Assessment and Mitigation (ability to identify and mitigate risks related to software quality, measure how well risks are documented and managed throughout the project)
  • Leadership and Mentoring (guiding and mentoring other engineers, providing technical expertise)
  • Architect, implement, and optimize Generative AI applications leveraging Large Language Models (LLMs).
  • Work with RAG frameworks
  • Keep track of latest research
  • Translate high-level product requirements into scalable, modular software designs that adhere to modern design principles, microservices architecture, and cloud-native best practices.
  • Develop comprehensive test suites (unit, integration, and end-to-end) to ensure code quality and ensure that automated tests cover a high percentage of the codebase.
  • Collaborate with cross-functional stakeholders, including business analysts, product managers, and global development teams.
  • Mentor junior engineers, guiding them through LLM-based solution design, implementation, and deployment.
  • Work in an agile environment, planning and executing sprints, meeting strict deadlines, and efficiently handling production issues across multiple time zones.
  • Employ CI/CD pipelines (GitHub Actions or similar) and maintain code versioning in GitHub for seamless, frequent releases.

AWSBackend DevelopmentDockerLeadershipPostgreSQLPythonAgileCloud ComputingGitKubernetesAPI testingAzureNosqlCommunication SkillsCI/CDProblem SolvingRESTful APIsMentoringMicroservicesTeamworkRisk ManagementSoftware EngineeringData analytics

Posted about 2 hours ago
Apply
Apply

📍 Sri Lanka, India, Latvia, Ukraine, Romania

🔍 Software Development

🏢 Company: First Focus👥 11-50Education

  • Strong automation scripting experience with one or more of the following - Bash, Python, Powershell, PHP etc
  • Strong database experience in SQL (MS-SQL, MySQL, NoSQL)
  • Automation scripting using one or more of the following technologies - Bash, Python, PHP and Powershell
  • Configuration and maintenance of API orchestration and orchestration platforms
  • Building complex automations that interact directly with MS-SQL, MySQL, NoSQL and APIs
  • Creating and configuring AI solutions

AWSPHPPythonSQLBashKubernetesMySQLAPI testingNosqlCI/CDRESTful APIsDevOpsScripting

Posted about 2 hours ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 132000.0 - 172000.0 USD per year

🔍 Software Development

🏢 Company: Infinite Reality👥 101-250💰 $350,000,000 9 months agoMedia and EntertainmentWeb3Metaverse

  • Extensive DevOps & Security Experience: You bring 5+ years of hands-on experience in DevOps and security monitoring, with a strong focus on logging, monitoring, and incident response. Your background allows you to design, implement, and optimize observability frameworks that enhance system security and performance.
  • Incident Management Expertise: You have a proven track record of managing both security and operational incidents. From detection through resolution, you are adept at coordinating incident response efforts, leading post-incident reviews, and driving improvements to reduce future risks and downtime.
  • Scripting & Automation Skills: You are proficient in scripting languages like Python or Bash, and are passionate about automating repetitive tasks to increase operational efficiency. Your automation solutions help streamline workflows, improve response times, and reduce manual intervention.
  • Proficiency with Logging & Monitoring Tools: You have deep experience with tools like the ELK Stack, Splunk, Prometheus, and other observability platforms. Your expertise enables you to identify patterns, vulnerabilities, and trends in system health and security, empowering teams to act proactively.
  • Collaboration & Cross-Functional Teamwork: You excel at working across teams, engineering, IT, and security, helping foster a culture of observability and continuous improvement. Your ability to communicate technical concepts clearly ensures alignment across stakeholders with varying levels of technical expertise.
  • Strong Problem-Solving Skills: You thrive on solving complex issues, whether it’s a security breach or a system performance bottleneck. Your analytical mindset and experience with root cause analysis ensure that you can resolve problems efficiently and implement lasting solutions.
  • Design & Optimize Logging and Monitoring Systems: Lead the design and implementation of advanced logging and monitoring architectures, ensuring that system performance, security threats, and infrastructure health are captured in real-time. You will drive best practices in observability to ensure our systems are proactive, secure, and resilient.
  • Incident Response & Analysis: Own the full incident management lifecycle—from detection to resolution. Respond to both security and operational incidents, working across teams to minimize impact and quickly resolve issues. Lead post-incident analysis, identify root causes, and drive improvements to prevent future occurrences.
  • Develop Automation Solutions: Build and implement automation workflows to streamline alerting, incident detection, and response processes. You’ll reduce manual intervention and optimize workflows, helping teams respond more efficiently to system events and improve operational efficiency.
  • Collaborate with Cross-Functional Teams: Work closely with engineering, security, and operations teams to foster a culture of observability. Share best practices, establish clear protocols for incident detection and resolution, and ensure alignment across teams to improve overall system reliability.
  • Monitor Security & Operational Alerts: Establish and fine-tune alerting rules to ensure actionable, precise, and timely notifications for security and system performance events. You’ll ensure that alerts are well-defined and routed to the right teams, minimizing response time to critical issues.
  • Leverage Data for Continuous Improvement: Analyze logs and metrics to identify trends, anomalies, and potential security vulnerabilities. You’ll generate data-driven insights that help improve system health, performance, and security posture, contributing to ongoing process improvements.
  • Mentor and Coach: Provide guidance to junior engineers and colleagues, promoting best practices in monitoring, incident management, and automation. Lead by example to elevate the technical capabilities of the team and drive knowledge-sharing across the organization.

AWSDockerPythonBashCloud ComputingCybersecurityKubernetesMicrosoft AzureAPI testingAzureGrafanaPrometheusREST APICommunication SkillsAnalytical SkillsCollaborationCI/CDProblem SolvingRESTful APIsLinuxDevOpsTerraformComplianceAnsibleScripting

Posted about 3 hours ago
Apply
Apply
🔥 Cloud Security Engineer
Posted about 3 hours ago

📍 UK

🔍 Software Development

🏢 Company: Everway

  • 3+ years of experience in cloud security, with a strong focus on AWS and Azure.
  • Deep understanding of AWS security services, including IAM, Security Hub, GuardDuty, KMS, WAF, and S3 bucket ACLs and encryption.
  • Strong knowledge of AWS networking security, including VPCs, security groups, VPNs, and private link services.
  • Strong knowledge of Azure Defender, Sentinel, and Security Center.
  • Hands-on experience securing serverless architecture (e.g., AWS Lambda, API gateway) and containerized environments (e.g., Kubernetes).
  • Experience with cloud security monitoring, SIEM, and incident response.
  • Architect, implement, and manage security controls in AWS and Azure environments to protect cloud infrastructure, workloads, and data.
  • Conduct threat modeling and risk analysis to identify and remediate vulnerabilities.
  • Securely configure and audit cloud IAM policies, role-based access control (RBAC), and implement least-priviledge principles.
  • Familiar with cloud native compute, storage and security services, such as AWS Security hub, GuardDuty, CloudTrail, and Azure Monitor.
  • Work closely with DevOps and development teams to integrate security into CI/CD pipelines and cloud-native applications.
  • Investigate and respond to cloud security incidents, misconfigurations, and compliance gaps.

AWSCloud ComputingCybersecurityKubernetesAzureCI/CDLinuxDevOpsTerraformNetworking

Posted about 3 hours ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 190000.0 - 230000.0 USD per year

🔍 Software Development

🏢 Company: Dagster Labs👥 11-50💰 $33,000,000 Series B almost 2 years agoCloud Data ServicesBig DataSoftware

  • 5+ years of relevant software development experience
  • Expertise in the full software development lifecycle, from scoping and planning to delivery and iteration
  • Strong command of software system design, including scalability, third party integrations, and API design
  • Fluent in Python
  • Strong written and oral communication skills
  • Experience in a high-functioning engineering organization working on large-scale distributed systems or B2B SaaS applications
  • Proven effectiveness at contributing to and executing as part of a team
  • Experience with using or supporting tools in the Modern Data Stack
  • Experience building and scaling services built on Amazon Web Services, Kubernetes & Postgres
  • Develop and optimize core backend systems and infrastructure components.
  • Enhance efficiency, scalability, and stability of critical system resources through analysis and refinement.
  • Partner with cross-functional teams to align on product development needs and deliver impactful solutions.
  • Review designs and code to maintain high standards of quality and performance across the team.

AWSBackend DevelopmentLeadershipPostgreSQLPythonSoftware DevelopmentSQLFull Stack DevelopmentKubernetesSoftware ArchitectureAlgorithmsAPI testingData engineeringData StructuresREST APICommunication SkillsCollaborationCI/CDAgile methodologiesRESTful APIsWritten communicationTeamworkData modelingSoftware Engineering

Posted about 4 hours ago
Apply
Shown 10 out of 1947

Ready to Start Your Remote Journey?

Apply to 5 jobs per day for free, or get unlimited applications with a subscription starting at €5/week.