Apply

Senior DevOps Engineer

Posted 2 days agoViewed

View full description

πŸ’Ž Seniority level: Senior, 5 years

🏒 Company: Marsie

⏳ Experience: 5 years

Requirements:
  • 5 years of experience in DevOps or related roles.
  • Proficiency in AWS services and best practices.
  • Strong experience with Kubernetes and container orchestration.
  • Hands-on experience with CI/CD tools and practices.
  • Expertise in Infrastructure as Code, particularly with Terraform.
  • Solid scripting skills in Python and Shell.
  • Familiarity with GitOps principles and tools.
Responsibilities:
  • Design, implement, and manage AWS infrastructure, including VPCs, subnets, NAT Gateways, Route53, TLS/Certificates, Security Groups, EC2, Load Balancers, EIPs, IAM, RDS, OpenSearch/Elasticsearch, S3, ECR, and EKS.
  • Develop and maintain Kubernetes clusters, preferably using EKS, ensuring high availability and scalability.
  • Utilize tools like Helm, FluxCD (or ArgoCD), Kustomize, and GitOps practices to manage deployments.
  • Implement and manage CI/CD pipelines using GitHub Actions.
  • Employ Infrastructure as Code (IaC) methodologies for consistent and repeatable infrastructure provisioning.
  • Develop automation scripts using Python and Shell scripting to streamline operations.
  • Monitor system performance, troubleshoot issues, and ensure system reliability and security.
Apply

Related Jobs

Apply

🏒 Company: TransifexπŸ‘₯ 51-100πŸ’° Debt Financing almost 3 years agoCrowdsourcingDeveloper ToolsSaaSInformation TechnologyTranslation Service

Posted 2 days ago
Apply
Apply

🧭 Full-Time

🏒 Company: EncoraπŸ‘₯ 10001-10001πŸ’° $200,000,000 Private almost 6 years agoBig DataCloud ComputingSoftware

  • Minimum 7 years of experience in DevOps, Site Reliability Engineering (SRE), or a related discipline.
  • Proven experience with AWS and Terraform.
  • Strong experience with CI/CD tools, especially GitHub Actions and GitLab CI/CD.
  • Experience managing and optimizing containerized applications, specifically with Kubernetes.
  • Strong problem-solving and root cause analysis skills.
  • Effective communication and collaboration skills across cross-functional teams.
  • Experience leading application upgrades and platform migrations.
  • Proficiency with monitoring/logging tools like New Relic and Splunk.
  • Hands-on experience building CI/CD pipelines from scratch and maintaining them in GitHub Actions.
  • Ability to work on secure, compliant, and cost-efficient cloud environments.
  • Design, implement, and maintain infrastructure using Terraform following Infrastructure as Code (IaC) best practices.
  • Develop and manage scalable, secure, and cost-efficient solutions on AWS.
  • Build, optimize, and maintain CI/CD pipelines using GitHub Actions and GitLab to streamline deployment workflows.
  • Implement monitoring, logging, and alerting tools such as New Relic and Splunk to ensure high system availability and performance.
  • Architect advanced Kubernetes deployment strategies to improve performance and reliability for key applications.
  • Enable real-time data streaming and event-driven architecture through Kafka.
  • Design and enforce cloud networking and security best practices.
  • Lead platform upgrades, migrations, and infrastructure updates with minimal disruption.
  • Collaborate closely with development and operations teams to ensure reliable and scalable solutions.
Posted 3 days ago
Apply
Apply

πŸ“ Brazil

🧭 Contract

πŸ” Software Development

🏒 Company: Nearform

  • Experience with cloud platforms (AWS/Azure/GCP)
  • Experience with observability tooling (Grafana, Datadog, Prometheus etc)
  • Knowledge of observability, monitoring, logging, tracing, and dashboard definition/Integrations
  • Experience working with containers and container orchestration
  • Experience with infrastructure as code technology
  • Experience with CI and building CD pipelines
  • Data storage experience with RDBMS and NoSQL technologies
  • Solid understanding of observability practices across the stack
  • Strong understanding of security best practices across CI/CD pipelines, cloud infrastructure, and microservices
  • Ability to clearly articulate technical concepts to both technical and non-technical audiences
  • Exceptional communication and collaboration skills
  • Develop infrastructure to support cloud-based applications
  • Create deployment architecture and continuous delivery pipelines
  • Design high-availability approaches
  • Implement monitoring architecture (dashboards, alerts, escalations)

AWSDockerPythonSQLBashElasticSearchExpress.jsGCPJenkinsKubernetesAzureGrafanaPrometheusRDBMSNosqlCommunication SkillsCollaborationCI/CDProblem SolvingAgile methodologiesRESTful APIsLinuxDevOpsTerraformMicroservicesJSONAnsibleNodeJSScriptingEnglish communication

Posted 3 days ago
Apply
Apply

πŸ“ Philippines

πŸ’Έ 2500.0 - 3000.0 AUD per month

πŸ” Financial

🏒 Company: Hunt St

  • 5+ years of experience in DevOps, Systems Engineering, or a related field.
  • Linux native, if you do not use Linux as your preferred OS this may not be the role for you
  • Strong experience with the following: Linux administration, Bash scripting, Kubernetes, Docker, AWS
  • Good knowledge of networking, DNS, load balancing and CDN's
  • Maintaining and improving the resiliency of our core applications and our hybrid infrastructure platform
  • Providing continued improvement to the platform infrastructure through automation and standardisation
  • Providing complementary skills and expertise to the teams and continuously learning from peers and seniors.
  • Ensuring that all of our core services are up to date and security patched
  • Working closely with development teams to ensure applications are configured for security, efficiency and scalability.

AWSDockerPythonBashJenkinsKubernetesRabbitmqPrometheusRedisCI/CDLinuxDevOpsTerraformNetworkingAnsibleScripting

Posted 4 days ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 150000.0 - 180000.0 USD per year

πŸ” Healthcare

🏒 Company: Oshi HealthπŸ‘₯ 51-100πŸ’° $60,000,000 Series C 7 months agoMedicalMobileHealth Care

  • Minimum of 4 years of devops experience.
  • Carry one or more AWS (or related cloud platform) certifications
  • Experience with software development best practices, including version control, testing, security, build pipelines, monitoring, release management, documentation, etc.
  • Created Infrastructure as Code projects to configure environments
  • Excellent at problem solving, and motivated to produce the best possible result
  • You take ownership of the success of the system, assisting others and showing the judgement to seek help when you need it.
  • You have worked in a fast-paced startup environment, or strongly want to make the switch to one.
  • If you come from an environment with effective mature processes, you are ready to evangelize for them
  • Experience in healthcare, especially telehealth, a plus
  • Build and maintain our AWS infrastructure securely and efficiently
  • Work with our engineering teams building deployment pipelines to integrate testing, scanning, build, and deploy functionalities for secure APIs, web applications, and mobile applications.
  • Configure monitoring and alerting systems to proactively monitor our systems.
  • Respond to production issues
  • Optimize our AWS infrastructure to ensure Oshi is utilizing our resources in the best way possible
  • Collaborate with Technology and Product leadership estimate infrastructure effort through various stages of product conception
  • Promote a test-first environment where quality gates ensure only the most stable code gets deployed to production
  • Recurring on-call shifts supporting the operations of the care coordination and clinical teams to ensure the platform is running smoothly
  • Collaborate with our Security Team for changes, monitoring and implementations

AWSCloud ComputingKubernetesRelease ManagementCommunication SkillsCI/CDProblem SolvingLinuxDevOpsTerraformAnsibleScripting

Posted 5 days ago
Apply
Apply

πŸ“ United States

πŸ” Software Development

🏒 Company: ScreenPalπŸ‘₯ 11-50InternetEducationMessagingTrainingVideo EditingSoftware

  • Significant experience managing and maintaining large-scale web services using AWS
  • Proficiency with AWS services such as Elastic Beanstalk, EC2, RDS, S3, VPCs, Security Groups, and Load Balancers
  • Hands-on experience deploying and configuring PHP, PHP-FPM, Redis, and MySQL
  • Experience using Datadog to monitor a production environment
  • Strong scripting skills in Bash, Python, PHP, or similar languages
  • Familiarity with build automation and CI/CD deployment tools
  • Experience with IT compliance, security assessments, and risk management requirements
  • Solid understanding of networking protocols (including SSL), firewalls, and routing
  • Demonstrated ability to thrive in fast-paced, team-oriented environments
  • Advanced critical thinking and problem-solving capabilities
  • Passion for delivering consumer-focused solutions and driving customer success
  • Proven ability to work independently with minimal supervision
  • Familiarity with Jenkins or similar continuous integration/continuous delivery tools
  • Experience conducting or working with penetration testing organizations
  • Expand, improve and maintain our web applications and services
  • Implement and maintain build and deployment processes, including developing tooling to support automation
  • Contribute to architecture, design, and development of our video hosting services
  • Ensure the highest possible availability services
  • Implement and enforce security standards

AWSPHPPythonAmazon RDSBashJenkinsMySQLRedisCI/CDDevOpsNetworkingScripting

Posted 6 days ago
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ” Construction

  • 8+ years of experience with at least one of the following programming languages: JavaScript, Node.js, Golang, Python, or C#.
  • 5+ years of hands-on experience configuring and managing AWS environments.
  • 5+ years of experience with Linux systems, including administration, operations, and networking.
  • Extensive experience with Docker and Kubernetes in production environments.
  • Strong experience managing distributed messaging systems, preferably RabbitMQ.
  • Deep experience supporting and optimizing both SQL and NoSQL databases, ideally PostgreSQL and DynamoDB.
  • Experience deploying, managing, and scaling microservices in production environments, preferably using Node.js.
  • Experience with build automation tools, preferably GitHub Actions.
  • Experience with configuration management and workflow automation tools such as Ansible, Puppet, or Terraform.
  • Strong understanding of security best practices, DevSecOps principles, and compliance automation.
  • Hands-on experience in performance tuning, capacity planning, and cost optimization at the infrastructure level.
  • A Bachelor’s Degree in Computer Science or equivalent is required.
  • Design scalable, redundant, cost-effective, and fault-tolerant infrastructure for our production systems.
  • Proactively manage infrastructure costs and make optimization recommendations.
  • Help migrate existing applications into Kubernetes and support container orchestration strategies.
  • Build and maintain robust CI/CD pipelines and operational tooling to support rapid development and deployment cycles.
  • Write high-quality JavaScript code and Bash scripts to automate infrastructure processes and troubleshoot issues.
  • Design, build, and maintain systems and tools to automate operational processes and enhance developer productivity.
  • Develop comprehensive monitoring and alerting systems to increase visibility across environments.
  • Automate incident response where appropriate and ensure quick recovery from system issues.
  • Understand, implement, and automate security controls, governance processes, and compliance validation.
  • Work closely with Engineering, QA, and Product teams to troubleshoot issues and develop creative solutions.
  • Educate development teams to identify and eliminate performance bottlenecks, single points of failure, and scaling constraints.
  • Stays up to date on relevant technologies, trends, and opportunities to ensure we always use leading techniques and tools appropriate for the company.
  • Actively participate in planning meetings and collaborate with the team to identify and implement the best technical solution for the company.

AWSBackend DevelopmentDockerNode.jsPostgreSQLPythonSQLBashCloud ComputingDynamoDBElasticSearchGitJavascriptKubernetesRabbitmqCI/CDLinuxDevOpsTerraformMicroservicesAnsibleScripting

Posted 7 days ago
Apply
Apply

🧭 Full-Time

🏒 Company: LastPassπŸ‘₯ 501-1000InternetSaaSMobile AppsSoftware

  • Working with AWS services and technologies.
  • Familiarity with widely used DevOps tools and technologies in the industry.
  • Proficiency in at least one typed programming language, such as Go, Python, or JavaScript.
  • Embracing an infrastructure-as-code mindset, utilizing tools like CloudFormation, CDK, or similar.
  • Designing and implementing solutions that adhere to security industry best practices, standards, and compliance frameworks.
  • Delivering solutions, conducting root cause analysis, and recommending, influencing, and implementing operational improvements.
  • Strong problem-solving abilities.
  • A collaborative, team-oriented approach with a positive, can-do attitude.
  • Effective communication skills with various stakeholder groups of diverse backgrounds and technical expertise within LastPass.
  • Proficiency in English with strong written and verbal communication skills.
  • Maintain and update CI/CD pipelines and manage the LastPass environments, adhering to industry and company-wide best practices.
  • Maintain the build and deployment processes, along with related supportive environments.
  • Drive standardization and automation efforts for the build and release processes.
  • Document and communicate improvements to various stakeholders.
  • Work closely with product development teams to ensure quality gates are met for the codebase and that organizational standards for pipelines are followed.
Posted 7 days ago
Apply
Apply

πŸ” Software Development

  • 5+ years of experience managing systems at scale as a DevOps Engineer, Site Reliability Engineer, or Platform Engineer
  • Excellent technical analytical skills with the ability to implement DDOS mitigation, troubleshoot complex problems, analyze system bottlenecks, and implement effective solutions, from frontend through backend systems, sometimes during production degradation or outage for a high traffic site
  • Exceptional command line Linux skills, with proficiency in Bash and Python for investigation of server and services issues, scripting, and automation
  • In-depth knowledge of AWS services, infrastructure as code using Terraform, GitOps tools and methodologies, and container orchestration using Docker, Helm, and Kubernetes
  • Experience with setup, administration, and maintenance of sharded MySQL database clusters while maintaining no downtime or data loss
  • Excellent communication skills with fluent English, and the ability to collaborate effectively across teams while articulating technical concepts to non-technical stakeholders
  • The ability to get up to speed on systems, make decisions, be flexible, and execute independently with attention to detail for production systems
  • Architect and maintain a highly available infrastructure with a focus on proactive and reactive DDOS mitigation, autoscaling, self-healing, site performance, and cost optimization
  • Participate in a 24/7 on-call rotation, responding swiftly to outages or performance issues, and focus on less urgent alerts during normal work hours
  • Maintain and develop a developer environment and CI/CD pipelines in parity with production systems, for seamless testing and release of changes
  • Automate infrastructure provisioning and management using configuration management tools, complete with tests and documentation
  • Optimize and support sharded MySQL databases for efficient and reliable data handling amidst growing data reads and writes
  • Regularly update system components to avoid security issues and ensure up-to-date technology
Posted 8 days ago
Apply
Apply

πŸ” IGaming

🏒 Company: The Mill AdventureπŸ‘₯ 11-50InternetGamingInformation TechnologySoftware

  • Knowledge of AWS, CloudflaredServerless, and Node.js ecosystem.
  • Hands-on experience with SQL and NoSQL data stores (DynamoDB and MongoDB)
  • Previous experience working with some of the: AWS (CDK, Cloudformation, CloudWatch, Lambda, Kinesis, DynamoDB, etc.), Serverless framework, Cloudflare, MongoDB, GitLab/GitHub.
  • Coding skills (ideally in JavaScript or TypeScript).
  • Maintain core components within our infrastructure in particular AWS and Cloudflare.
  • Investigate and resolve issues related to the above-mentioned infrastructure.
  • Familiarize and improve our infrastructure monitoring and other components (such as Sentry) to help reduce the incidence of bugs in production.
  • Assist in database administration tasks (mainly MongoDB) and in improving this component to ensure guaranteed scalability and reliability.
  • Participate in code changes related to infrastructure components (TypeScript)
  • Troubleshoot issues and improve our CI/CD platform including improving pipelines, runner setup and also improving release tooling.
  • Maintain gaming platform infrastructure as code.
  • Participated in the company's disaster recovery processes and procedures.
  • Liaising with the platform developers to assist in investigating issues and eliminating potential bottlenecks.
  • Assist with data engineering tasks such as helping with improving data pipelines.
  • Automate processes as requested by Technical Compliance
  • Familiarize and keep up to date with best practices, technologies and tools related to our technology stack.
  • Help and take ownership of upcoming projects related to the technical infrastructure in line with the company's future plans.
  • Join the on-call team to help with addressing ongoing production incidents.
Posted 9 days ago
Apply

Related Articles

Posted about 1 month ago

How to Overcome Burnout While Working Remotely: Practical Strategies for Recovery

Burnout is a silent epidemic among remote workers. The blurred lines between work and home life, coupled with the pressure to always be β€œon,” can leave even the most dedicated professionals feeling drained. But burnout doesn’t have to define your remote work experience. With the right strategies, you can recover, recharge, and prevent future episodes. Here’s how.



Posted 6 days ago

Top 10 Skills to Become a Successful Remote Worker by 2025

Remote work is here to stay, and by 2025, the competition for remote jobs will be tougher than ever. To stand out, you need more than just basic skills. Employers want people who can adapt, communicate well, and stay productive without constant supervision. Here’s a simple guide to the top 10 skills that will make you a top candidate for remote jobs in the near future.

Posted 9 months ago

Google is gearing up to expand its remote job listings, promising more opportunities across various departments and regions. Find out how this move can benefit job seekers and impact the market.

Posted 10 months ago

Read about the recent updates in remote work policies by major companies, the latest tools enhancing remote work productivity, and predictive statistics for remote work in 2024.

Posted 10 months ago

In-depth analysis of the tech layoffs in 2024, covering the reasons behind the layoffs, comparisons to previous years, immediate impacts, statistics, and the influence on the remote job market. Discover how startups and large tech companies are adapting, and learn strategies for navigating the new dynamics of the remote job market.