Apply

Infrastructure Engineer

Posted 8 days agoViewed

View full description

💎 Seniority level: Middle, 3-5 years

📍 Location: Worldwide

💸 Salary: 145000.0 - 160000.0 USD per year

🗣️ Languages: English

⏳ Experience: 3-5 years

🪄 Skills: Cloud ComputingGCPKubernetesCI/CDTerraform

Requirements:
  • Proficiency in Google Cloud Platform (GCP) and container orchestration using Kubernetes/GKE.
  • Experience with infrastructure-as-code tools, particularly Terraform.
  • Familiarity with high-scale edge computing environments using CDNs and designing high-availability systems.
  • Basic understanding of security best practices in cloud infrastructure.
Responsibilities:
  • Assisting in the deployment, configuration, and maintenance of cloud resources on GCP, with a focus on Kubernetes/GKE environments.
  • Utilizing Terraform to automate infrastructure provisioning and ensure consistent, zero-downtime deployments.
  • Supporting the implementation and optimization of edge computing strategies, including managing CDN configurations and designing high-availability systems for global audiences.
  • Applying and advocating for security best practices across our cloud infrastructure to safeguard production systems.
  • Collaborating with team members to monitor system performance, troubleshoot issues, and drive continuous improvement in our infrastructure.
  • Working closely with senior engineers and other teams to integrate new technologies, share knowledge, and adopt modern infrastructure practices.
Apply

Related Jobs

Apply

📍 United States, Brazil, Tel Aviv

🔍 Software Development

🏢 Company: Axonius👥 600-600💰 $200,000,000 Series E about 1 year agoAsset ManagementCloud SecurityInformation TechnologyCyber SecurityNetwork Security

  • 2+ years of experience in building and maintaining automation infrastructure, CI/CD pipelines, and related tools.
  • Strong proficiency in Python and PyTest for automation scripting and framework development.
  • Hands-on experience with Docker and Docker Compose for creating test environments and managing containers.
  • Strong experience with version control systems (Git, GitHub, GitHub Actions).
  • Proficiency in Bash scripting and a good understanding of Linux systems and terminal usage.
  • Experience with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker).
  • Familiarity with automation tools and frameworks such as Selenium, Playwright, or similar UI testing frameworks.
  • Strong problem-solving skills and the ability to troubleshoot complex infrastructure issues.
  • Excellent communication and collaboration skills, with the ability to work with cross-functional teams.
  • Design, build, and maintain the test automation infrastructure that supports automated test execution for UI, API, Unittests and more.
  • Work closely with developers and QA teams to integrate automation tools into the CI/CD pipeline, ensuring automated tests run efficiently and reliably.
  • Manage the underlying infrastructure, including Docker, and cloud-based services, to ensure smooth operation of the test automation framework.
  • Create and optimize tools and scripts that facilitate the development and execution of automated tests (e.g., integrating with GitHub Actions, TeamCity, etc.).
  • Continuously monitor the performance of the test infrastructure, troubleshoot issues, and optimize the environment for speed and scalability.
  • Partner with cross-functional teams to understand testing needs and provide solutions that enable robust, scalable, and efficient test automation environments.
  • Promote and implement best practices for building and maintaining automation infrastructure, ensuring high standards for reliability and maintainability.

AWSDockerPythonBashGitSeleniumCI/CDLinux

Posted 2 days ago
Apply
Apply

📍 U.S

🧭 Full-Time

🏢 Company: LiveKit👥 11-50💰 $22,000,000 Series A about 1 year agoArtificial Intelligence (AI)Real TimeCloud Infrastructure

Experience managing complex multi-region distributed systems running on top of container orchestration systems like Kubernetes.
  • Build and own the foundational infrastructure that our products run upon.
  • Work directly on our products' golang code base to implement SRE related objectives.
  • Take a data driven approach to quantifying system performance and reliability and use it to drive project priorities.
  • Oncall participation including leading incident management for complex situations.
  • Work on automation and advanced configuration management to allow our team to manage large numbers of clusters distributed across the world running various products.
  • Work with infrastructure vendors when their solutions aren't meeting our real time performance and reliability needs.

Cloud ComputingKubernetesGrafanaPrometheusWebRTCCI/CDRESTful APIsLinuxTerraformNetworkingAnsibleSoftware Engineering

Posted 3 days ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 125000.0 - 145000.0 USD per year

🔍 Software Development

🏢 Company: CompanyCam👥 101-250💰 $30,000,000 Series B over 3 years agoAndroidMessagingConstructionSaaSPhoto Sharing

  • Strong knowledge of AWS, including its networking, relational database management and cloud storage services.
  • Experience deploying containerized applications and working with Github Actions and Terraform.
  • Capable of scripting in bash, ruby and/or python.
Designing, improving and deploying infrastructure services to AWS.

AWSDockerPostgreSQLPythonBashCloud ComputingRubyTerraform

Posted 7 days ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 115000.0 - 160000.0 USD per year

🔍 Software Development

🏢 Company: Taskrabbit👥 251-500💰 Secondary Market over 9 years agoMarketplaceE-CommerceJanitorial ServiceFacilities Support ServicesFreight ServicePeer to PeerSharing Economy

  • At least 5+ years of experience in Infrastructure and DevOps Space.
  • Experience with build automation and configuration management tools (e.g. Ansible, Puppet, Chef.)
  • Strong knowledge of the Amazon Web Services (AWS) ecosystem and other core AWS technologies, ElasticSearch Service, RDS, WAF, CloudFront, Kubernetes etc.
  • You have worked with common infrastructure tools like Docker, Terraform, Helm, Github Actions, ArgoCD
  • Experience with a microservice architecture running in containers (Docker or other containerisation technology)
  • Experience supporting 24x7, high availability internet application environments that include web, application, and database servers and load balancing systems.
  • Experience working with a product that has end-users
  • Bachelor's degree or higher in Computer Science, or equivalent experience
  • Building and maintaining CI / CD pipelines from scratch for testing and releasing configuration and software.
  • Monitor and resolve issues in all environments using tools such as DataDog, PagerDuty, AWS logs.
  • Engage in capacity planning and demand forecasting, anticipating performance bottlenecks, and scaling the environment as needed using DataDog and other tools.
  • Design and implement zero-downtime to accomplish highly available service (99.9%).
  • Ensure systems are secure against cyberthreats and implement fixes for Security vulnerabilities.
  • Automate tasks and develop tools to increase engineering efficiency and visibility.
  • Design and implement disaster recovery (DR) between different regions in cloud providers such as AWS.
  • Manage web domain and certificates.
  • Troubleshoot production and testing environment issues, including performance and function issues.
  • Provide support to the organization through on-call, resolving issues and driving infrastructure changes.
  • Identify, define and document system requirements and recommend solutions to management.
  • Perform on-call duties and be part of the on-call rotations

AWSDockerBashElasticSearchKubernetesREST APICI/CDLinuxDevOpsTerraformMicroservicesAnsibleScripting

Posted 8 days ago
Apply
Apply

📍 New Zealand, United States

🔍 AI-driven work automation

🏢 Company: Autohive

  • Exceptional ability in cloud infrastructure—AWS is your foundation, but you’re adaptable.
  • Understand AI infrastructure—you’ve worked with AI/ML services, or you’re eager to learn and can ramp up fast.
  • Strong operational mindset—security, observability, and cost efficiency are always in your considerations.
  • Fluent in Linux environments and have experience supporting containerized applications.
  • Architect and manage Autohive’s AWS-based infrastructure, ensuring high availability and performance.
  • Deploy and optimize AI workloads using AWS SageMaker, Bedrock, and other AI/ML services.
  • Integrate external AI services (e.g., Azure OpenAI, third-party LLMs) while maintaining a cloud-agnostic mindset.
  • Automate everything—from infrastructure as code (Terraform/CDK) to self-healing systems.
  • Own deployment pipelines and ensure smooth, zero-downtime releases for AI-driven applications.
  • Optimize Linux-based environments, supporting high-performance AI workloads.

AWSDockerPythonCloud ComputingKubernetesMachine LearningCI/CDLinuxDevOpsTerraformScripting

Posted 10 days ago
Apply
Apply

📍 Ireland

🔍 Cybersecurity

🏢 Company: crowdstrikecareers

  • Config Management (Chef, Puppet, Salt or similar solutions
  • Provisioning & Orchestration (Terraform, Ansible or similar solutions
  • Experience developing in: Python, Ruby or GO
  • Experience with large-scale, business-critical Linux environment
  • Experience operating within the cloud, AWS & GCP preferred.
  • Experience with TDD, CI/CD, Chaos Engineering or similar resilience and reliability practices for infrastructure development.
  • Submit and review PRs for your team mates’ Infrastructure as Code
  • Responsible for testing and building terraform modules, chef cookbooks & other core infrastructure engineering tasks
  • Rewriting vital services using Golang
  • Responsible for reviewing new design proposals from your peers
  • Participate in regular retros, capacity and planning meetings with your team, allowing team collaboration and discussions in a high fidelity manner.
  • Be part of “lunch and learn” demos: for new POCs or design sessions to work out new architectures.

AWSPythonCloud ComputingGCPGitRubyGoCI/CDLinuxDevOpsTerraformMicroservicesAnsibleScripting

Posted 12 days ago
Apply
Apply

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: Phosphorus Cybersecurity Inc.

  • 5-10 years of experience working with containerized applications (Docker, Kubernetes, etc.).
  • Strong proficiency in Go and/or Python.
  • Experience provisioning virtual machines and infrastructure automation with Terraform, Packer, Ansible, or similar tools.
  • Hands-on experience deploying and managing cloud-based infrastructure in AWS or GCP.
  • Experience with maintaining OS package mirrors, package deployment infrastructure, and custom DEB or RPM packaging (preferred).
  • Strong knowledge of production software release pipeline and asset management concepts.
  • Familiarity with cloud security best practices and ability to author customer-facing security documentation.
  • Experience maintaining system security plans for cloud services and their dependencies.
  • Proven track record of building and maintaining cloud-based or virtualized appliance clusters for high-availability (HA).
  • Design, build, and maintain scalable cloud infrastructure services in AWS and GCP.
  • Contribute production-quality Go and Python code to existing cloud services.
  • Develop and own automation and software deployment pipelines with maximum efficiency.
  • Implement Infrastructure as Code (IaC) practices using Terraform, Ansible, Packer, or similar tools.
  • Maintain and improve the custom RHEL-based OS platform hosting the Phosphorus solution.
  • Design and enhance secure, reliable field software update capabilities.
  • Manage production software release pipelines, ensuring stability and efficiency.
  • Maintain system security plans, ensuring compliance with best practices for security, observability, reliability, and performance.
  • Configure and optimize cloud networking components (firewalls, VPNs, load balancers, routing tables, etc.).
  • Support and improve internal DevOps tooling for engineering teams.
  • Monitor and enhance service availability, ensuring high uptime and resilience.
  • Collaborate with engineering teams to implement observability, logging, and security best practices.

AWSDockerPythonBashCloud ComputingGCPKubernetesGoRDBMSCI/CDRESTful APIsLinuxDevOpsTerraformNetworkingAnsibleScripting

Posted 15 days ago
Apply
Apply

📍 Australia, New Zealand

🔍 Software Development

  • Experience automating failover scenarios to improve system reliability.
  • Experience building and maintaining AWS cloud-based applications and infrastructure.
  • Knowledge of caching techniques, CDNs, and DDoS mitigation for performance and security.
  • Skilled in Linux administration, monitoring, logging, and infrastructure automation
  • Experience creating and maintaining Internal Development Platforms
  • Lead initiatives to enhance system reliability, performance, security and minimise toil.
  • Educate and mentor the team to build infrastructure knowledge and skills.
  • Continuously improve processes by learning from incidents and near-misses.
  • Collaborate on designing, building, and provisioning infrastructure for new products and initiatives.
  • Develop software to integrate infrastructure with services and APIs.
  • Support and coach engineers to strengthen technical infrastructure and monitoring expertise.
  • Improve monitoring and measurement systems to support operational scale and continuous delivery.
  • Participate in incident response, troubleshooting, and post-incident reviews.
  • Optimise application performance, efficiency, and latency across the stack.
  • Identify and mitigate risks, supporting security and change management processes.

AWSDockerLeadershipCloud ComputingKubernetesCI/CDMentoringLinuxDevOpsTerraformMicroservicesScriptingSoftware Engineering

Posted 15 days ago
Apply
Apply

📍 Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: Centari👥 11-50💰 Seed 9 months agoArtificial Intelligence (AI)Legal TechKnowledge ManagementSoftware

At least 5 years of experience in SRE, DevOps, Infrastructure or a similar role.
NOT STATED

AWSDockerCloud ComputingGCPJenkinsKubernetesAzureGoCI/CDTerraform

Posted 17 days ago
Apply
Apply

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: ReadMe👥 51-100💰 $9,000,000 Series A over 5 years agoCommunitiesDeveloper APIsDocument ManagementSoftware

  • A programmer with professional experience developing server-side software with Node.js or a similar language.
  • Comfortable designing production infrastructure and implementing your ideas, leveraging technologies from service providers like AWS and Cloudflare.
  • Design and build infrastructure and tooling to make sure ReadMe’s data storage is fast, reliable, and secure.
  • Expand the tooling for our observability platform tooling to ensure that developers can make good use of logging, metrics, and alerts.
  • Add features to Deploybert — our internal automation system for deploying code to production — to support new services and expand its capabilities
  • Come up with ideas around what ReadMe’s infrastructure should look like, and communicate them to your coworkers.
  • Collaborate with other engineers and people around ReadMe.

AWSBackend DevelopmentDockerNode.jsPostgreSQLSoftware DevelopmentCloud ComputingKubernetesCI/CDRESTful APIsLinuxDevOpsTerraformMicroservicesJSONScripting

Posted 19 days ago
Apply