Apply

Infrastructure Engineer

Posted 5 months agoViewed

View full description

📍 Location: West Coast, Central Europe, PT, CET

🔍 Industry: Machine intelligence

🏢 Company: Gensyn👥 1-10💰 $43,000,000 Series A almost 2 years agoCryptocurrencyBlockchainMachine Learning

🪄 Skills: CI/CDDevOpsTerraform

Requirements:
  • Experience deploying large scale workloads to one or more cloud providers.
  • Experience monitoring and being on call for internal and external deployments.
  • Infrastructure as Code and GitOps expertise, preferably with Terraform.
  • Experience with Kubernetes or equivalent container orchestration solution.
  • Experience managing OTel, Prometheus, or similar open-source observability systems.
  • Proficiency in Python, shell scripting, and Linux.
Responsibilities:
  • Support the development, deployment, and observability of the mainnet and internal products.
  • Manage observability and deployments for a complex, distributed system.
  • Manage CI/CD and build pipeline infrastructure.
  • Author and maintain internal developer support tools.
Apply

Related Jobs

Apply

📍 United States, Brazil, Tel Aviv

🔍 Software Development

🏢 Company: Axonius👥 600-600💰 $200,000,000 Series E about 1 year agoAsset ManagementCloud SecurityInformation TechnologyCyber SecurityNetwork Security

  • 2+ years of experience in building and maintaining automation infrastructure, CI/CD pipelines, and related tools.
  • Strong proficiency in Python and PyTest for automation scripting and framework development.
  • Hands-on experience with Docker and Docker Compose for creating test environments and managing containers.
  • Strong experience with version control systems (Git, GitHub, GitHub Actions).
  • Proficiency in Bash scripting and a good understanding of Linux systems and terminal usage.
  • Experience with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker).
  • Familiarity with automation tools and frameworks such as Selenium, Playwright, or similar UI testing frameworks.
  • Strong problem-solving skills and the ability to troubleshoot complex infrastructure issues.
  • Excellent communication and collaboration skills, with the ability to work with cross-functional teams.
  • Design, build, and maintain the test automation infrastructure that supports automated test execution for UI, API, Unittests and more.
  • Work closely with developers and QA teams to integrate automation tools into the CI/CD pipeline, ensuring automated tests run efficiently and reliably.
  • Manage the underlying infrastructure, including Docker, and cloud-based services, to ensure smooth operation of the test automation framework.
  • Create and optimize tools and scripts that facilitate the development and execution of automated tests (e.g., integrating with GitHub Actions, TeamCity, etc.).
  • Continuously monitor the performance of the test infrastructure, troubleshoot issues, and optimize the environment for speed and scalability.
  • Partner with cross-functional teams to understand testing needs and provide solutions that enable robust, scalable, and efficient test automation environments.
  • Promote and implement best practices for building and maintaining automation infrastructure, ensuring high standards for reliability and maintainability.

AWSDockerPythonBashGitSeleniumCI/CDLinux

Posted 2 days ago
Apply
Apply

📍 U.S

🧭 Full-Time

🏢 Company: LiveKit👥 11-50💰 $22,000,000 Series A about 1 year agoArtificial Intelligence (AI)Real TimeCloud Infrastructure

Experience managing complex multi-region distributed systems running on top of container orchestration systems like Kubernetes.
  • Build and own the foundational infrastructure that our products run upon.
  • Work directly on our products' golang code base to implement SRE related objectives.
  • Take a data driven approach to quantifying system performance and reliability and use it to drive project priorities.
  • Oncall participation including leading incident management for complex situations.
  • Work on automation and advanced configuration management to allow our team to manage large numbers of clusters distributed across the world running various products.
  • Work with infrastructure vendors when their solutions aren't meeting our real time performance and reliability needs.

Cloud ComputingKubernetesGrafanaPrometheusWebRTCCI/CDRESTful APIsLinuxTerraformNetworkingAnsibleSoftware Engineering

Posted 2 days ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 125000.0 - 145000.0 USD per year

🔍 Software Development

🏢 Company: CompanyCam👥 101-250💰 $30,000,000 Series B over 3 years agoAndroidMessagingConstructionSaaSPhoto Sharing

  • Strong knowledge of AWS, including its networking, relational database management and cloud storage services.
  • Experience deploying containerized applications and working with Github Actions and Terraform.
  • Capable of scripting in bash, ruby and/or python.
Designing, improving and deploying infrastructure services to AWS.

AWSDockerPostgreSQLPythonBashCloud ComputingRubyTerraform

Posted 7 days ago
Apply
Apply

📍 Worldwide

🧭 Full-Time

💸 145000.0 - 160000.0 USD per year

  • Proficiency in Google Cloud Platform (GCP) and container orchestration using Kubernetes/GKE.
  • Experience with infrastructure-as-code tools, particularly Terraform.
  • Familiarity with high-scale edge computing environments using CDNs and designing high-availability systems.
  • Basic understanding of security best practices in cloud infrastructure.
  • Assisting in the deployment, configuration, and maintenance of cloud resources on GCP, with a focus on Kubernetes/GKE environments.
  • Utilizing Terraform to automate infrastructure provisioning and ensure consistent, zero-downtime deployments.
  • Supporting the implementation and optimization of edge computing strategies, including managing CDN configurations and designing high-availability systems for global audiences.
  • Applying and advocating for security best practices across our cloud infrastructure to safeguard production systems.
  • Collaborating with team members to monitor system performance, troubleshoot issues, and drive continuous improvement in our infrastructure.
  • Working closely with senior engineers and other teams to integrate new technologies, share knowledge, and adopt modern infrastructure practices.

Cloud ComputingGCPKubernetesCI/CDTerraform

Posted 8 days ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 115000.0 - 160000.0 USD per year

🔍 Software Development

🏢 Company: Taskrabbit👥 251-500💰 Secondary Market over 9 years agoMarketplaceE-CommerceJanitorial ServiceFacilities Support ServicesFreight ServicePeer to PeerSharing Economy

  • At least 5+ years of experience in Infrastructure and DevOps Space.
  • Experience with build automation and configuration management tools (e.g. Ansible, Puppet, Chef.)
  • Strong knowledge of the Amazon Web Services (AWS) ecosystem and other core AWS technologies, ElasticSearch Service, RDS, WAF, CloudFront, Kubernetes etc.
  • You have worked with common infrastructure tools like Docker, Terraform, Helm, Github Actions, ArgoCD
  • Experience with a microservice architecture running in containers (Docker or other containerisation technology)
  • Experience supporting 24x7, high availability internet application environments that include web, application, and database servers and load balancing systems.
  • Experience working with a product that has end-users
  • Bachelor's degree or higher in Computer Science, or equivalent experience
  • Building and maintaining CI / CD pipelines from scratch for testing and releasing configuration and software.
  • Monitor and resolve issues in all environments using tools such as DataDog, PagerDuty, AWS logs.
  • Engage in capacity planning and demand forecasting, anticipating performance bottlenecks, and scaling the environment as needed using DataDog and other tools.
  • Design and implement zero-downtime to accomplish highly available service (99.9%).
  • Ensure systems are secure against cyberthreats and implement fixes for Security vulnerabilities.
  • Automate tasks and develop tools to increase engineering efficiency and visibility.
  • Design and implement disaster recovery (DR) between different regions in cloud providers such as AWS.
  • Manage web domain and certificates.
  • Troubleshoot production and testing environment issues, including performance and function issues.
  • Provide support to the organization through on-call, resolving issues and driving infrastructure changes.
  • Identify, define and document system requirements and recommend solutions to management.
  • Perform on-call duties and be part of the on-call rotations

AWSDockerBashElasticSearchKubernetesREST APICI/CDLinuxDevOpsTerraformMicroservicesAnsibleScripting

Posted 8 days ago
Apply
Apply

📍 New Zealand, United States

🔍 AI-driven work automation

🏢 Company: Autohive

  • Exceptional ability in cloud infrastructure—AWS is your foundation, but you’re adaptable.
  • Understand AI infrastructure—you’ve worked with AI/ML services, or you’re eager to learn and can ramp up fast.
  • Strong operational mindset—security, observability, and cost efficiency are always in your considerations.
  • Fluent in Linux environments and have experience supporting containerized applications.
  • Architect and manage Autohive’s AWS-based infrastructure, ensuring high availability and performance.
  • Deploy and optimize AI workloads using AWS SageMaker, Bedrock, and other AI/ML services.
  • Integrate external AI services (e.g., Azure OpenAI, third-party LLMs) while maintaining a cloud-agnostic mindset.
  • Automate everything—from infrastructure as code (Terraform/CDK) to self-healing systems.
  • Own deployment pipelines and ensure smooth, zero-downtime releases for AI-driven applications.
  • Optimize Linux-based environments, supporting high-performance AI workloads.

AWSDockerPythonCloud ComputingKubernetesMachine LearningCI/CDLinuxDevOpsTerraformScripting

Posted 10 days ago
Apply
Apply

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: Phosphorus Cybersecurity Inc.

  • 5-10 years of experience working with containerized applications (Docker, Kubernetes, etc.).
  • Strong proficiency in Go and/or Python.
  • Experience provisioning virtual machines and infrastructure automation with Terraform, Packer, Ansible, or similar tools.
  • Hands-on experience deploying and managing cloud-based infrastructure in AWS or GCP.
  • Experience with maintaining OS package mirrors, package deployment infrastructure, and custom DEB or RPM packaging (preferred).
  • Strong knowledge of production software release pipeline and asset management concepts.
  • Familiarity with cloud security best practices and ability to author customer-facing security documentation.
  • Experience maintaining system security plans for cloud services and their dependencies.
  • Proven track record of building and maintaining cloud-based or virtualized appliance clusters for high-availability (HA).
  • Design, build, and maintain scalable cloud infrastructure services in AWS and GCP.
  • Contribute production-quality Go and Python code to existing cloud services.
  • Develop and own automation and software deployment pipelines with maximum efficiency.
  • Implement Infrastructure as Code (IaC) practices using Terraform, Ansible, Packer, or similar tools.
  • Maintain and improve the custom RHEL-based OS platform hosting the Phosphorus solution.
  • Design and enhance secure, reliable field software update capabilities.
  • Manage production software release pipelines, ensuring stability and efficiency.
  • Maintain system security plans, ensuring compliance with best practices for security, observability, reliability, and performance.
  • Configure and optimize cloud networking components (firewalls, VPNs, load balancers, routing tables, etc.).
  • Support and improve internal DevOps tooling for engineering teams.
  • Monitor and enhance service availability, ensuring high uptime and resilience.
  • Collaborate with engineering teams to implement observability, logging, and security best practices.

AWSDockerPythonBashCloud ComputingGCPKubernetesGoRDBMSCI/CDRESTful APIsLinuxDevOpsTerraformNetworkingAnsibleScripting

Posted 15 days ago
Apply
Apply

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: ReadMe👥 51-100💰 $9,000,000 Series A over 5 years agoCommunitiesDeveloper APIsDocument ManagementSoftware

  • A programmer with professional experience developing server-side software with Node.js or a similar language.
  • Comfortable designing production infrastructure and implementing your ideas, leveraging technologies from service providers like AWS and Cloudflare.
  • Design and build infrastructure and tooling to make sure ReadMe’s data storage is fast, reliable, and secure.
  • Expand the tooling for our observability platform tooling to ensure that developers can make good use of logging, metrics, and alerts.
  • Add features to Deploybert — our internal automation system for deploying code to production — to support new services and expand its capabilities
  • Come up with ideas around what ReadMe’s infrastructure should look like, and communicate them to your coworkers.
  • Collaborate with other engineers and people around ReadMe.

AWSBackend DevelopmentDockerNode.jsPostgreSQLSoftware DevelopmentCloud ComputingKubernetesCI/CDRESTful APIsLinuxDevOpsTerraformMicroservicesJSONScripting

Posted 19 days ago
Apply
Apply

📍 United States

🧭 Full-Time

💸 180000.0 - 190000.0 USD per year

🔍 Software Development

🏢 Company: Eppo

  • Hands-on experience with Google Cloud Platform (or other major cloud providers), including resource management, cost optimization, and security.
  • Proficiency with Docker and Kubernetes (including local development tools such as Docker Compose, Minikube, or K3s).
  • Experience with Terraform (or similar IaC tools) to deploy and maintain complex infrastructure in a repeatable, scalable manner.
  • Own and optimize our Google Cloud Platform (GCP) resources, ensuring our infrastructure remains secure, scalable, and cost-effective.
  • Improve developer productivity by optimizing our local development environments and establishing best-in-class CI/CD pipelines.
  • Collaborate with product and data engineering teams to provide guidance on observability, performance, and scalability best practices, while ensuring our Kubernetes deployments are streamlined and resilient.

DockerPythonSQLBashCloud ComputingGCPKubernetesGoREST APICI/CDLinuxDevOpsTerraformMicroservicesJSONAnsible

Posted 20 days ago
Apply
Apply

📍 United States, Hong Kong, United Kingdom

🧭 Full-Time

💸 175000.0 - 245000.0 USD per year

🔍 Software Development

🏢 Company: Ontra👥 101-250💰 $200,000,000 Series B over 3 years agoLegal TechDocument ManagementInformation TechnologyLegalSoftware

  • 6+ years setting up and maintaining tools like Jenkins, Travis CI, CircleCI, or GitHub Actions
  • 6+ years of experience using tools such as Terraform, CloudFormation, or Pulumi
  • A general background in using programming languages such as Python, Ruby, Go, and/or Javascript
  • 6+ years of experience provisioning, configuring, and optimizing cloud resources (we use AWS)
  • A background in working with cross-functional teams to facilitate an efficient software delivery pipeline and align on projects and goals
  • Use Kubernetes and Docker to create, manage, and scale containerized applications
  • Implement and manage monitoring and logging tools such as DataDog and Prometheus to ensure the health and performance of applications and infrastructure
  • Manage GitHub and implement branching strategies and code reviews to ensure code quality
  • Gather and analyze requirements from various stakeholders to define infrastructure and deployment needs.
  • Maintain clear documentation for infrastructure and processes to ensure that team members can understand and reproduce the environment as needed

AWSDockerPythonBashCloud ComputingGitJenkinsKubernetesPrometheusCI/CDTerraform

Posted 28 days ago
Apply