Infrastructure Engineer

Posted 9 days ago

💎 Seniority level: Senior, 5+ years

📍 Location: United States, Canada

🔍 Industry: Software Development

🏢 Company: EvenUp | 👥 251-500 | 💰 $135,000,000 Series D 5 months ago | Artificial Intelligence (AI), Legal Tech, FinTech, Software

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWS, Docker, Python, SQL, Cloud Computing, Kubernetes, QA, Data engineering, REST API, CI/CD, DevOps, Terraform

Requirements:
  • 5+ years of infrastructure engineering/DevOps/SRE experience working with cloud-native environments
  • Familiarity with relevant technology stacks a plus (e.g. AWS/GCP, Kubernetes, Docker, Infrastructure as Code, CI/CD, Cloud Security, Monitoring, Logging and Alerting, MLOps, QA)
  • Understand the value of having a high-quality code-based infrastructure that is simple, understandable, and reusable.
  • Can communicate technical ideas or issues in easy-to-understand and actionable terms
  • Learn quickly and seek opportunities to work cross-functionally (including data engineering, DevOps…) and with a diverse group of people
Responsibilities:
  • 75% system design and infrastructure contributions, starting with shipping a solution within 2 weeks!
  • 25% collaborating with stakeholders and mentoring, lunch and learns, and more
  • Leverage a self-starter mindset by taking a product concept and building the feature end to end (whether it’s a component of the system or a significant piece of functionality).
  • Collaborate with the team to scale the tech stack based on our rapidly growing user base!

Related Jobs

📍 United States, Australia, Canada, South America

🧭 Full-Time

🔍 FinTech

🏢 Company: Flex

  • Proven experience in building, scaling and monitoring cloud infrastructure on AWS, especially EKS, S3, RDS, API Gateway, Load Balancers, VPC, Lambdas, DocumentDB and DynamoDB.
  • Proven experience using Terraform to update and maintain cloud infrastructure.
  • Proven experience with containerized applications, Kubernetes and microservice deployments.
  • Strong knowledge of GitHub Actions and CI/CD best practices.
  • Experience with developer productivity tools: designing CI/CD workflows, building internal tools, and creating self-service solutions to streamline software development.
  • Knowledge of monitoring and observability tools and frameworks, with working knowledge of Datadog being a plus.
  • Familiarity with networking concepts (DNS, load balancing, firewalls, VPNs).
  • Strong collaboration skills with the ability to work effectively across teams and communicate technical ideas clearly.
  • Experience coding/reading in one of the industry-standard languages such as Java, Python, or TypeScript
  • Collaborate with service engineering teams to design, implement, and maintain scalable and resilient infrastructure solutions optimizing for performance, resilience, and cost.
  • Ensure infrastructure aligns with business requirements and industry standards.
  • Leverage Terraform to automate infrastructure provisioning and configurations.
  • Implement SRE principles to improve system reliability and reduce downtime.
  • Improve developer workflows by creating self-service tools, optimizing CI/CD pipelines, and enhancing deployment processes to remove friction.
  • Develop and maintain robust monitoring and alerting systems to proactively identify and resolve issues.
  • Lead incident responses, manage on-call rotations, and facilitate post-incident reviews to drive continuous improvement and resilience.
  • Automate everything—drive adoption of Infrastructure as Code (IaC) and build automated pipelines for testing, monitoring, and deployments.
  • Leverage your excellent written and verbal communication skills to create communications on upcoming changes and how they affect teams.

AWS, Docker, Python, Cloud Computing, DynamoDB, Java, Kubernetes, TypeScript, CI/CD, RESTful APIs, Linux, DevOps, Terraform, Microservices, Networking, JSON, Ansible, Scripting

Posted 1 day ago

📍 United States, Canada

🧭 Full-Time

💸 $150,000 - $200,000 USD per year

🔍 Software Development

  • 5+ years experience in infrastructure/platform engineering
  • Strong experience with AWS services
  • Experience managing infrastructure through code (Terraform)
  • Knowledge of distributed cloud-based platforms
  • Design, write, and deliver software for engineers
  • Build secure and efficient infrastructure
  • Partner with engineering teams for scalable architecture
  • Design and implement developer-facing services

AWS, Bash, Kubernetes, ClickHouse, Terraform

Posted 4 days ago

📍 Canada

🔍 Software Development

🏢 Company: Finite State | 👥 101-250 | 💰 $20,000,000 Series B 11 months ago | 🫂 Last layoff over 2 years ago | Internet of Things, Security, Risk Management, Supply Chain Management, Cyber Security

  • Experience managing and supporting development and production environment infrastructure in both cloud and private data centers
  • A deep knowledge of Kubernetes, Helm, Terraform, and Linux
  • Proficiency in the core AWS technologies for security, networking, provisioning, configuration management, messaging, and monitoring
  • Experience with Python and shell scripting
  • A solid understanding of CI/CD processes and related metrics
  • Experience establishing, automating, and managing system support
  • Excellent communication, prioritization, and troubleshooting skills
  • Being cool under pressure in the face of production incidents, and being able to methodically and quickly return a system to good health.
  • Build and enhance our AWS Cloud Infrastructure, as well as on-premise deployments.
  • Fully automate infrastructure provisioning and configuration.
  • Propose, plan, and lead projects to update and improve our infrastructure.
  • Act as a thought leader in how to leverage AWS, Cloud, and stand-alone technologies cohesively.
  • Maintain strong cybersecurity discipline and support third-party security audits.
  • Work with stakeholders to understand the system requirements to meet business needs.
  • Work with software developers to understand the challenges they face balancing velocity, reliability, security, and cost.
  • Develop tooling that continually improves testability, eases deployments, fosters repeatability, and enables observability.
  • Ensure that all relevant parties understand and participate in a healthy DevOps culture of accountability and shared responsibility.
  • Mentor and coach other members of the Infrastructure Engineering team.

AWS, Leadership, PostgreSQL, Python, Amazon RDS, AWS EKS, Cloud Computing, Cybersecurity, Kubernetes, Redis, Communication Skills, CI/CD, RESTful APIs, Mentoring, Linux, DevOps, Terraform, Troubleshooting, Scripting

Posted 6 days ago

📍 USA

🧭 Full-Time

🔍 Software Development

🏢 Company: Dandy | 👥 501-1000 | Food and Beverage, Food Processing

  • 5+ years of software engineering experience, preferably in a high growth startup environment
  • An expert in Google Cloud Platform and Google Kubernetes Engine
  • Experience with infrastructure as code platforms (Terraform, Pulumi)
  • Experience creating and maintaining fully automated CI/CD build processes for multiple environments
  • Experience designing the architecture and automation of infrastructure within a cloud environment
  • Develop and maintain infrastructure, systems, and tooling to support Dandy’s products in a secure, well-tested, and performant way.
  • Reinvent an analog experience and disrupt a legacy industry through novel and scalable system design.
  • Collaborate with Product Engineers and other stakeholders within Engineering, Product and Data to maintain a high bar for quality in a fast-paced, iterative environment.
  • Advocate for improvements to infrastructure quality, security, and performance.
  • Craft code that meets our internal standards for style, maintainability, and best practices.
  • Recognize impediments to our efficiency as a team ("technical debt"), and propose and implement solutions.

GraphQL, Node.js, PostgreSQL, Cloud Computing, GCP, Kubernetes, TypeScript, Nest.js, CI/CD, DevOps, Terraform, Software Engineering

Posted 6 days ago

📍 United States

🏢 Company: HSO | 👥 1001-5000 | Information Technology

  • 3-5 years of experience in cloud engineering or system automation roles
  • Prior experience in designing and managing automation workflows in Azure environments
  • Proficiency in PowerShell and Azure CLI
  • Experience with Azure Resource Manager templates and Bicep
  • Knowledge of Azure DevOps, GitHub and CI/CD pipelines
  • Strong understanding of infrastructure as code principles
  • Azure certifications (preferred)
  • Design and implement Azure infrastructure: Building cloud-based solutions using Azure services like Landing Zones, virtual machines, storage, networking, and databases according to business requirements
  • Use Infrastructure as Code (IaC) and DevOps: Utilizing tools like Azure DevOps together with Terraform, Bicep, or Azure Resource Manager (ARM) templates to automate infrastructure provisioning and configuration management.
  • Network management: Configuring Azure virtual networks, subnets, security groups, and load balancers to ensure network connectivity and security.
  • Security administration: Implementing Azure security best practices, managing user access, and monitoring for potential security threats.
  • Monitoring and logging: Setting up monitoring solutions to track the health and performance of Azure infrastructure, and analyzing logs to identify and troubleshoot issues.
  • Cost optimization: Analyzing Azure usage and optimizing resource allocation to control cloud costs.
  • Disaster recovery planning: Implementing strategies to ensure business continuity in case of system outages.
  • Collaboration with teams: Working closely with development teams, DevOps engineers, and other stakeholders to integrate cloud infrastructure with applications.

Cloud Computing, Azure, Communication Skills, Collaboration, CI/CD, Problem Solving, DevOps, Terraform, Networking, Troubleshooting, Scripting

Posted 8 days ago

📍 United States, Canada

🧭 Full-Time

🔍 AI Research

🏢 Company: Cohere | 👥 251-500 | 💰 $169,509,482 Grant 3 months ago | 🫂 Last layoff 7 months ago | Artificial Intelligence (AI), Machine Learning, Generative AI, Natural Language Processing

  • Experience with OpenSearch or Elasticsearch clusters in production
  • Production deployment and management of Kubernetes clusters
  • Hands-on coding experience developing services and automated tests
  • Experience with GCP, AWS, or Azure for cloud-based infrastructure
  • Support Search Services for internal and external use cases
  • Automate observability and resilience
  • Participate in on-call rotation
  • Build relationships with developers
  • Share knowledge and review processes

AWS, Cloud Computing, Elasticsearch, GCP, Kubernetes, API testing, Azure

Posted 8 days ago

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: Varicent

  • 4+ years of professional experience in a CloudOps/DevOps role.
  • Experience with cloud platforms like AWS, Azure, and IBM.
  • Proficiency in one or more programming languages, including Python.
  • Lead infrastructure service requests towards customer-focused resolution and improvements.
  • Monitor and tune application infrastructure for optimal performance.
  • Work with application teams for automation of configuration and deployment.

AWS, Docker, Python, Kubernetes, Azure, CI/CD, Linux, Networking

Posted 9 days ago

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: Aimpoint Digital | 👥 1-50 | Consulting, Analytics, Advice

  • Degree-educated in Computer Science, Engineering, Mathematics, or equivalent experience
  • Strong written and verbal communication skills
  • Experience managing stakeholders and collaborating with customers
  • Experience scoping cloud infrastructure cases, developing proposals, and presenting to key stakeholders
  • Experience building and maintaining cloud architecture at scale; primarily in AWS
  • Requirement: demonstrated administrative experience to guide best-in-class security architecture
  • Requirement: demonstrated experience designing resource monitoring processes (e.g. cost, performance)
  • Experience working with cloud data warehouses (Snowflake, Google BigQuery, AWS Redshift, Microsoft Synapse)
  • Experience working with modern Data + AI platforms (Databricks, Snowflake, SageMaker)
  • Experience with cloud security and governance principles
  • Experience working with cloud platforms (AWS, Azure, GCP) and container technologies (Docker, Kubernetes)
  • Working understanding of relational databases, query languages and data modeling practices
  • 7+ years designing, implementing and maintaining cloud architectures (primarily AWS, others a plus)
  • 5+ years building data pipelines in production and ability to work across structured, semi-structured and unstructured data
  • 3+ years writing clean, maintainable, and robust code in Python, Scala, Java, or similar coding languages
  • Become a trusted advisor working together with our clients, from data owners and analytic users to C-level executives
  • Engage and lead multi-disciplinary teams to solve complex use-cases across a variety of industries
  • Manage high-priority accounts, ranging from global leaders to emerging disruptors within our target verticals, as they look to modernize their cloud infrastructure environments, develop repeatable deployment frameworks, and operationalize their cloud environment for the application of artificial intelligence
  • Help establish and expand commercial offerings within our priority cloud data platform partnerships (namely AWS)
  • Assess existing cloud infrastructure and business processes and advise on best-in-class modern solutions
  • Design the cloud architecture, build data ingestion and storage protocols, and ensure best-in-class integration with modern data platforms (e.g. Snowflake, Databricks)
  • Work with a common AWS infrastructure stack (e.g. Kinesis, IoT Core, Glue, Kafka, SageMaker, Redshift) and ensure the environment adheres to best-in-class security, governance and monitoring principles (e.g. AWS IAM, Artifact, CloudWatch, CloudTrail).

AWS, Docker, Leadership, Project Management, Python, SQL, Business Analysis, Cloud Computing, Java, Kafka, Kubernetes, Snowflake, Data engineering, REST API, Resource Planning, Communication Skills, CI/CD, DevOps, Terraform, JSON, Scala, Stakeholder management, Data modeling, Software Engineering, Data management

Posted 10 days ago

📍 United States

🧭 Full-Time

💸 $130,000 - $185,000 USD per year

🔍 Software Development

🏢 Company: Datavant | 👥 1001-5000 | 💰 $40,000,000 Series B over 4 years ago | Biopharma, Clinical Trials, Data Integration, Health Care, Software

  • 5+ years experience in software development
  • 2+ years experience building and maintaining a data lake and/or data warehouse
  • Strong understanding of cloud architecture
  • 2+ years experience using a cloud provider
  • Experience writing Infrastructure-as-Code
  • Deep knowledge of operational database products
  • Collaborate with department and software engineers
  • Plan and delegate complex projects
  • Mentor early career developers
  • Facilitate technical discussions
  • Engage with stakeholders
  • Build and maintain data-related infrastructure
  • Write performant, reusable code and Infrastructure-as-Code
  • Review code to ensure technical quality

AWS, Leadership, Python, Apache Airflow, Snowflake

Posted 14 days ago

📍 U.S.

🧭 Full-Time

💸 $161,000 - $194,000 USD per year

🔍 FinTech

🏢 Company: Flex

  • Proven experience in building, scaling, and monitoring cloud infrastructure on AWS (EKS, S3, RDS, etc.).
  • Experience using Terraform for cloud infrastructure management.
  • Experience with containerized applications, Kubernetes, and microservice deployments.
  • Strong knowledge of GitHub Actions and CI/CD best practices.
  • Experience with developer productivity tools to streamline software development.
  • Knowledge of monitoring tools and frameworks, Datadog preferred.
  • Familiarity with networking concepts like DNS and load balancing.
  • Experience coding/reading in Java, Python, or TypeScript.
  • Collaborate with service engineering teams to design, implement, and maintain scalable and resilient infrastructure solutions.
  • Ensure infrastructure aligns with business requirements and industry standards.
  • Leverage Terraform to automate infrastructure provisioning.
  • Implement SRE principles to enhance reliability.
  • Improve developer workflows by creating self-service tools and optimizing CI/CD pipelines.
  • Develop robust monitoring systems.
  • Lead incident responses and facilitate post-incident reviews.
  • Drive adoption of Infrastructure as Code and build automated pipelines.

AWS, Docker, Python, AWS EKS, DynamoDB, Java, Kubernetes, TypeScript, CI/CD, Terraform

Posted 16 days ago