Apply

Infrastructure Engineer

Posted 22 days agoViewed

View full description

💎 Seniority level: Senior, 5+ years

📍 Location: United States

💸 Salary: 170000.0 - 190000.0 USD per year

🔍 Industry: Software Development

🏢 Company: Collective Inc

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWSPostgreSQLPythonCloud ComputingDjangoCI/CDLinuxDevOpsTerraform

Requirements:
  • 5+ years of experience as a Engineer doing Devops/SRE/Infrastructure work
  • Strong Python experience, bonus if that includes Django!
  • Strong DevOps background, bonus if you have AWS and other certifications
  • You love AWS and Terraform, bonus if you know Terraform Cloud.
Responsibilities:
  • Manage all aspects of our cloud infrastructure. Implementing and supporting new cloud services to support Product development
  • Be a goto expert on our CI/CD systems. Innovate, support, and develop tooling to enhance the Developer Experience
  • Help operate and improve all aspects of our observability stack. Keep tabs on performance and optimization, and help us run an efficient, cost conscious infrastructure.
  • Contribute collaboratively to the Infrastructure roadmap, helping us scale efficiently and onboard new technologies.
  • Participate in oncall. Help us grow as a learning organization, improving process, building tooling, and being part of the process.t
Apply

Related Jobs

Apply

📍 United States, Australia, Canada, South America

🧭 Full-Time

🔍 FinTech

🏢 Company: Flex

  • Proven experience in building, scaling and monitoring cloud infrastructure on AWS, especially EKS, S3, RDS, API Gateway, Load Balancers, VPC, Lambdas, DocumentDB and DynamoDB.
  • Proven experience using Terraform to update and maintain cloud infrastructure.
  • Proven experience with containerized applications, kubernetes and microservice deployments.
  • Strong knowledge of GitHub Actions and CI/CD best practices.
  • Experience with developer productivity tools: designing CI/CD workflows, building internal tools, and creating self-service solutions to streamline software development.
  • Knowledge of monitoring and observability tools and frameworks, with working knowledge of Datadog being a plus.
  • Familiarity with networking concepts (DNS, load balancing, firewalls, VPNs).
  • Strong collaboration skills with the ability to work effectively across teams and communicate technical ideas clearly.
  • Experience coding/reading in one of the industry standard language such as Java, Python, TypeScript
  • Collaborate with service engineering teams to design, implement, and maintain scalable and resilient infrastructure solutions optimizing for performance, resilience, and cost.
  • Ensure infrastructure aligns with business requirements and industry standards.
  • Leverage Terraform to automate infrastructure provisioning and configurations.
  • Implement SRE principles to improve system reliability and reduce downtime.
  • Improve developer workflows by creating self-service tools, optimizing CI/CD pipelines, and enhancing deployment processes to remove friction.
  • Develop and maintain robust monitoring and alerting systems to proactively identify and resolve issues.
  • Lead incident responses, manage on-call rotations, and facilitate post-incident reviews to drive continuous improvement and resilience.
  • Automate everything—drive adoption of Infrastructure as Code (IaC) and build automated pipelines for testing, monitoring, and deployments.
  • Leverage your excellent written and verbal communication skills, to create communications on upcoming changes and how they affect teams.

AWSDockerPythonCloud ComputingDynamoDBJavaKubernetesTypeScriptCI/CDRESTful APIsLinuxDevOpsTerraformMicroservicesNetworkingJSONAnsibleScripting

Posted about 11 hours ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

💸 150000.0 - 200000.0 USD per year

🔍 Software Development

  • 5+ years experience in infrastructure/platform engineering
  • Strong experience with AWS services
  • Experience managing infrastructure through code (Terraform)
  • Knowledge of distributed cloud-based platforms
  • Design, write, and deliver software for engineers
  • Build secure and efficient infrastructure
  • Partner with engineering teams for scalable architecture
  • Design and implement developer-facing services

AWSBashKubernetesClickhouseTerraform

Posted 3 days ago
Apply
Apply

📍 United States

🏢 Company: HSO👥 1001-5000Information Technology

  • 3-5 years of experience in cloud engineering or system automation roles
  • Prior experience in designing and managing automation workflows in Azure environments
  • Proficiency in PowerShell and Azure CLI
  • Experience with Azure Resource Manager template and Bicep
  • Knowledge of Azure DevOps, GitHub and CI/CD pipelines
  • Strong understanding of infrastructure as code principles
  • Azure certifications (preferred)
  • Design and implement Azure infrastructure: Building cloud-based solutions using Azure services like Landing Zones, virtual machines, storage, networking, and databases according to business requirements
  • Use Infrastructure as Code (IaC) and DevOps: Utilizing tools like Azure DevOps and Terraform, Bicep, or Azure Resource Manager (ARM) templates to automate infrastructure provisioning and configuration management.
  • Network management: Configuring Azure virtual networks, subnets, security groups, and load balancers to ensure network connectivity and security.
  • Security administration: Implementing Azure security best practices, managing user access, and monitoring for potential security threats.
  • Monitor and logging: Setting up monitoring solutions to track the health and performance of Azure infrastructure, analyzing logs to identify and troubleshoot issues.
  • Cost optimization: Analyzing Azure usage and optimizing resource allocation to control cloud costs.
  • Disaster recovery planning: Implementing strategies to ensure business continuity in case of system outages.
  • Collaboration with teams: Working closely with development teams, DevOps engineers, and other stakeholders to integrate cloud infrastructure with applications.

Cloud ComputingAzureCommunication SkillsCollaborationCI/CDProblem SolvingDevOpsTerraformNetworkingTroubleshootingScripting

Posted 7 days ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

🔍 AI Research

🏢 Company: Cohere👥 251-500💰 $169,509,482 Grant 3 months ago🫂 Last layoff 7 months agoArtificial Intelligence (AI)Machine LearningGenerative AINatural Language Processing

  • Experience with Opensearch or Elasticsearch clusters in production
  • Production deployment and management of Kubernetes clusters
  • Hands-on coding experience developing services and automated tests
  • Experience with GCP, AWS, or Azure for cloud-based infrastructure
  • Support Search Services for internal and external use cases
  • Automate observability and resilience
  • Participate in on-call rotation
  • Build relationships with developers
  • Share knowledge and review processes

AWSCloud ComputingElasticSearchGCPKubernetesAPI testingAzure

Posted 7 days ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: EvenUp👥 251-500💰 $135,000,000 Series D 5 months agoArtificial Intelligence (AI)Legal TechFinTechSoftware

  • 5+ years of infrastructure engineering/DevOps/SRE experience working with cloud-native environments
  • Familiarity with relevant technology stacks a plus (ie. AWS/GCP, Kubernetes, Docker, Infrastructure as a Code, CI/CD, Cloud Security, Monitoring Logging and Altering, MLOPS, QA)
  • Understand the value of having a high-quality code-based infrastructure that is simple, understandable, and reusable.
  • Can communicate technical ideas or issues in easy-to-understand and actionable terms
  • Learn quickly and are seeking opportunities to work cross-functionally (including data engineering, DevOps…) and with a diverse group of people
  • 75% doing system design and contributing infrastructure, starting with shipping solution within 2 weeks!
  • 25% collaborating with stakeholders and mentoring, lunch and learns, and more
  • Leverage a self-starter mindset by taking a product concept and building the feature end to end (whether it’s a component of the system or a significant piece of functionality).
  • Collaborate with the team to scale the tech stack based on our rapidly growing user base!

AWSDockerPythonSQLCloud ComputingKubernetesQAData engineeringREST APICI/CDDevOpsTerraform

Posted 8 days ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

🏢 Company: Varicent

  • 4+ years of professional experience in CloudOps / DevOps role.
  • Experience with cloud platforms like AWS, Azure, and IBM.
  • Proficiency in one or more programming languages, including Python.
  • Lead infrastructure service requests towards customer focused resolution and improvements.
  • Monitor and tune application infrastructure for optimal performance.
  • Work with application teams for automation of configuration and deployment.

AWSDockerPythonKubernetesAzureCI/CDLinuxNetworking

Posted 9 days ago
Apply
Apply

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: Aimpoint Digital👥 1-50ConsultingAnalyticsAdvice

  • Degree-educated in Computer Science, Engineering, Mathematics, or equivalent experience
  • Strong written and verbal communication skills
  • Experience managing stakeholders and collaborating with customers
  • Experience scoping cloud infrastructure cases, developing proposals, and presenting to key stakeholders
  • Experience building and maintaining cloud architecture at scale; primarily in AWS
  • Requirement: demonstrated administrative experience to guide best-in-class security architecture
  • Requirement: demonstrated experience designing resource monitoring processes (e.g. cost, performance)
  • Experience working with cloud data warehouses (Snowflake, Google BigQuery, AWS Redshift, Microsoft Synapse)
  • Experience working with modern Data + AI platforms (Databricks, Snowflake, SageMaker)
  • Experience with cloud security and governance principles
  • Experience working with cloud platforms (AWS, Azure, GCP) and container technologies (Docker, Kubernetes)
  • Working understanding of relational databases, query languages and data modeling practices
  • 7+ years designing, implementing and maintaining cloud architectures (primarily AWS, others a plus)
  • 5+ years building data pipelines in production and ability to work across structured, semi-structured and unstructured data
  • 3+ years writing clean, maintainable, and robust code in Python, Scala, Java, or similar coding languages
  • Become a trusted advisor working together with our clients, from data owners and analytic users to C-level executives
  • Engage and lead multi-disciplinary teams to solve complex use-cases across a variety of industries
  • Manage high-priority accounts that can range from global leaders to emerging disruptors within our target verticals as they look to modernize their cloud Infrastructure environments, develop repeatable deployment frameworks, and operationalize their cloud environment for the application of artificial intelligence
  • Help establish and expand commercial offerings within our priority cloud data platform partnerships (namely AWS)
  • Assess existing cloud infrastructure and business processes and advise on best-in-class modern solutions
  • Design the cloud architecture, building data ingestion and storage protocols, and ensuring best-in-class integration with modern data platforms (e.g. Snowflake, Databricks)
  • Work with common AWS infrastructure stack like Kinesis, IoT Core, Glue, Kafka, SageMaker, Redshift and ensure the environment adheres to best-in-class security, governance and monitoring principles (e.g. AWS IAM, Artifact, CloudWatch, CloudTrail).

AWSDockerLeadershipProject ManagementPythonSQLBusiness AnalysisCloud ComputingJavaKafkaKubernetesSnowflakeData engineeringREST APIResource PlanningCommunication SkillsCI/CDDevOpsTerraformJSONScalaStakeholder managementData modelingSoftware EngineeringData management

Posted 9 days ago
Apply
Apply

📍 U.S.

🧭 Full-Time

💸 161000.0 - 194000.0 USD per year

🔍 FinTech

🏢 Company: Flex

  • Proven experience in building, scaling, and monitoring cloud infrastructure on AWS (EKS, S3, RDS, etc.).
  • Experience using Terraform for cloud infrastructure management.
  • Experience with containerized applications, Kubernetes, and microservice deployments.
  • Strong knowledge of GitHub Actions and CI/CD best practices.
  • Experience with developer productivity tools to streamline software development.
  • Knowledge of monitoring tools and frameworks, Datadog preferred.
  • Familiarity with networking concepts like DNS and load balancing.
  • Experience coding/reading in Java, Python, or TypeScript.
  • Collaborate with service engineering teams to design, implement, and maintain scalable and resilient infrastructure solutions.
  • Ensure infrastructure aligns with business requirements and industry standards.
  • Leverage Terraform to automate infrastructure provisioning.
  • Implement SRE principles to enhance reliability.
  • Improve developer workflows by creating self-service tools and optimizing CI/CD pipelines.
  • Develop robust monitoring systems.
  • Lead incident responses and facilitate post-incident reviews.
  • Drive adoption of Infrastructure as Code and build automated pipelines.

AWSDockerPythonAWS EKSDynamoDBJavaKubernetesTypeScriptCI/CDTerraform

Posted 15 days ago
Apply
Apply

📍 United States, Canada

🧭 Full-Time

🔍 B2B SaaS

NOT STATED
  • Develop complex products
  • Manage infrastructure processing petabytes of data

AWSDockerPostgreSQLPythonData engineeringMicroservices

Posted 19 days ago
Apply
Apply

📍 U.S.

🔍 Blockchain and finance

  • 3+ years of hands-on experience in Infrastructure or DevOps Engineering roles.
  • Based in a U.S. time zone (PT, MT, CT, or ET).
  • Strong understanding of CI/CD pipelines and implementation in automated deployments.
  • Demonstrated experience with cloud computing platforms (e.g., GCP, AWS) and infrastructure-as-code tools like Terraform or CloudFormation.
  • Expertise in observability best practices and monitoring tools (e.g., Prometheus, DataDog, Grafana).
  • Demonstrated experience in deploying, operating, and optimizing high-performance, large-scale blockchain infrastructure including blockchain nodes.
  • Experience with at least 1 of EVM or Cosmos SDK blockchain ecosystems.
  • Experience with Linux, cloud networking, and containerization technology like Docker.
  • Excellent technical and non-technical communication skills, both verbal and written.
  • Implement improvements to optimize system scalability, performance, and fault tolerance.
  • Design and manage internal pipelines that automate build/test/deploy of key Ondo infrastructure.
  • Proactively handle incident response, investigate root causes, and resolve issues related to Ondo infrastructure with appropriate urgency.
  • Create a monitoring/alerting/on-call system to ensure Ondo infrastructure has optimal availability and performance.
  • Drive strategic technical decisions for the infrastructure roadmap, overall tech stack, and engineering practices.
  • Collaborate with protocol and smart contract engineers to integrate nodes and other relevant services.

AWSDockerPythonBlockchainGCPGoGrafanaPrometheusRustCI/CDLinuxTerraform

Posted 20 days ago
Apply