Infrastructure Engineer

Posted 6 days ago

💎 Seniority level: Senior, 5+ years

📍 Location: United States

💸 Salary: 160,000 - 264,000 USD per year

🔍 Industry: Creative operations

🏢 Company: Air | 👥 51-100 | 💰 $10,000,000 about 3 years ago | Productivity Tools, Digital Media, Content, Collaboration, Cloud Storage

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWS, Docker, Python, Bash, Elasticsearch, JavaScript, Kubernetes, C#, Postgres, Redis, CI/CD, Terraform

Requirements:
  • 5+ years working with cloud platforms (AWS, GCP).
  • Experience managing large-scale production infrastructure.
  • Strong focus on reliability, performance, and security of distributed systems.
  • Experience with CI/CD pipelines (CircleCI, Jenkins, GitHub Actions).
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Familiarity with Infrastructure as Code (Terraform).
  • Experience with scripting and automation tools (Chef, Puppet, Ansible, Bash).
  • Strong programming skills in Java, Python, C#, or JavaScript/TypeScript.
  • Database administration (Postgres, RDS Aurora, Elasticsearch, Redis).
Responsibilities:
  • Shape the future of Air's infrastructure as an Infrastructure Engineer.
  • Design and build the foundation that powers product experiences.
  • Collaborate with teams to provide a secure, robust, and scalable platform.
  • Mentor junior engineers and contribute across the stack.
Related Jobs

📍 United States of America

🧭 Full-Time

🏢 Company: computer_aid

  • Proven experience with Kubernetes in a production environment.
  • Strong understanding of containerization and orchestration technologies.
  • Hands-on experience with AWS, Azure, and GCP.
  • Proficiency with Infrastructure-as-Code (IaC) tools like Terraform or Pulumi.
  • Experience with CI/CD pipelines and GitOps practices.
  • Knowledge of Kubernetes security best practices.
  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, and EFK stack.
  • Strong scripting and automation skills.
  • Excellent problem-solving and troubleshooting abilities.
  • Strong communication and documentation skills.
  • Analyze existing services for effective containerization on Kubernetes.
  • Design and implement Kubernetes cluster architecture for scalability and reliability.
  • Develop multi-cloud deployment strategy for Kubernetes across various platforms.
  • Migrate legacy applications to containerized environments.
  • Optimize container images and manage the container lifecycle.
  • Create reusable Kubernetes deployment templates.
  • Implement best practices for Kubernetes security.
  • Maintain monitoring and observability systems, and automate Kubernetes provisioning.

AWS, Docker, GCP, Kubernetes, Azure, Grafana, Prometheus, CI/CD, Terraform

Posted 3 days ago

📍 California, Colorado, Hawaii, Illinois, Maryland, Massachusetts, New York, Oregon, Texas, Washington

🧭 Full-Time

💸 160,000 - 180,000 USD per year

🔍 Recruiting and feedback tools

🏢 Company: Textio | 👥 51-100 | 💰 $999,972 about 3 years ago | 🫂 Last layoff 11 months ago | Artificial Intelligence (AI), Human Resources, Machine Learning, Enterprise Software, Natural Language Processing, Software

  • A solid foundation in modern computing concepts, built through education or practical experience.
  • Extensive experience in SRE and DevOps (particularly supporting SaaS products), including proficiency with AWS services, Linux administration, Docker, and with building resilient cloud infrastructure.
  • Expertise writing maintainable scripts with Terraform, CDK, and shell scripts, and with languages like Python, Go, etc.
  • Versatile communication skills for diverse audiences and a collaborative, team-oriented approach.
  • Experience establishing GitOps practices and configuring CI/CD pipelines, particularly with CircleCI.
  • Model strong customer focus, communication skills, and empathy, bringing leadership experience to support business initiatives and goals.
  • Have a low ego but a strong point of view.
  • Prior security experience (Key Management, Encryption, etc.).
  • Employing automation to streamline operations and to proactively prevent and repair service disruptions.
  • Collaborating with other teams to align infrastructure with product, operational, and compliance objectives.
  • Staying informed about industry trends and technologies, and introducing new solutions for ongoing improvement.
  • Identifying, building, and guiding major initiatives from conception to launch.
  • Advancing CI/CD pipelines and championing a DevOps culture, focusing on practices like Infrastructure as Code (IaC), proactive monitoring, and effective alerting.
  • Ensuring operational environments effectively serve engineering teams and strategically contain cloud expenditures.
  • Defining and delivering critical infrastructure support and meticulously documenting designs and procedures to promote collective progress.
  • Participating in an on-call rotation that prioritizes a healthy work-life balance.

AWS, Docker, Python, Go, CI/CD, Linux, DevOps, Terraform

Posted 5 days ago

📍 Bay Area, NYC

🧭 Full-Time

💸 125,000 - 225,000 USD per year

🔍 AI and open-source software

🏢 Company: Arize AI | 👥 51-100 | 💰 $38,000,000 Series B over 2 years ago | Artificial Intelligence (AI), Machine Learning, Information Technology, Software

  • 3+ years of experience building scalable infrastructure and developer tools, ideally in an open-source context.
  • Strong understanding of the open-source development lifecycle and community dynamics.
  • Expertise in Kubernetes, Terraform, and other modern infrastructure tools.
  • Familiarity with build tools like Bazel and CI/CD systems.
  • A pragmatic approach to solving problems and prioritizing impactful solutions over trends.
  • Experience working with AI frameworks, LLM integrations, or observability platforms is a plus.
  • Familiarity with Go, Python, and some TypeScript is a plus.
  • Collaborate with internal teams and the open-source community to architect and scale infrastructure that supports Arize Phoenix's growth.
  • Enable and enhance Phoenix's SaaS capabilities by building out features such as central authentication, data retention, and capacity scaling.
  • Build out a robust set of infrastructure tools and services that eases the operation of Arize's product offering.

Python, Kubernetes, TypeScript, Go, Terraform

Posted 7 days ago

📍 California, New York, Florida, Massachusetts, Arkansas, United States

🧭 Full-Time

🔍 Software Development

🏢 Company: Laravel | 👥 11-50 | 💰 $57,000,000 Series A 5 months ago | Developer Tools, Web Development, Enterprise Software, Software

  • 5+ years of experience in similar roles, ideally with large-scale production systems.
  • Mastery of AWS and potentially other cloud providers, with experience in multi-account setups across regions.
  • Experience with Kubernetes (K8s) and container orchestration.
  • Experience building and maintaining Go-based K8s operators using tools like Kubebuilder and Operator SDK.
  • Experience orchestrating databases with Kubernetes, especially Percona XtraDB Cluster (PXC) and managing Postgres and Redis databases.
  • Comfortable using IaC in a team and enforcing policies.
  • Adaptable to a fast-paced, evolving, all-remote environment.
  • Join the Infrastructure team and work on the Laravel Cloud platform, a fully managed infrastructure platform for developers.
  • Focus on improving developer experience by building solid infrastructure.
  • Collaborate with team members to tackle infrastructure challenges.
  • Research and implement best solutions while documenting processes for remote teamwork.
  • Engage in programming and possibly building custom tools and integrations.

AWS, Kubernetes, Go, Postgres, Redis

Posted 8 days ago

📍 California, New York, Florida, Massachusetts, Arkansas, United States

🔍 Cloud infrastructure

🏢 Company: Laravel | 👥 11-50 | 💰 $57,000,000 Series A 5 months ago | Developer Tools, Web Development, Enterprise Software, Software

  • 5+ years of experience in similar roles, preferably working closely with large-scale production systems.
  • Mastery of AWS and potentially other cloud providers.
  • Experience with large AWS environments and customer data in multi-account setups across regions.
  • Experience with Kubernetes (K8s) and container orchestration.
  • Experience building and maintaining Go-based K8s operators using tools such as Kubebuilder and Operator SDK.
  • Experienced with logging, monitoring, alerting tooling, and processes.
  • Comfortable using IaC in a team, enforcing policies.
  • Knowledge of network protocols, load balancing, caching, and DNS management.
  • Ability to thrive in a fast-paced, evolving all-remote environment.
  • Work together with a world-class team, touching the developer experience of hundreds of thousands of developers around the world.
  • Work on our most ambitious project yet, the recently announced Laravel Cloud platform.
  • Spend time researching the best solutions.
  • Value the collaborative process, often pair programming or holding quick huddles to tackle tricky problems.
  • Enjoy programming and build custom tooling and integrations.
  • Take pride in documenting for ourselves and for our users.
  • Flexible hours and minimum bureaucracy.

AWS, Docker, Cloud Computing, Kubernetes, Go, Microservices

Posted 8 days ago

📍 USA, Canada

💸 200,000 - 225,000 USD per year

🔍 Telecom

🏢 Company: Tucows Inc.

  • 3+ years in AI/ML deployment, solution engineering, or similar roles.
  • Proficiency with Apache Kafka for managing and scaling event-stream pipelines.
  • Familiarity with LangChain, TensorFlow, PyTorch, and cloud platforms like AWS, Azure, and Google Cloud.
  • Experience with RESTful APIs, microservices, and container technologies like Docker and Kubernetes.
  • Strong data engineering practices and experience in handling unstructured data.
  • Deploy AI and machine learning models to drive insights from telecom event streams.
  • Develop and maintain Kafka pipelines for real-time data processing.
  • Manage and optimize model training, tuning, and retraining workflows.
  • Collaborate with data engineers and product teams on data needs and model integration.
  • Implement monitoring solutions for model performance and resolve issues proactively.
  • Document processes, configurations, and best practices for knowledge sharing.

AWS, Docker, Kubernetes, Machine Learning, PyTorch, Apache Kafka, Azure, Data engineering, TensorFlow, CI/CD, RESTful APIs, Microservices

Posted 8 days ago

📍 United States

🧭 Full-Time

💸 70,000 - 80,000 USD per year

🔍 Legal and accounting technology

🏢 Company: Caret | 👥 1-10 | 💰 $1,291,130 Seed about 4 years ago | PropTech, Commercial Real Estate, SaaS, Apps, Property Management

  • Proficient in comprehending system architecture and the interplay between servers, databases, APIs, load balancers, firewalls, and networking.
  • Design, implement and own automation to reduce future manual work for projects and tasks using proven scripting practices.
  • Proficient in backup processes, including VM snapshots, OS-level backups, and SAN snapshots.
  • Proficient in state management systems, e.g., Ansible or Terraform.
  • Proficient with Active Directory Users and Computers, and Azure AD.
  • Advanced understanding of code management repositories.
  • Autonomous, with project planning skills.
  • Ability to troubleshoot and resolve complex issues in a timely manner.
  • Building, maintaining and supporting the virtualization environment through infrastructure-as-code, scripting, and automation methods.
  • Monitoring and logging of the various infrastructure and network components to provide a reliable environment for clients.
  • Provide backup and retention of critical client data and system components.
  • Work with client onboarding and client success managers to define, build, and deliver environments suited to the client needs.
  • Design, implement, and support all networking-related components, including switches, routers, firewalls, VPN, and iSCSI.
  • Security planning, execution and monitoring.
  • Participate in the on-call rotation.

Cloud Computing, Microsoft Active Directory, Microsoft Azure, Terraform, Networking, Troubleshooting, Ansible, Scripting

Posted 15 days ago

📍 United States

🧭 Contract

💸 50 - 60 USD per hour

🔍 Cloud Infrastructure

🏢 Company: Third Eye Software | 👥 11-50 | Consulting, Information Technology, Recruiting, Software

  • 3-5 years of hands-on professional experience in a Cloud, Infrastructure, or Systems Engineering role.
  • Proficiency with Google Cloud Platform (GCP) services, including deployment and management of resources.
  • Expertise with Kubernetes for deploying, managing, and maintaining production clusters.
  • Strong proficiency with Terraform for infrastructure-as-code practices.
  • Experience with monitoring and logging tools such as Prometheus and Grafana.
  • Familiarity with CI/CD tools like GitHub Actions.
  • Knowledge of networking concepts and protocols such as network setup, IPs, and namespaces.
  • Strong problem-solving skills and attention to detail.
  • Outstanding communication skills for effective teamwork.
  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • Design, set up, and maintain cloud-based infrastructure including clusters, namespaces, networks, and IP management.
  • Support the development and optimization of internal tools, improving developer onboarding and automating workflows.
  • Contribute to backend automation, CI/CD pipelines, and tools to enhance productivity and reliability.
  • Work closely with cross-functional teams to address technical challenges and support project deliverables.
  • Provide expertise in GCP deployments and ensure smooth migration processes.
  • Troubleshoot and resolve issues with GCP services, Kubernetes deployments, Terraform configurations, and other cloud technologies.
  • Create and maintain documentation for best practices, troubleshooting procedures, and internal training.
  • Collaborate with team leads to align infrastructure strategies with project goals.

GCP, Kubernetes, Grafana, Prometheus, CI/CD, Terraform, Networking

Posted 17 days ago

📍 United States of America

🧭 Full-Time

💸 130,295 - 260,590 USD per year

🔍 Healthcare

  • 7+ years experience managing expansive data platforms like Splunk and Clickhouse.
  • 6+ years mastering high-volume data pipelines with tools such as Vector, Cribl, and Confluent.
  • Strong understanding of contemporary data modeling and architecture.
  • Proven collaboration skills across different teams.
  • Exceptional problem-solving abilities in a healthcare IT environment.
  • Excellent communication skills to convey technical data solutions to diverse audiences.
  • Experience with project management, CI/CD pipelines, and GitHub.
  • Proficiency in query languages like SPL2 and programming with Python or Java.
  • Architect and cultivate a scalable observability data platform using tools like Splunk and Clickhouse.
  • Innovate and refine enterprise data models to boost performance and reliability.
  • Support data management policies and the data lifecycle.
  • Enhance data integrity through robust governance processes.
  • Ensure compliance with regulations regarding data security.
  • Develop sophisticated data pipelines for data collection and processing.
  • Optimize data flows and long-term storage strategies.
  • Collaborate with various IT teams for a unified operational data view.
  • Drive enhancements in data platform architecture and security measures.

AWS, Python, ETL, Java, Kafka, Clickhouse, Data engineering, CI/CD, Data modeling, Data management

Posted 23 days ago

📍 USA

🧭 Full-Time

💸 136,000 - 170,000 USD per year

🔍 Crypto and Web3

🏢 Company: Gemini | 👥 501-1000 | 💰 $1,000,000 Secondary Market over 2 years ago | 🫂 Last layoff about 2 years ago | Cryptocurrency, Web3, Financial Services, Finance, FinTech

  • Bachelor's degree in a technical field or 4-8 years of experience in a DevOps-focused IT/infrastructure role.
  • Strong experience managing macOS endpoints and familiarity with Linux principles.
  • Demonstrated experience with Infrastructure as Code tools (e.g., Terraform).
  • Strong understanding of AWS services and cloud-native operations.
  • Solid CI/CD pipeline experience and version control with Git.
  • Proficiency in scripting and programming languages (e.g., Python, Go, Swift).
  • Understanding of identity and access management technologies and authentication protocols.
  • Detail-oriented with excellent communication and documentation skills.
  • Proactive self-starter able to identify and implement solutions.
  • Build, maintain, and improve internal infrastructure using DevOps methodologies.
  • Integrate and automate workflows across various SaaS platforms.
  • Design and implement CI/CD pipelines for automated deployments.
  • Develop internal tools and scripts to manage global device fleet.
  • Collaborate with support teams and serve as an escalation point.
  • Engineer and maintain integrations with AWS services.
  • Support identity management and automate user management.
  • Develop and maintain technical documentation and FAQs.
  • Partner with cross-functional teams for continuous improvement.

AWS, Python, Swift, Go, REST API, CI/CD, Linux, Terraform, Ansible

Posted 23 days ago