Senior Infrastructure Engineer

Posted 2024-11-14

💎 Seniority level: Senior, 4+ years experience

🔍 Industry: Analytics Engineering

🗣️ Languages: English

⏳ Experience: 4+ years experience

Requirements:
  • Experience with AWS, Azure, or GCP, and familiarity with tools like Terraform and Kubernetes.
  • Solid experience with Infrastructure as Code, preferably with Terraform.
  • Ability to work asynchronously as part of a remote team with excellent communication and writing skills.
Responsibilities:
  • Design, operate, and support infrastructure systems with single and multi-tenancy models across AWS and Azure.
  • Work with engineering teams to ensure consistent deployment of services.
  • Create a great developer experience in collaboration with partners in Architecture, SRE, Release Engineering, and Security.
  • Participate in a balanced on-call rotation and work on continuous improvement initiatives.

Related Jobs

📍 CA, CO, DC, FL, IL, MD, NJ, MA, NY, PA, SC, TX, VA

🧭 Full-Time

💸 120000 - 145000 USD per year

🔍 Credit Rating Agency

🏒 Company: KBRA

Requirements:
  • 7+ years of Systems/Infrastructure Engineering experience required.
  • Computer Science degree or similar work experience required.
  • Expert level knowledge of Windows Server 2016/2019/2022.
  • Expert level knowledge of Windows Active Directory, DNS, DFS, DHCP.
  • Expert knowledge of VMware vSphere 7/8.
  • Experience with enterprise and remote site backup solutions.
  • Knowledge of enterprise SAN technologies required.
  • Proficiency in automation/scripting, preferably PowerShell.
  • Advanced experience working with Public Cloud (AWS/Azure).
  • Advanced knowledge of Windows Certificate Authority infrastructure management.

Responsibilities:
  • Provide architecture/design and operational support for existing infrastructure.
  • Contribute to the development and implementation of projects.
  • Collaborate with various technology teams.
  • Monitor computing platforms proactively.
  • Perform system builds, vulnerability remediation, and system patching.
  • Create comprehensive support documentation.
  • Provide technical leadership and mentor team members.
  • Participate in on-call rotations.

Collaboration, Mentoring, Attention to detail, Documentation

Posted 2024-11-23

🏒 Company: Aretum

Posted 2024-11-21

📍 Romania

🧭 Full-Time

🔍 Analytics Engineering

🏒 Company: dbt Labs

Requirements:
  • Experience with AWS, Azure, GCP, Terraform, Kubernetes, Python, and Bash.
  • Solid experience with declarative Infrastructure as Code, ideally with Terraform or willingness to learn.
  • Experience working asynchronously in a fully-remote, distributed team with excellent communication and writing skills.

Responsibilities:
  • Design, operate, and support infrastructure systems with parity across tenancy models and public clouds.
  • Work with engineering teams to ensure consistent service deployment.
  • Create a great developer experience in collaboration with Architecture, SRE, Release Engineering, and Security teams.
  • Participate in a balanced on-call rotation and help improve tooling.

AWS, Python, Bash, Kubernetes, Azure, Terraform

Posted 2024-11-14

📍 United States

🔍 Software

🏒 Company: Hashgraph

Requirements:
  • 8+ years of experience monitoring, deploying, and patching Linux-based server infrastructure.
  • 5+ years of experience managing and maintaining bare metal Kubernetes clusters.
  • 5+ years of experience managing, maintaining, and deploying mission-critical network & server infrastructure.
  • 3+ years of experience with Zero-Trust Privileged Access Management solutions such as Teleport or StrongDM.
  • 3+ years of experience writing network infrastructure documentation including diagrams and policies.
  • 2+ years of experience with network infrastructure design and architecture.
  • 2+ years of experience with IPv4/IPv6 address space planning and management.
  • 2+ years of experience planning and deploying routing protocols (e.g., OSPF, BGP).
  • 2+ years of experience planning and deploying bare metal configuration management solutions.
  • 2+ years of experience writing effective design and process documentation.
  • Self-motivated with excellent communication, organizational, and leadership skills.
  • Experience in Iterative and Incremental Engineering Practices.
  • Cisco CCNP, CCDE, Juniper JNCIA-ENT, or JNCIA-DC certifications recommended.
  • Bachelor's degree in Computer Science or equivalent work experience.

Responsibilities:
  • Managing, monitoring, and maintaining a fleet of 170+ bare metal servers.
  • Developing and maintaining Kubernetes clusters for CI/CD workflows and release management automation.
  • Ensuring the integrity and security of bare metal server deployments.
  • Developing and maintaining a scalable network and server infrastructure.
  • Maintaining network/server automation tools for the bare metal fleet.
  • Collaborating with IT and Security stakeholders for security auditing and threat mitigation.
  • Working with DevOps and Software Engineering stakeholders to align strategies and execution.
  • Ensuring product releases meet business goals.

Leadership, Cisco, Kubernetes, Strategy, Release Management, CI/CD, Linux, DevOps, Documentation

Posted 2024-11-07

📍 Georgia

🧭 Full-Time

🔍 Integration and automation

🏒 Company: Workato

Requirements:
  • 7+ years of professional experience in hands-on engineering roles (DevOps/SRE).
  • 1+ year of experience with hosting AI models (MLflow, AWS SageMaker, Azure AI, Kubernetes).
  • 1+ year of experience with MLOps (MLflow, vector databases, Dagster).
  • Strong experience managing Kubernetes clusters and workloads, specifically using EKS.
  • Proficiency in Python and ability to program in other languages such as Go, Ruby, or JavaScript.
  • Experience creating scalable development and integration pipelines using CI/CD tools.
  • Expertise in deploying Kubernetes-based services using tools like Kustomize, Helm, ArgoCD.
  • Hands-on experience with cloud-based architectures, particularly AWS.
  • Strong understanding of networking fundamentals and web services architecture.
  • Experience managing complex infrastructure using Infrastructure as Code tools.
  • Hands-on experience with containers and related technologies.
  • Familiarity with software packaging tools, testing, and security validation tools.
  • Experience operating Kubernetes clusters in compliance-regulated environments.
  • Experience with cloud and infrastructure security regulations and compliance programs.

Responsibilities:
  • Deploy, scale, and maintain services for the ML/AI team.
  • Collaborate closely with ML Engineers and Data Scientists.
  • Directly impact modernization and maturation of the platform including infrastructure architecture decisions.

DevOps

Posted 2024-11-07

📍 USA, UK, Germany, France, Canada, India, Chile

🧭 Full-Time

🔍 Automation and Cloud services

🏒 Company: Make

Requirements:
  • At least 5 years of experience in managing Linux/Unix-based infrastructure.
  • Knowledge of at least one cloud provider, ideally AWS.
  • Day-to-day experience with container orchestration platforms, preferably Kubernetes.
  • Proficiency in Infrastructure as Code tools such as Terraform and Ansible, maintaining versioned code.
  • Experience with CI/CD tools like ArgoCD and GitHub Actions, and various deployment strategies.
  • Participate in on-call rotations for incident response.
  • Understanding of SLI, SLO, and SLA for service reliability.
  • Effective communication skills in English.
  • Willingness to mentor and share knowledge.
  • Desire for continuous education and improvement.
  • Experience with troubleshooting and debugging.
  • Working knowledge of programming/scripting languages like Python, Go, or Java.

Responsibilities:
  • Design, build, and maintain a scalable & resilient infrastructure on AWS.
  • Follow the Infrastructure as Code principle to version changes and promote reuse.
  • Build and manage cloud infrastructure using Terraform for scalability and security.
  • Evolve and maintain Kubernetes clusters to ensure uninterrupted service.
  • Implement and maintain observability and monitoring frameworks.
  • Educate others on technologies like Kubernetes, Docker, and more.
  • Contribute to service blueprints with infrastructure best practices.
  • Test and enhance system reliability, removing impediments for improvements.
  • Collaborate with the Security team for infrastructure compliance.
  • Design, deploy, and support automation of continuous deployment tooling.
  • Participate in on-call rotations for incident response.
  • Debug production issues across services.

AWS, Docker, Node.js, PostgreSQL, Python, Elasticsearch, Kubernetes, RabbitMQ, Go, Redis, Communication Skills, CI/CD, Terraform

Posted 2024-11-07

📍 United States

🏒 Company: AssistRx

Requirements:
  • 7+ years of experience as a Linux Engineer or Systems Administrator in a production environment.
  • Proven experience with Linux systems, virtualization technologies, and cloud platforms.
  • Deep knowledge of Linux operating systems, especially Red Hat and CentOS.
  • Strong knowledge of networking technologies and devices.
  • Proficiency in scripting tools like Bash, Python, and PowerShell.
  • Knowledge of configuration management and Infrastructure as Code practices.

Responsibilities:
  • Design and deploy Linux-based infrastructure to support business applications, ensuring scalability, performance, and security.
  • Manage and monitor Linux and Windows servers, ensuring optimal performance and uptime.
  • Implement and maintain security measures across the environment.
  • Collaborate with cross-functional teams to integrate Linux/Windows systems.
  • Provide advanced technical support for production issues.
  • Mentor junior team members and document infrastructure designs.

Leadership, Python, Bash, Cloud Computing, *Nix, Communication Skills, Analytical Skills, Collaboration

Posted 2024-10-25

📍 Dubai, London

🔍 Data Infrastructure

🏒 Company: Eqvilent

Requirements:
  • 3+ years in a similar role.
  • Proven experience with AWS or other cloud providers.
  • Experience with distributed systems (e.g. Apache Kafka, Apache Airflow, Apache Hadoop).
  • Proficiency with Terraform.
  • Extensive experience with Docker and Kubernetes, including cluster setup, node pools, and Helm charts.
  • Experience with CI/CD tools (e.g. GitLab CI, Jenkins).
  • Familiarity with observability tools such as Prometheus, Grafana, ELK stack.
  • Solid understanding of networking, security, and system architecture.
  • Strong scripting skills (e.g., Python, Bash).
  • Excellent problem-solving skills, communication, and collaboration abilities.

Responsibilities:
  • Design, implement, and maintain both cloud and on-premise compute and storage infrastructure.
  • Set up and manage Kubernetes clusters, implement Helm charts, ensuring high availability and performance.
  • Set up, maintain, and scale distributed systems (e.g. Apache Kafka, Apache Airflow) ensuring data integrity and security.
  • Automate code delivery processes and implement CI/CD, monitoring, logging, and alerting solutions.
  • Collaborate with development and operations teams, provide production support, and participate in on-call rotations.

AWS, Docker, Python, Apache Airflow, Apache Hadoop, Bash, Jenkins, Apache Kafka, Kubernetes, Grafana, Prometheus, Collaboration, CI/CD, Terraform

Posted 2024-10-21

📍 Canada

🔍 AI-powered Fraud and Risk Platform

🏒 Company: DataVisor

Requirements:
  • BS in computer science or computer engineering required.
  • Solid background in computer systems, including operating systems, networking, databases, and distributed systems.
  • Past experience with hyper-scalable infrastructure is a strong plus.
  • 5+ years of industry experience.
  • 5+ years of programming experience in Java, Go, or Python.
  • 1+ years of experience with Kubernetes.
  • 3+ years of experience with big data infrastructure such as Spark, NoSQL databases, and real-time streaming pipelines.
  • Preferred: contribution to open source big data infrastructure.
  • Preferred: familiarity with cloud platforms such as AWS, GCP, Azure, or Alibaba Cloud.
  • Past Site Reliability Engineering experience is also a plus.
  • Strong culture match with a fast-growing, agile startup environment: passionate, collaborative, fast-iterating, and hardworking.

  • We are looking for a senior-level infrastructure developer with a strong background in building and supporting hyper-scalable infrastructure.
  • DataVisor supports a wide variety of clouds, including AWS, Azure, GCP, and AliCloud, as well as some of the largest on-premise environments, all on top of Kubernetes with the latest big data and parallel computing technology.
  • The ideal candidate is a domain expert in this area with a strong passion for combining advanced machine learning, including unsupervised machine learning, with hyper-scalable computation infrastructure to build a next-generation solution that supports billions of sophisticated real-time feature computations and unsupervised learning decisions.
  • To succeed in this role, you should have solid experience working with technical teams to understand their technologies and computation requirements, and with business teams to understand product vision and value.

AWS, Python, Agile, GCP, Java, Kubernetes, Machine Learning, Azure, Go, NoSQL, Spark

Posted 2024-10-15

📍 United States

🧭 Full-Time

💸 83000 - 125000 USD per year

🔍 IT Infrastructure

🏒 Company: Sinch

Requirements:
  • Technical expert with excellent communication skills and a methodical work approach.
  • At least 5 years of professional experience in IT, specifically in IT infrastructure and engineering.
  • Experience with IT infrastructure, networks, applications engineering, Linux server, database & cloud engineering, Windows server, and IT security standards.
  • Relevant infrastructure certifications (e.g., CCNA, CCNP) and familiarity with Cisco Meraki, Cisco AnyConnect, and Infrastructure as Code are a plus.

Responsibilities:
  • Proactively contribute to architecture strategy and principles with the Enterprise Architecture Team.
  • Administer and configure system and infrastructure technologies such as VPN, Office Network, Linux/Windows servers, and cloud infrastructure (AWS, Azure).
  • Advise application owners and business stakeholders on infrastructure-related requirements and provide training to system users.
  • Provide 2nd and 3rd level support for system and infrastructure issues while participating in the on-call rotation.

AWS, Strategy, Azure, Communication Skills

Posted 2024-10-15
