Apply

Infrastructure Engineer

Posted 2024-10-22

View full description

πŸ’Ž Seniority level: Seasoned

πŸ“ Location: United States

πŸ’Έ Salary: 100000 - 120000 USD per year

πŸ” Industry: SaaS, Cloud-Based Managed File Transfer

🏒 Company: Files.com

⏳ Experience: Seasoned

πŸͺ„ Skills: AWSElasticSearchRubyRuby on RailsElasticsearchGoRedisTerraform

Requirements:
  • Strong background in building internal tools.
  • Ability to prototype features.
  • Comfort in investigating issues to discover root causes.
  • Desire to build solutions and tools rather than maintaining existing ones.
Responsibilities:
  • Take the lead on helping build internal tools and cutting edge features into the Files.com product.
  • Drive value for the 4,000 businesses that use the platform.
  • Work with an API that receives hundreds of requests per second.
  • Build prototypes and tackle deep-rooted issues.
Apply

Related Jobs

Apply

πŸ“ United States

πŸ” Mental health care technology

  • 5+ years of industry experience building production-level ML platforms and infrastructure.
  • Ability to write high-quality code in Python, Java, or Scala.
  • Experience building production-ready RESTful APIs and scaling platforms for large user bases.
  • Desire to own parts of an ML Platform with understanding of ML models and principles.
  • Experience with containers and deploying applications to Kubernetes.
  • Familiarity with LLMs and building infrastructure for LLM applications.
  • Experience with relational and low-latency databases.
  • Experience in transforming data in batch and streaming contexts.
  • Ability to manage large projects from scoping to delivery.
  • Strong communication and organizational skills, with the ability to simplify complex problems.

  • Be part of a team building scalable infrastructure for training, evaluating, deploying, performing inference, and monitoring ML models.
  • Build, deploy, and maintain generative AI services and applications.
  • Create data systems to collect, clean, label, and store data used for model features.
  • Deploy and manage applications in Kubernetes clusters.
  • Collaborate with Machine Learning engineers to support experimentation platforms and training frameworks.
  • Work with stakeholders to address requirements for ML infrastructure.

AWSPythonJavaKubeflowKubernetesMachine LearningMLFlowPyTorchCommunication SkillsRESTful APIsOrganizational skills

Posted 2024-11-21
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 140000 - 180000 USD per year

πŸ” Fintech, specialty finance

🏒 Company: Libertas Funding

  • Experience configuring monitoring / alerting different attributes in AWS.
  • Proven experience implementing infrastructure as code (IaC) using Terraform / Ansible.
  • 7+ years of software engineering (4+ as data engineer) experience designing, developing, and delivering application solutions for enterprise-level development and integration projects.
  • Strong communication skills and previous experience working with cross-functional business groups.
  • Experience with maintaining, updating, and integrating data pipelines and data warehouses.
  • Proficiency in capture and maintenance of data in SQL and NoSQL databases.
  • Proficiency in fast-paced software engineering team, following software engineering development cycles.
  • Strong sense of agency and self-initiative / intrinsic motivation to push projects forward, learn new tech & tools.
  • Process-oriented, detail-oriented, and analytically oriented mindset.
  • Natural enjoyment from solving complex problems with a methodical approach.
  • Strong analytical and problem-solving skills with a focus on solving business problems.
  • Experience working in Agile software development processes.
  • Deep knowledge of Continuous Integration and Delivery and how to build deployment pipelines and infrastructure.
  • Experience with CloudWatch, IAM, Lambda.
  • Terraform, Cloud Formation, Puppet, or other Infrastructure as Code (IaC) technologies.
  • Proficiency in Git, Git Actions, and Git workflows.
  • Experience with logging and monitoring tools.
  • Strong development skills while implementing best practices in development to prevent breaking changes.
  • Knowledge of data governance best practices, including data quality/integrity and privacy.

  • Scope, design, and implement repeatable automation of the data, build, test, deploy and release processes.
  • Automate the timely and accurate delivery of reporting and analytics data.
  • Build and maintain robust, observable (ETL) data pipelines.
  • Update and maintain AWS cloud and on-premises data warehouse configuration and components.
  • Investigate and remediate technical issues.
  • Create and maintain documentation as it relates to system configuration, mapping, processes, and service records.
  • Build and maintain continuous integration and continuous delivery (CI/CD) technologies.
  • Provide technical expertise at building and managing infrastructure as code (IaC).
  • Maintain performance and SLA metrics for build/release systems.
  • Collaborate within the development team to enable the team to meet goals and objectives.
  • Proactively document and communicate knowledge to enable others to quickly solve the next challenge.
  • Adhere to compliance procedures and internal/operational risk controls.

AWSDockerPythonSoftware DevelopmentSQLAgileBusiness IntelligenceETLGitKubernetesMySQLData engineeringData sciencePostgresServerlessNosqlCommunication SkillsCI/CDDevOpsTerraformDocumentationCompliance

Posted 2024-11-15
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 128000 - 176000 USD per year

πŸ” Cybersecurity

🏒 Company: SentinelOne

  • Bachelor’s degree in an IT-related field or equivalent work experience.
  • 7+ years of experience in building and managing networks in multi-geo enterprise organizations.
  • Expertise in deploying and managing network switches and firewalls like Juniper, Palo Alto, or similar hardware.
  • Expertise in deploying, configuring, and supporting zero-trust networks.
  • Proficiency in various operating systems, network software, and utility programs.
  • Knowledge of risk assessment tools, technologies, and methods.
  • Advanced skills in scripting/programming in Powershell/Python or other languages.
  • Advanced troubleshooting skills in Infrastructure devices and applications areas.
  • Working knowledge of AWS cloud services.
  • Experience in managing Linux and VMware is a plus.
  • Strong understanding of ITIL and change management.
  • Experience working with NIST/FedRAMP controls a plus.
  • Ability to write detailed documentation based on regulatory requirements.
  • Excellent written and verbal communication skills.
  • Self-starter and a team player.
  • Ability to multitask and manage priorities.
  • Able to interact with senior leadership and provide executive support.
  • Strong desire to learn and implement new technologies.

  • Install, configure, and manage network switches, firewalls, routers, and WiFi access points.
  • Manage and support internet and intranet connectivity issues between offices and multi-cloud environments.
  • Deploy and support network and network security services.
  • Keep network and application infrastructure secured and patched.
  • Manage SSO integrations with multiple platforms and applications.
  • Maintain detailed documentation and change records.
  • Work with Dell servers and VMware virtualization.
  • Actively participate in the on-call program.

AWSLeadershipPythonCybersecurityCommunication SkillsCollaborationLinuxDocumentation

Posted 2024-11-15
Apply
Apply

πŸ“ Los Angeles, Boston

🧭 Full-Time

πŸ’Έ 150000 - 175000 USD per year

πŸ” Healthcare

🏒 Company: Heyday Health

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 5+ years of experience in infrastructure engineering, with a focus on AWS and IaC using tools like AWS CDK.
  • Demonstrated experience in using ECS, Lambda, and RDS in a production environment.
  • Familiarity with healthcare industry standards and compliance requirements, particularly HIPAA.
  • Strong analytical and problem-solving abilities with a track record of delivering scalable infrastructure.
  • Excellent communication skills and a team-oriented approach to project management.

  • Support Software Development Lifecycle by owning and improving CI/CD pipelines and git workflows.
  • Develop and maintain infrastructure using AWS CDK, ensuring all cloud resources are version-controlled.
  • Build and scale applications using AWS ECS, Lambda, and RDS.
  • Implement monitoring solutions to track health, performance, and security.
  • Ensure infrastructure complies with healthcare regulations, including HIPAA.
  • Automate deployment processes and enhance system scalability and uptime.
  • Collaborate with a small team to deploy impactful solutions.

AWSSoftware DevelopmentAgileGitServerlessCommunication SkillsCollaborationCI/CDCompliance

Posted 2024-11-14
Apply
Apply

πŸ“ United States

πŸ” Cybersecurity

NOT STATED

  • Advising clients on cybersecurity challenges.
  • Assessing current cybersecurity measures.
  • Automating solutions to enhance security processes.

AWSDockerLeadershipAgileBashCloud ComputingCybersecurityElasticSearchGitJavaJavascriptJenkinsKubernetesMicrosoft Azure*NixJavaScriptCross-functional Team LeadershipAmazon Web ServicesAzureElasticsearchREST APICommunication SkillsAnalytical SkillsCollaborationCI/CDProblem SolvingAgile methodologiesRESTful APIsLinuxDevOpsAttention to detailOrganizational skillsWritten communication

Posted 2024-11-13
Apply
Apply

πŸ“ United States

πŸ” Software

🏒 Company: Hashgraph

  • 8+ years of experience monitoring, deploying, and patching Linux-based server infrastructure.
  • 5+ years of experience managing and maintaining bare metal Kubernetes clusters.
  • 5+ years of experience managing, maintaining, and deploying mission-critical network & server infrastructure.
  • 3+ years of experience with Zero-Trust Privileged Access Management solutions such as Teleport or StrongDM.
  • 3+ years of experience writing network infrastructure documentation including diagrams and policies.
  • 2+ years of experience with network infrastructure design and architecture.
  • 2+ years of experience with IPv4/IPv6 address space planning and management.
  • 2+ years of experience planning and deployment of routing protocols (eg: OSPF, BGP).
  • 2+ years of experience planning and deploying bare metal configuration management solutions.
  • 2+ years of experience writing effective design and process documentation.
  • Self-motivated with excellent communication, organizational, and leadership skills.
  • Experience in Iterative and Incremental Engineering Practices.
  • Cisco CCNP, CCDE, Juniper JNCIA-ENT, or JNCIA-DC certifications recommended.
  • Bachelor’s degree in Computer Science or equivalent work experience.

  • Managing, monitoring, and maintaining a fleet of 170+ bare metal servers.
  • Developing and maintaining Kubernetes clusters for CI/CD workflows and release management automation.
  • Ensuring the integrity and security of bare metal server deployments.
  • Developing and maintaining a scalable network and server infrastructure.
  • Maintaining network/server automation tools for the bare metal fleet.
  • Collaborating with IT and Security stakeholders for security auditing and threat mitigation.
  • Working with DevOps and Software Engineering stakeholders to align strategies and execution.
  • Ensuring product releases meet business goals.

LeadershipCiscoKubernetesStrategyRelease ManagementCI/CDLinuxDevOpsDocumentation

Posted 2024-11-07
Apply
Apply

πŸ“ USA, UK, Germany, France, Canada, India, Chile

🧭 Full-Time

πŸ” Automation

🏒 Company: Make

  • At least 5 years of experience in managing and operating Linux/Unix-based infrastructure.
  • Knowledge of at least one cloud provider, ideally AWS.
  • Day-to-day experience with a container orchestration platform, preferably Kubernetes.
  • Proficiency in Infrastructure as Code practices and tools such as Terraform.
  • Hands-on experience with CI/CD tools and various deployment strategies.
  • Understanding of Service Level Indicators, Objectives, and Agreements.
  • Effective communication skills in English.
  • Openness to knowledge sharing and mentoring.
  • Experience with troubleshooting and debugging issues.
  • Working knowledge of programming/scripting languages like Python or Go.

  • Design, build, and maintain a scalable & resilient infrastructure on AWS.
  • Follow the Infrastructure as Code principle to keep changes versioned.
  • Build and manage cloud infrastructure using Terraform.
  • Continuously evolve & maintain Kubernetes clusters.
  • Implement and consult on observability and monitoring framework.
  • Share knowledge in technologies like Kubernetes, Docker, and more.
  • Contribute to service blueprints.
  • Actively test and evolve system reliability.
  • Cooperate with Security on infrastructure compliance.
  • Design and support continuous deployment tooling.
  • Be on-call for incidents affecting availability.
  • Debug production issues across services.

AWSDockerNode.jsPostgreSQLPythonElasticSearchKubernetesRabbitmqElasticsearchGoPostgresRedisCommunication SkillsCI/CDTerraform

Posted 2024-11-07
Apply
Apply

πŸ“ United States

🏒 Company: AssistRx

  • 7+ years of experience as a Linux Engineer or Systems Administrator in a production environment.
  • Proven experience with Linux systems, virtualization technologies, and cloud platforms.
  • Deep knowledge of Linux operating systems, especially RedHat and CentOS.
  • Strong knowledge of networking technologies and devices.
  • Proficiency in scripting tools like Bash, Python, and PowerShell.
  • Knowledge of configuration management and Infrastructure as Code practices.

  • Design and deploy Linux-based infrastructure to support business applications, ensuring scalability, performance, and security.
  • Manage and monitor Linux and Windows servers, ensuring optimal performance and uptime.
  • Implement and maintain security measures across the environment.
  • Collaborate with cross-functional teams to integrate Linux/Windows systems.
  • Provide advanced technical support for production issues.
  • Mentor junior team members and document infrastructure designs.

LeadershipPythonBashCloud Computing*NixCommunication SkillsAnalytical SkillsCollaboration

Posted 2024-10-25
Apply
Apply

πŸ“ India, Portugal, UK, USA, Romania, Brazil

🧭 Full-Time

πŸ” Software Engineering

🏒 Company: Mindera

  • Expertise in GCP and understanding of networking, including service meshes (Istio) and multi-cluster setups.
  • Extensive experience with Kubernetes (GKE).
  • Proficiency in Terraform and Helm for IaC deployments.
  • Experience with GitLab CI/CD pipelines.
  • Proficient in Python and GoLang for scripting and automation.
  • Knowledge of security best practices, governance, and compliance standards.
  • Familiarity with cloud well-architected framework principles.
  • Experience in working with Architects.

  • Implement a hub and spoke model on GCP for multi-tenant environments.
  • Implement multi-region resilience, security, and cloud-agnostic infrastructure.
  • Set up GCP environments with networking architecture, IAM roles, and security configurations.
  • Deploy and manage GKE clusters, Istio service mesh, and K8s services.
  • Utilize Terraform, Helm, Argo CD, and GitLab CI/CD for IaC.
  • Ensure high availability, resilience, and optimal performance.
  • Maintain security best practices, compliance, and governance.
  • Facilitate knowledge transfer within teams and create comprehensive documentation.

LeadershipPythonAgileGCPKubernetesAzureGoGolangCI/CDTerraform

Posted 2024-10-24
Apply
Apply

πŸ“ Dubai, London

πŸ” Data Infrastructure

🏒 Company: Eqvilent

  • 3+ years in a similar role.
  • Proven experience with AWS or other cloud providers.
  • Experience with distributed systems (e.g. Apache Kafka, Apache Airflow, Apache Hadoop).
  • Proficiency with Terraform.
  • Extensive experience with Docker and Kubernetes, including cluster setup, node pools, and Helm charts.
  • Experience with CI/CD tools (e.g. GitLab CI, Jenkins).
  • Familiarity with observability tools such as Prometheus, Grafana, ELK stack.
  • Solid understanding of networking, security, and system architecture.
  • Strong scripting skills (e.g., Python, Bash).
  • Excellent problem-solving skills, communication, and collaboration abilities.

  • Design, implement, and maintain both cloud and on-premise compute and storage infrastructure.
  • Set up and manage Kubernetes clusters, implement Helm charts, ensuring high availability and performance.
  • Set up, maintain, and scale distributed systems (e.g. Apache Kafka, Apache Airflow) ensuring data integrity and security.
  • Automate code delivery processes and implement CI/CD, monitoring, logging, and alerting solutions.
  • Collaborate with development and operations teams, provide production support, and participate in on-call rotations.

AWSDockerPythonApache AirflowApache HadoopBashHadoopJenkinsKafkaKubernetesAirflowApache KafkaGrafanaPrometheusCollaborationCI/CDTerraform

Posted 2024-10-21
Apply