Apply

DevOps Engineer

Posted 2024-09-20

View full description

πŸ’Ž Seniority level: Experience as a DevOps Engineer or similar role

πŸ“ Location: United States

πŸ” Industry: AI technology, ecommerce, fashion

🏒 Company: SpreeAIπŸ‘₯ 11-50Artificial Intelligence (AI)E-CommerceMachine Learning

⏳ Experience: Experience as a DevOps Engineer or similar role

πŸͺ„ Skills: AWSDockerSoftware DevelopmentSQLCloud ComputingKubernetesMachine LearningAzureNosqlCollaborationCI/CDDevOps

Requirements:
  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience.
  • Experience as a DevOps Engineer or similar role with a strong portfolio of projects.
  • Strong experience with NVIDIA tools specifically, Kubernetes tools is highly preferred.
  • Strong experience working with machine learning and AI GPU hardware, infrastructure, and workflows.
  • Strong experience with cloud platforms such as AWS and Azure as well as experience in deploying and managing applications in a cloud environment.
  • Familiarity with databases (SQL and NoSQL), web servers, and UI/UX design.
  • Experience with GPUs and Lustre file systems inside Kubernetes.
  • Experience with Docker, Kubernetes, and CI/CD tools is highly desirable.
  • Excellent problem-solving skills, attention to detail, and the ability to work in a fast-paced, deadline-driven environment.
  • Strong communication and collaboration skills, with the ability to work effectively in a remote or onsite team setting.
  • Must provide link to GitHub account.
Responsibilities:
  • Manage GPU's and ML/AI workflows
  • Designs, implements, and maintains scalable tools and processes for quality and secure software development life cycle
  • Collaborate closely with development and design teams to identify and address errors
  • Implement robust DevOps services to power web applications, ensuring high performance, security, and scalability.
  • Apply best practices for continuous integration and continuous deployment (CI/CD) with a focus on automation and infrastructure as code.
  • Monitor, troubleshoot, and optimize application performance and reliability.
  • Continuously improve processes and tools to enhance productivity
  • Stay up-to-date with emerging trends and technologies in web development, cloud computing, and DevOps practices.
  • Ensure compliance with industry best practices and organizational policies
  • Contribute to a culture of technical excellence, teamwork, and continuous learning within the engineering team
Apply

Related Jobs

Apply
πŸ”₯ DevOps Engineer
Posted 2024-11-23

πŸ“ US or Canada

🧭 Full-Time

πŸ” Business-to-business (B2B) software-as-a-service (SaaS)

🏒 Company: Cordance

  • Bachelor's Degree or equivalent experience.
  • At least 4 years of professional DevOps experience.
  • At least 2 years of professional experience with Amazon Web Services.
  • At least 2 years of Unix or Linux system administration experience.
  • Direct experience implementing an IaC platform on AWS, ideally using Pulumi.
  • Experience with containerization technologies such as Docker/Kubernetes.
  • Exposure to the full stack development lifecycle.
  • Experience with Jenkins is a plus.

  • Take ownership of our AWS infrastructure and deployment processes.
  • Architect and implement an infrastructure-as-code solution for the full eRezLife stack.
  • Analyze the current infrastructure landscape and partner with the engineering team for improvements.
  • Develop a deep understanding of our software stack and infrastructure.
  • Maintain and improve the eRezLife CI/CD pipeline.
  • Administer AWS servers, services, and roles.
  • Have a direct impact on delivering meaningful solutions.

AWSDockerPHPPostgreSQLPythonFull Stack DevelopmentJenkinsKubernetesAmazon Web ServicesReactCI/CDProblem SolvingLinuxDevOps

Posted 2024-11-23
Apply
Apply

πŸ“ US

🧭 Full-Time

πŸ’Έ 140000 - 160000 USD per year

πŸ” Technology consulting

  • Experience with AWS services like S3, VPCs, EC2, ECS/EKS.
  • Containerization expertise in Docker and Kubernetes.
  • Proficiency in Infrastructure as Code (IaC) using Terraform or CDK.
  • Experience with CI/CD tools such as Jenkins or GitHub Actions.
  • Programming knowledge in Python.
  • Demonstrated ability to learn new technologies quickly.
  • Detail-oriented with high integrity and strong work ethic.
  • Excellent communication skills for team and client interactions.

  • Public cloud architecture design and operation, preferably with AWS.
  • Cloud-native architecting, operations, and governance.
  • Config-driven deployment and scalable configuration management.
  • Interpret regulatory security mandates into technical controls.
  • Implement software delivery automation and CI/CD systems.
  • Drive branching, artifact management, and deployment strategies.
  • Conduct system engineering and web application troubleshooting.
  • Manage PKI, secrets, and certificate management for CI/CD.

AWSDockerSoftware DevelopmentJenkinsCommunication SkillsCI/CDDevOpsTerraform

Posted 2024-11-23
Apply
Apply

πŸ“ California, Arizona, Colorado, Connecticut, Massachusetts, Minnesota, New Jersey, New York, Oregon, Texas, Wisconsin

πŸ’Έ 100000 - 110000 USD per year

πŸ” Retail Industry

🏒 Company: Deckers

  • Strong collaboration skills and team-oriented mindset.
  • Experience developing CI/CD workflows and tools.
  • Proficiency in one or more coding languages such as Java, JavaScript, Python, and Oracle SQL/PLSQL.
  • Experience with CI/CD tools like Jenkins and Flex Deploy.
  • Familiarity with platforms such as AWS, Docker, and Kubernetes.
  • Strong automation scripting skills.
  • Experience with JIRA and its integrations.
  • Knowledge in configuration management, test-driven development, and release management.
  • Strong analytical and troubleshooting abilities.
  • Understanding of agile development and DevOps principles.
  • Flexibility and adaptability to learn new technologies.

  • Streamline the software development lifecycle by identifying pain points and productivity barriers.
  • Collaborate with development teams to understand and improve their build and release processes.
  • Partner with cross-functional stakeholders to enhance delivery workflows.
  • Build and maintain the CI/CD pipelines for improved productivity and code quality.
  • Develop automation solutions for efficient and consistent code deployment.
  • Create automated testing to enhance product quality.
  • Monitor and manage application performance, troubleshooting issues as needed.
  • Prepare and present documentation to multiple stakeholders.
  • Promote DevOps principles across the organization.

AWSDockerPythonSoftware DevelopmentSQLAgileJavaJavascriptJenkinsKubernetesOracleJavaScriptJiraRelease ManagementCommunication SkillsCollaborationCI/CDDevOpsWritten communicationDocumentation

Posted 2024-11-22
Apply
Apply

πŸ“ North America

πŸ’Έ 100000 - 175000 USD per year

πŸ” Asset Intelligence Cybersecurity

🏒 Company: Armis Security

  • 4+ years of experience as a DevOps Engineer.
  • Significant expertise in AWS, Infrastructure as Code (IaC), and Docker.
  • Extensive hands-on experience with AWS for infrastructure deployment and management.
  • Proven track record in building and sustaining CI/CD pipelines.
  • Proficiency in tools and technologies such as Terraform, CDK, Kubernetes, ECS, and GitHub Actions.
  • Strong background in configuring cloud services, particularly Amazon Web Services.
  • Previous experience in Cyber Security and/or SaaS environments is advantageous.

  • Collaborate with software development teams to automate deployments and scale applications efficiently.
  • Deploy and manage infrastructure across multiple regions, ensuring high availability and performance.
  • Design, build, and maintain Continuous Integration/Continuous Deployment (CI/CD) pipelines for seamless service delivery.
  • Implement and manage container orchestration solutions, focusing on performance optimization, security, and reliability in large-scale environments.
  • Provide on-call support during off-peak hours if needed.
  • Continuously optimize deployment processes, particularly in expanding services to new regions.

AWSDockerSoftware DevelopmentCybersecurityIoTKubernetesAmazon Web ServicesCI/CDDevOpsTerraform

Posted 2024-11-19
Apply
Apply

πŸ“ United States, Costa Rica, Colombia, Mexico

🧭 Full-Time

πŸ” Software development

  • 5+ years of experience in Infrastructure as Code (IaC), with Java or Typescript being a plus.
  • Experience in software development and continuous integration.
  • Proficiency in Linux management.
  • Strong problem-solving and analytical skills.
  • Excellent communication and leadership abilities.
  • Bachelor's degree in computer science or equivalent experience.
  • Relevant certifications, such as AWS or Azure, are a plus.

  • Troubleshoot production issues to ensure effective resolution.
  • Ensure the site reliability of the infrastructure.
  • Maintain and update the disaster recovery plan.
  • Document DevOps processes and practices for Infrastructure as Code.
  • Automate the process for deploying new services.

DockerLeadershipPythonSoftware DevelopmentBashKubernetesTypeScriptAnalytical SkillsCI/CDLinuxDevOpsMicroservices

Posted 2024-11-16
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 110500 - 166000 USD per year

πŸ” Nonprofit fundraising/donor CRM

🏒 Company: Bloomerang

  • Leadership experience implementing complex projects, such as multi-system consolidation or monolith to service-driven infrastructure.
  • Production-level containerization and orchestration experience.
  • Experience with AWS ECS with Fargate or Kubernetes within EKS or GKE.
  • Familiarity with observability practices and deployment, including OpenTelemetry or agent-based tools.
  • Experience with Prometheus, Graylog, Stackdriver, and Datadog.
  • Knowledge of configuration management tools, such as Ansible, Chef, or Puppet.

  • Be a mentor and leader to team members.
  • Set up monitoring/alerting to ensure uptime and reliability of the app infrastructure.
  • Work on both new and refactoring infrastructure projects.
  • Present and discuss process improvements with your team.
  • Respond to incidents and other issues as part of an on-call rotation.
  • Understand the systems at hand and industry best practices.
  • Communicate across teams and proactively support other Engineering team members.
  • Perform and understand required system maintenance tasks.
  • Leverage system logs, metrics, and other data sources to detect and solve problems proactively.
  • Demonstrate a strong understanding of Agile methodologies, cloud operations, and workflows.

AWSLeadershipAgileKubernetesPrometheusCollaborationAgile methodologiesDevOps

Posted 2024-11-15
Apply
Apply

πŸ“ United States, India, France, Australia

πŸ” DeFi and data platform

🏒 Company: Career Renew

  • 3+ years of DevOps/SRE experience, preferably in an institutional node operations firm
  • 1+ years experience managing Web3 nodes and blockchain infrastructure
  • Proficiency in TypeScript and Node.js ecosystem
  • Experience with cloud platforms (AWS/GCP/Azure)
  • Proficiency in Infrastructure as Code (Terraform)
  • Must be able to work on a 4 hour overlap with PST

  • Design, deploy, and maintain high-availability node infrastructure (ETH and other major protocols)
  • Develop and maintain automated deployment pipelines for TypeScript-based backend
  • Implement monitoring, alerting, and logging solutions for nodes and backend services
  • Optimize resource utilization and cost management across cloud platforms
  • Ensure security best practices in infrastructure deployment and maintenance
  • Collaborate with backend developers to improve system reliability and performance

AWSNode.jsBlockchainGCPTypeScriptAzureCI/CDDevOpsTerraform

Posted 2024-11-14
Apply
Apply

πŸ“ U.S.

🧭 Full-Time

πŸ’Έ 80000 - 110000 USD per year

πŸ” Benefits technology and services

🏒 Company: Businessolver

  • BS or equivalent work experience in Computer Science, MIS, or related degree.
  • 2+ years of experience in Linux systems administration and configuration.
  • Strong scripting skills, preferably in Python or Bash.
  • 1+ year of experience with Amazon Web Services (e.g., EC2, S3, RDS).
  • Experience with monitoring and alerting solutions.
  • Experience with DNS and web server configurations.
  • Knowledge of automating infrastructure using tools like Ansible, Puppet, Chef, or CloudFormation.
  • Familiarity with containers and orchestration tools such as Kubernetes.
  • Understanding of CI/CD pipelines and Agile methodologies.
  • Excellent communication, troubleshooting, and problem-solving skills.

  • Perform daily system monitoring and verify the integrity and availability of hardware and key processes.
  • Conduct performance tuning, infrastructure upgrades, and resource optimization.
  • Provision and configure hardware and services according to standards.
  • Develop and implement automated approaches for system tasks.
  • Provide Tier II support for incidents and troubleshoot issues.
  • Maintain and contribute to documentation and knowledge base.
  • Influence technology groups in adopting Cloud technologies and best practices.
  • Propose creative solutions to enhance client satisfaction.

AWSPythonAgileBashKubernetesAmazon Web ServicesCommunication SkillsAgile methodologiesLinuxDevOpsDocumentation

Posted 2024-11-13
Apply
Apply

πŸ“ United States

🧭 Full-Time

πŸ’Έ 191000 - 239000 USD per year

πŸ” Streaming media / Anime

🏒 Company: Crunchyroll, LLC

  • 12+ years of relevant experience in DevOps and infrastructure management.
  • 8+ years working in production environments, integrating DevOps and software engineering.
  • 5+ years managing containerized infrastructure in Kubernetes or ECS.
  • Proficiency in automation, creating reusable patterns and solutions.
  • Expertise in GitOps practices and CI/CD, including tools like ArgoCD.
  • 5+ years of experience with AWS cloud technologies using Infrastructure as Code (IaC) with Terraform.
  • Proficient in TypeScript.
  • Familiarity with relational and non-relational databases (e.g., PostgreSQL, MySQL, DynamoDB).
  • Experience guiding software engineers on production best practices.

  • Automate and scale systems and services that power the streaming platform.
  • Work with the SRE team and delivery teams on infrastructure projects.
  • Lead projects related to infrastructure automation and scaling.
  • Drive projects to completion, ensuring best practices are followed.
  • Collaborate with engineers to develop tools that support services and software delivery.

AWSPostgreSQLDynamoDBKubernetesMySQLTypeScriptCollaborationCI/CDDevOpsTerraform

Posted 2024-11-12
Apply
Apply
πŸ”₯ DevOps Engineer
Posted 2024-11-12

πŸ“ West Coast, Central Europe

πŸ” Machine intelligence

🏒 Company: Gensyn

  • Experience deploying large scale workloads to one or more cloud providers.
  • Experience monitoring and being on call for internal and external deployments.
  • Infrastructure as Code and GitOps expertise, preferably with Terraform.
  • Experience with Kubernetes or equivalent container orchestration solution.
  • Experience managing OTel, Prometheus, or similar open-source observability systems.
  • Proficiency in Python, shell scripting, and Linux.

  • Support the development, deployment, and observability of the mainnet and internal products.
  • Manage observability and deployments for a complex, distributed system.
  • Manage CI/CD and build pipeline infrastructure.
  • Author and maintain internal developer support tools.

CI/CDDevOpsTerraform

Posted 2024-11-12
Apply