Apply

Sr. DevSecOps Engineer

Posted about 23 hours agoViewed

View full description

💎 Seniority level: Middle, 3+ years

🔍 Industry: Software Development

🏢 Company: Craft Machine Inc

⏳ Experience: 3+ years

Requirements:
  • 3+ years of experience in a DevOps or MLOPs role (or similar experience), with a proven track record of implementing machine learning lifecycle solutions.
  • Experience using Terraform or other infrastructure-as-code tools.
  • Experience with designing and optimizing infrastructure for machine learning workloads, including GPU cost optimization.
  • Familiar with tools and practices for model management, deployment, and monitoring, such as MLflow, ElasticSearch, or OpenSearch.
  • Able to think holistically about data platforms and their integration with machine learning workflows.
  • Deep knowledge of Amazon Web Services (AWS) Infrastructure along with experience with at least a part of the other components of our technology stack: Airflow, Apollo, CircleCI, Cloudflare, Databricks, Docker, GraphQL, Pandas, PyTorch, Python, spaCy, Transformers.
Responsibilities:
  • Contribute to and implement the DevSecOps and MLOps strategies, working with multiple stakeholders that are focused on automating and securing the development lifecycle.
  • Design and manage the entire MLOps lifecycle, including data versioning, model training, deployment, and monitoring.
  • Implement tools and frameworks for model versioning, tagging, and QA processes.
  • Optimize infrastructure for model training and inference, focusing on cost efficiency and avoiding vendor lock-in .
  • Automate security and compliance measures across the machine learning lifecycle, ensuring seamless integration with CI/CD pipelines.'
  • Develop monitoring systems for production models, including active learning workflows to flag problematic records for further verification.
  • Build and manage scalable infrastructure to support machine learning workloads, including data storage, versioning, and evaluation platforms.
  • Perform threat modeling, risk assessment, and code reviews to assess cybersecurity implications.
Apply

Related Articles

Posted 14 days ago

Why remote work is such a nice opportunity?

Why is remote work so nice? Let's try to see!

Posted 7 months ago

Insights into the evolving landscape of remote work in 2024 reveal the importance of certifications and continuous learning. This article breaks down emerging trends, sought-after certifications, and provides practical solutions for enhancing your employability and expertise. What skills will be essential for remote job seekers, and how can you navigate this dynamic market to secure your dream role?

Posted 7 months ago

Explore the challenges and strategies of maintaining work-life balance while working remotely. Learn about unique aspects of remote work, associated challenges, historical context, and effective strategies to separate work and personal life.

Posted 7 months ago

Google is gearing up to expand its remote job listings, promising more opportunities across various departments and regions. Find out how this move can benefit job seekers and impact the market.

Posted 7 months ago

Learn about the importance of pre-onboarding preparation for remote employees, including checklist creation, documentation, tools and equipment setup, communication plans, and feedback strategies. Discover how proactive pre-onboarding can enhance job performance, increase retention rates, and foster a sense of belonging from day one.