AI/ML Engineer

Posted 3 days ago

📍 Location: United States, Canada

🔍 Industry: Mental Health

🏢 Company: Blueprint | 👥 1-10 | Business Development, Fitness, Wellness, Health Care

🗣️ Languages: English

🪄 Skills: Python, Artificial Intelligence, Data Analysis, Machine Learning, QA, CI/CD

Requirements:
  • Built or owned evaluation infrastructure for LLMs or generative AI products
  • Experience designing QA workflows, human-in-the-loop systems, or LLM-as-a-judge pipelines
  • Think in terms of feedback loops and can turn fuzzy product goals into testable quality metrics
  • Write code, ship experiments, and are comfortable working across the stack to get the right signals flowing
Responsibilities:
  • Design and build our end-to-end evaluation infrastructure: LLM-as-a-judge, human QA pipelines, offline scoring, and more
  • Define and implement application-specific quality metrics — not just accuracy, but tone, structure, clinical alignment, and more
  • Collaborate with product and clinical leads to turn subjective requirements into structured evaluation criteria
  • Monitor and analyze model performance across different therapist cohorts and workflows
  • Build tools and processes to capture in-the-wild feedback from clinicians and route it back into model and product improvement loops
  • Work closely with engineers to integrate eval into our CI, deployment, and iteration cycles
  • Help shape data labeling, prompt evaluation, experiment design, and prompt tuning frameworks

Related Jobs

📍 US

🧭 Full-Time

🔍 Engineering and Sciences

Requirements:
  • Active Top Secret Clearance with the ability to obtain a TS/SCI.
  • Bachelor’s degree in Data Science, Computer Science, Engineering, or related field.
  • 14 years of experience in Data Collection, Processing, or Analysis systems.
  • 5 years of experience with classified DoD/IC systems and AI/ML development.
  • 5 years of experience leading technical teams.
  • Experience with AI/ML frameworks (TensorFlow, Keras, PyTorch).
  • Experience with programming in Python, R, C++, or similar.
  • Experience using analytics and visualization tools (Tableau, Power BI, matplotlib, Plotly).
  • Experience in big data solutions (Accumulo, Hadoop, Spark, Kafka).
  • DoD 8570/8140 certifications or equivalent in Information Assurance and Cybersecurity (e.g., Security+ or higher).
Responsibilities:
  • Design, build, and maintain ML/DL models for various AI applications in classified environments.
  • Leverage the Enterprise Data Architecture for secure and efficient data management.
  • Collaborate with cross-functional teams to test, validate, and deploy AI/ML models.
  • Enhance system performance through AI/ML models and big data engineering.
  • Analyze large datasets to identify patterns and insights.
  • Design and manage data pipelines for rapid prototyping.
  • Implement data visualization and UX solutions to enhance user experience.
  • Apply zero-trust principles in data security and management.
  • Develop solutions for MLS systems, integrating data from different security enclaves.
  • Translate mission needs into analytical approaches to achieve mission outcomes.
  • Perform data collection, cleansing, integration, and storage in classified settings.
  • Ensure AI solutions comply with privacy regulations and ethical AI standards.
  • Implement monitoring for AI model performance.
  • Build trust and "explainability" in AI, ensuring transparent processes.
  • Stay updated with AI/ML advancements and incorporate them into operations.
  • Manage and support a team of engineers, fostering a collaborative environment.

🪄 Skills: AWS, Leadership, Project Management, Python, SQL, Apache Hadoop, Artificial Intelligence, Cloud Computing, Cybersecurity, Data Analysis, Keras, Machine Learning, Numpy, PyTorch, C++, Tableau, Algorithms, Apache Kafka, Data engineering, Data science, Data Structures, REST API, Tensorflow, Communication Skills, Analytical Skills, Collaboration, CI/CD, Problem Solving, Microsoft Office, Agile methodologies, Linux, Written communication, Critical thinking, Active listening, Data visualization, Team management, Data modeling, Scripting, Data analytics, Data management, Debugging

Posted 3 days ago
🔥 AI/ML Engineer | GT
Posted 9 days ago

📍 UK, USA, Canada, Germany, Netherlands

🧭 Full-Time

🔍 Healthcare and Pharmacy

🏢 Company: GT | 👥 101-250 | Information and Communications Technology (ICT), Product Management, Information Technology

Requirements:
  • At least 2 years of AI/ML experience
  • 3+ years of Python experience
  • Experience with Generative AI and LLMs
  • Advanced RAG techniques implementation
  • Experience with vector databases (Weaviate or Milvus)
  • Time-series forecasting tools: Prophet, ARIMA, or TimeGPT
  • Databricks experience
  • Experience in model deployment
  • Experience with similarity, NLP, labeling, clustering, and data cleaning solutions
  • Cloud platform experience (AWS, GCP, or Azure)
  • Git version control
  • SQL knowledge
  • PyTorch
  • LlamaIndex or LangChain
  • Hugging Face Transformers
  • Pandas/NumPy for time-series data manipulation
  • LLM evaluation frameworks (e.g., DeepEval)
  • At least upper-intermediate English
Responsibilities:
  • Understand and translate end-user requirements into technical solutions
  • Develop solutions for semantic similarity, NLP, labeling, clustering, and vector search problems
  • Design and implement rapid proof-of-concept solutions
  • Participate in discovery stages with clients to define problem scope and solution approaches
  • Present work results and contribute ideas in team meetings
  • Create and maintain documentation for ML systems and processes
  • Stay current with ML/AI advancements and evaluate new techniques for potential implementation

🪄 Skills: Python, SQL, Artificial Intelligence, Cloud Computing, Git, Machine Learning, Numpy, PyTorch, Pandas, English communication


📍 United States

🧭 Full-Time

💸 175,500 - 277,500 USD per year

🔍 Software Development

🏢 Company: Upwork | 👥 501-1000 | 💰 over 8 years ago | 🫂 Last layoff about 2 years ago | Marketplace, Freelance, Copywriting, Peer to Peer

Requirements:
  • Strong proficiency in Python and modern ML frameworks such as PyTorch or TensorFlow, with experience developing and deploying AI systems.
  • Deep understanding of core ML concepts, including transformers, generative models, and agent architectures such as MCP or A2A.
  • Experience training large models on GPU clusters and integrating LLMs with external tools via APIs or orchestration frameworks.
  • Comprehensive understanding of foundational deep learning, machine learning concepts, and state-of-the-art GenAI models.
  • Hands-on experience training custom LLMs on GPUs and working with Retrieval-Augmented Generation (RAG) systems.
  • Proven ability to build end-to-end ML pipelines—from data prep and experimentation to production deployment—in a cloud-native environment.
  • A growth mindset, strong communication skills, and the ability to translate complex technical work to stakeholders across functions.
  • Demonstrated experience working in R&D environments; publications in major AI/ML conferences are a plus, but not required.
Responsibilities:
  • Architect and implement core infrastructure to support agent-based LLM systems, including multi-agent pipelines, RAG, and real-time orchestration.
  • Train and fine-tune custom models, including LLMs and foundation models, to solve unique Upwork-specific challenges.
  • Lead cross-functional collaboration across engineering, product, and research teams to align technical solutions with business impact.
  • Mentor and guide engineers and researchers, helping foster a high-performing, inclusive team culture grounded in AI excellence.
  • Translate experimental ideas into reliable, scalable production systems by applying best practices in AI engineering and deployment.
  • Publish and share innovations where appropriate, contributing to Upwork’s visibility in the broader AI/ML research community.

🪄 Skills: Python, Cloud Computing, Machine Learning, PyTorch, Algorithms, Data Structures, Tensorflow, Communication Skills, RESTful APIs, Mentoring

Posted 18 days ago

📍 United States

🧭 Full-Time

🔍 EdTech

🏢 Company: Securly | 👥 101-250 | 💰 $16,000,000 Series B over 6 years ago | EdTech, Security, Software

Requirements:
  • Experience building, training, and deploying machine learning models in production
  • Proficiency in Python and ML libraries such as TensorFlow, PyTorch, or Scikit-learn
  • Experience working with NLP, computer vision, time-series analysis, or anomaly detection
  • Familiarity with cloud-based model pipelines and MLOps tools (e.g., SageMaker, Vertex AI, MLflow)
Responsibilities:
  • Not stated

🪄 Skills: Python, Machine Learning, MLFlow, PyTorch, Tensorflow

Posted 23 days ago
🔥 Founding AI/ML Engineer
Posted 2 months ago

📍 United States

🧭 Full-Time

🔍 Software Development

🏢 Company: Beakon | 👥 11-50 | Information Technology, Software

Requirements:
  • 3+ years of experience in backend or full-stack development.
  • Fluency in one or more modern languages, such as Python or JavaScript.
  • Fluency in cloud provider platforms.
Responsibilities:
  • Drive the technical vision, ensuring a scalable and maintainable architecture.
  • Make key technology decisions and establish best practices for development and deployment.
  • Build and ship core features, writing clean, efficient, and well-tested code.
  • Design resilient, fault-tolerant systems that can handle mission-critical workloads.
  • Implement best-in-class security practices to protect sensitive data and infrastructure.
  • Lay the groundwork for scaling both the product and engineering team.
  • Mentor and hire top engineering talent to help grow the company.

🪄 Skills: AWS, Backend Development, Docker, Leadership, Python, Software Development, SQL, Artificial Intelligence, Cloud Computing, Data Analysis, Data Mining, Full Stack Development, Git, Kubernetes, Machine Learning, Software Architecture, REST API


📍 United States

🧭 Full-Time

💸 140,000 - 165,000 USD per year

🔍 Insurance

🏢 Company: Integrated Specialty Coverages, LLC

Requirements:
  • 5+ years of relevant model development and deployment experience, preferably within the insurance industry
  • Proven experience in prompt engineering, LLM fine-tuning, and LLM evaluation
  • Experience working with RAG architecture, including embedding models, retrieval mechanisms, and integration with LLMs
  • Proven experience building and deploying machine learning models
  • Experience building and deploying models with AWS tools (Bedrock, SageMaker, Comprehend, S3, etc.) is preferred
  • Proficiency in writing production-quality, scalable code using Python; experience with scikit-learn, PyTorch, TensorFlow, Hugging Face, Keras, etc.
  • Strong working knowledge of MLflow, Docker, Git, and SQL
  • Experience in creating and integrating APIs for model deployment
Responsibilities:
  • Utilize expertise in generative AI technologies to develop innovative solutions for underwriting, risk assessment, claims processing, and customer engagement.
  • Think big about the arc of development of generative AI over a multi-year horizon, and identify new opportunities to apply these technologies to solve real-world problems
  • Work closely with engineering and devops teams to ensure seamless integration, deployment, and scalability of machine learning models within the existing infrastructure.
  • Keep abreast of the latest developments in AI and machine learning, particularly in generative models, to inform strategy and implementation.
  • Build, deploy, maintain, and optimize machine learning models
  • Adhere to best practices for model validation, testing, and monitoring to ensure performance and reliability
  • Provide guidance and mentorship to junior data scientists, fostering a culture of learning and professional growth within the data science team.
  • Effectively communicate complex analytical findings to both technical and non-technical audiences through reports, presentations, and visualizations.
  • Proactively identify areas for improvement in existing models and processes, advocating for and implementing enhancements.

🪄 Skills: AWS, Docker, Python, SQL, Artificial Intelligence, Data Analysis, Git, Machine Learning, MLFlow, PyTorch, API testing, Tensorflow, Communication Skills, Problem Solving, Data visualization

Posted 3 months ago
🔥 Staff AI/ML Engineer
Posted 6 months ago

📍 Worldwide

🧭 Full-Time

🔍 Software Development

🏢 Company: Zencoder

Requirements:
  • 5+ years of experience in ML/AI, including shipping models to production and iterative improvement post-launch.
  • Strong knowledge of LLMs, including SOTA models (GPT, Claude, Mistral, etc.), with practical experience in prompt engineering, fine-tuning, or retrieval-augmented generation.
  • Deep understanding of NLP, including tokenization, embeddings, and transformer architectures.
  • Deep understanding of machine learning, including experience with some fields of classical ML (recommendation systems, regressions/classifications on tabular data, and time series or other areas of classical ML).
  • Ability to work with customer data to identify usage patterns, perform analytics, and generate insights for product development and model optimization.
  • Ability to set up data collection pipelines.
  • Solid understanding of software engineering concepts, especially around the SDLC in modern dev environments.
  • Proficient in Python and ML frameworks (PyTorch, Hugging Face, etc.).
  • Proven ability to work effectively in a collaborative team environment, with excellent communication skills and a commitment to delivering high-quality solutions on time.
  • Experience designing and evaluating AI agents or multi-agent pipelines is a strong plus.
Responsibilities:
  • Design, build, and optimize LLM-powered agents that assist developers in tasks such as code generation, unit test creation, bug fixing, and refactoring.
  • Research the capabilities and limitations of SOTA LLMs and apply findings to improve agent performance and reliability.
  • Develop evaluation pipelines to benchmark model quality, correctness, and impact on developer productivity.
  • Collaborate cross-functionally with product, software, and infrastructure teams to integrate AI agents seamlessly into IDE environments (JetBrains, VS Code).
  • Craft effective prompts, fine-tune models, and experiment with advanced techniques like RLHF or DPO to guide model behavior.
  • Analyze user interactions and data to derive insights and continuously optimize the agent experience.

🪄 Skills: Python, Software Development, Machine Learning, API testing

🔥 Lead AI / ML Engineer
Posted 6 months ago

📍 Arizona, California, Connecticut, Illinois, Massachusetts, Michigan, New Hampshire, New York, Texas, Vermont, Washington D.C.

💸 165,000 - 190,000 USD per year

🔍 Nonprofit consumer advocacy

🏢 Company: Consumer Reports | 👥 501-1000 | 💰 $1,148,509 Grant over 3 years ago | Consumer Electronics, Information Services, Publishing, Consumer Research

Requirements:
  • Bachelor's degree in Computer Science, Data Science, Machine Learning, Artificial Intelligence, or related field.
  • 7+ years of professional working experience in AI/ML engineering projects.
  • Strong foundations in Computer Science, Math, Probability, Statistics, Machine Learning, Image Processing, Natural Language Processing (NLP), and Generative AI concepts.
  • Expert in Python, SQL, Unix Scripting, and related libraries and frameworks.
  • Experience building end-to-end production grade AI/ML pipelines in cloud (AWS + Databricks).
  • Excellent communication and collaboration skills to partner with business and technology.
Responsibilities:
  • Analyze business requirements from cross-functional teams and translate to technical AI/ML problems.
  • Lead the AI/ML initiatives in designing, developing, and deploying AI Solutions.
  • Collaborate with business, product, UI/UX, and engineering teams to build automated end-to-end AI/ML pipelines.
  • Research and stay updated on the latest AI/ML technologies and frameworks to drive innovation and improve the organization’s AI/ML capabilities.
  • Advocate for responsible AI practices, ensuring ethical use of AI/ML and adherence to data privacy and compliance standards.
  • Mentor and guide team members on AI/ML engineering best practices.

🪄 Skills: AWS, Leadership, Python, SQL, Artificial Intelligence, Image Processing, Machine Learning, Strategy, Data science, Collaboration, Compliance


📍 United States

🔍 Mental Health Care

Requirements:
  • The candidate should be an experienced AI/ML Infrastructure Engineer.
  • They should care about impact and ownership, indicating a strong sense of responsibility.
  • The job description does not specify particular technical skills or years of experience required.
Responsibilities:
  • At Lyra, the engineer will work on data-driven technology and decision making to address complex challenges in provider quality and accessibility, crucial for delivering high quality care.
  • The role emphasizes impact, ownership, cross-functional projects, and mentorship within the team.

🪄 Skills: Leadership, Python, Artificial Intelligence, Machine Learning, Cross-functional Team Leadership, Mentoring

Posted 6 months ago
🔥 AI/ML Engineer
Posted 7 months ago

📍 United States, Canada

🧭 Full-Time

🔍 Software Development

Requirements:
  • Bachelor's or master's degree in Computer Science, Software Engineering, or Data Science
  • 3+ years of experience in software development
  • Experience with AI/ML frameworks such as TensorFlow, PyTorch, or scikit-learn
  • Strong proficiency in Python and relevant libraries
  • Foundational understanding of machine learning concepts
  • Proficiency in data preprocessing and feature engineering
  • Familiarity with cloud platforms (AWS, Azure, Google Cloud) and Docker/Kubernetes
  • Excellent analytical and communication skills
Responsibilities:
  • Implement AI/ML solutions with business leaders
  • Design and deploy scalable machine learning models
  • Ensure efficient data integration for AI models
  • Collaborate with tech teams for system integration
  • Monitor and optimize model performance
  • Write clean and well-documented code
  • Identify process improvement opportunities
  • Assist with user training and support

🪄 Skills: Docker, Python, Cloud Computing, Keras, Kubernetes, Machine Learning, Numpy, PyTorch, Pandas, Tensorflow
