Apply

Senior Quantitative Scientist (ML/NLP)

Posted about 1 month agoViewed

View full description

💎 Seniority level: Senior, 5+ years

📍 Location: AZ, CA, CO, CT, FL, GA, IL, LA, MA, MN, NC, NJ, NV, NY, OH, PA, SC, TN, TX, UT , VA, WA, Washington, D.C

💸 Salary: 154000.0 - 200000.0 USD per year

🔍 Industry: Healthcare

🏢 Company: Verana Health👥 51-100💰 $150,000,000 Series E about 3 years agoDatabaseBiotechnologyAnalyticsHealth Care

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWSPythonSQLData AnalysisMachine LearningNumpyAlgorithmsData scienceREST APISparkData visualizationData modeling

Requirements:
  • 5+ years of hands-on experience with messy data (e.g., electronic health records, outcomes data) and analytical methodologies.
  • 3+ years of hands-on experience with machine learning model implementation & deployment, especially on clinical notes
  • 3+ years of hands-on experience with state-of-the-art natural large language models (e.g., BERT, Longformer, RoBERTa, etc.) in resolving use cases like named entity recognition (NER), text classification, entity relation extraction, etc.
  • Strong familiarity with programming languages, especially Python, Pyspark, R, SQL.
  • Strong familiarity with coding platforms, especially Databricks, Amazon Sagemaker, Visual Studio Code
  • Strong familiarity with unstructured text processing techniques.
  • Familiarity with clinical datasets and coding systems such as ICD, CPT, and RxNorm.
  • Ability to work effectively with cross-functional teams.
  • Clear communication skills and able to deliver internal/external presentations.
  • Ability to prioritize and manage multiple projects with high attention to detail
Responsibilities:
  • Develop and leverage state-of-the-art advances in natural language processing using pre-trained large language models (LLMs) for analyzing and reasoning over clinical notes and other unstructured data in the context of clinical problems.
  • Drive cutting-edge research on language modeling with emphasis on scientific accuracy and explainability.
  • Communicate analysis results via presentations to a multi-disciplinary audience using clear, intuitive visualizations.
  • Establish and maintain best practices for data exploration, end-to-end model development and deployment lifecycle, and data/code/documentation management
  • Work on Qdata development to enable commercial projects leveraging real-world data through responsibilities such as creation of study plans, implementation of analyses, development of algorithms, and/or writing of publications.
  • Collaborate cross-functionally with teams (e.g., Commercial, Product, Medical, Engineering/Technology, etc.) to translate clinical investigation questions into detailed data analytics requirements for internal and external projects.
  • Provide mentorship and knowledge sharing to team members in standardizing machine learning/natural language processing best practices.
Apply