
NLP & LLM Data Scientist

Written by Trio Health | Jun 24, 2025 5:49:55 PM

 

Role Description

This is a full-time hybrid role near Boulder, CO for a Data Scientist specializing in NLP, ML, and LLMs. The role guides Trio’s AI strategy as both a hands-on and a research contributor, designing, developing, and deploying models and pipelines. Model components include preprocessing, NLP, ML, and LLM models, system prompt engineering, and threshold tuning, with a specific focus on the extraction and vector search domains.

Company Description

Trio Health is a healthcare technology and analytics company that collaborates with clinical experts and uses Electronic Health Record (EHR) data to provide unparalleled insights into patient care. Trio's proprietary platform enables Life Sciences stakeholders and Channel Partners to access unprecedented amounts of curated information about the patient journey.

Required Skills/Qualifications/Experience – 3+ years

  • Bachelor's or Master's degree in Computer Science, Mathematics, Statistics, Computational Linguistics, Engineering, or a related field.
  • 3+ years of professional hands-on experience using large sets of structured and unstructured data to develop data-driven insights with ML and NLP.
  • 1+ years of demonstrated hands-on experience with Python, Hugging Face, TensorFlow, PyTorch, or similar tools.
  • Expert in Python.
  • 1+ years of hands-on experience developing natural language processing (NLP) models, ideally with transformer architectures.
  • 1+ years of experience implementing information search and retrieval at scale, using solutions ranging from keyword search to semantic search with embeddings.
  • Knowledge of developing or tuning Large Language Models (LLMs) and Generative AI (GAI).
  • Knowledge of NLP, LLMs (extractive and generative), fine-tuning, and LLM development. Familiarity with higher-level trends in LLMs and open-source platforms.
  • Experience working with Snowflake, Databricks, AWS, or equivalent platforms for model deployment and AI inference.
  • Ability to deliver quick, pragmatic, simple solutions in 1-3 week increments, working iteratively and incorporating feedback.
  • Team player who collaborates with the team and the broader group, finds consensus across colleagues and stakeholders, and builds productive working relationships.
  • Ability to work with general and sometimes ambiguous requirements and objectives, and to break down technical work.
  • Ability to align work with strategic objectives.
  • Experience with Kanban or Scrum methodologies.
  • Experience working with healthcare data and familiarity with practical privacy and security methods used under HIPAA for PHI.
  • Track record of high-quality, on-time delivery, adoption of solutions, initiative, and professional curiosity.

Desirable

  • Prior experience working with Electronic Health Record (EHR) data, LOINC mapping, ICD-9/ICD-10 coding, and clinical notes.