-
View all jobs
Lensa is a career site that helps job seekers find great jobs in the US. We are not a staffing firm or agency. Lensa does not hire directly for these jobs, but promotes jobs on LinkedIn on behalf of its direct clients, recruitment ad agencies, and marketing partners. Lensa partners with DirectEmployers to promote this job for Norstella. Clicking "Apply Now" or "Read more" on Lensa redirects you to the job board/employer site. Any information collected there is subject to their terms and privacy notice.
NLP & LLM Data Scientist - Healthcare & Life Sciences
Company: Norstella
Location: Remote, United States
Date Posted: Aug 17, 2025
Employment Type: Full Time
Job ID: R-1387
Description
About Norstella
At Norstella, our mission is simple: to help our clients bring life-saving therapies to market quicker-and help patients in need.
Founded in 2022, but with history going back to 1939, Norstella unites best-in-class brands to help clients navigate the complexities at each step of the drug development life cycle -and get the right treatments to the right patients at the right time.
Each Organization (Citeline, Evaluate, MMIT, Panalgo, The Dedham Group) Delivers Must-have Answers For Critical Strategic And Commercial Decision-making. Together, Via Our Market-leading Brands, We Help Our Clients
As one of the largest global pharma intelligence solution providers, Norstella has a footprint across the globe with teams of experts delivering world class solutions in the USA, UK, The Netherlands, Japan, China and India.
Job Description
Norstella Real World Data (RWD) is seeking a skilled NLP Data Scientist with clinical background with a focus on Language Models to join our AI & Life Sciences Solutions team. Your expertise in processing and understanding natural language data, along with your knowledge of Electronic Health Records (EHR) and laboratory reports analysis, will be instrumental in driving our data science initiatives and innovations, particularly in the development of rich multimodal real-world datasets to expedite RWD-driven drug development in pharma.
Responsibilities
Benefits
Norstella is an equal opportunities employer and does not discriminate on the grounds of gender, sexual orientation, marital or civil partner status, pregnancy or maternity, gender reassignment, race, color, nationality, ethnic or national origin, religion or belief, disability or age. Our ethos is to respect and value people's differences, to help everyone achieve more at work as well as in their personal lives so that they feel proud of the part they play in our success. We believe that all decisions about people at work should be based on the individual's abilities, skills, performance and behavior and our business requirements. Norstella operates a zero-tolerance policy to any form of discrimination, abuse or harassment.
Sometimes the best opportunities are hidden by self-doubt. We disqualify ourselves before we have the opportunity to be considered. Regardless of where you came from, how you identify, or the path that led you here- you are welcome. If you read this job description and feel passion and excitement, we're just as excited about you.
Norstella is an equal opportunity employer. All job applicants will receive equal treatment regardless of race, creed, color, religion, alienage or national origin, ancestry, citizenship status, age, physical or mental disability or handicap, medical condition, sex (including pregnancy and pregnancy-related conditions), marital or domestic partner status, military or veteran status, gender, gender identity or expression, sexual orientation, genetic information, reproductive health decision making, or any other protected characteristic as established by federal, state, or local law.
If you have questions about this posting, please contact [email protected]
NLP & LLM Data Scientist - Healthcare & Life Sciences
Company: Norstella
Location: Remote, United States
Date Posted: Aug 17, 2025
Employment Type: Full Time
Job ID: R-1387
Description
About Norstella
At Norstella, our mission is simple: to help our clients bring life-saving therapies to market quicker-and help patients in need.
Founded in 2022, but with history going back to 1939, Norstella unites best-in-class brands to help clients navigate the complexities at each step of the drug development life cycle -and get the right treatments to the right patients at the right time.
Each Organization (Citeline, Evaluate, MMIT, Panalgo, The Dedham Group) Delivers Must-have Answers For Critical Strategic And Commercial Decision-making. Together, Via Our Market-leading Brands, We Help Our Clients
- Citeline - accelerate the drug development cycle
- Evaluate - bring the right drugs to market
- MMIT - identify barrier to patient access
- Panalgo - turn data into insight faster
- The Dedham Group - think strategically for specialty therapeutics
As one of the largest global pharma intelligence solution providers, Norstella has a footprint across the globe with teams of experts delivering world class solutions in the USA, UK, The Netherlands, Japan, China and India.
Job Description
Norstella Real World Data (RWD) is seeking a skilled NLP Data Scientist with clinical background with a focus on Language Models to join our AI & Life Sciences Solutions team. Your expertise in processing and understanding natural language data, along with your knowledge of Electronic Health Records (EHR) and laboratory reports analysis, will be instrumental in driving our data science initiatives and innovations, particularly in the development of rich multimodal real-world datasets to expedite RWD-driven drug development in pharma.
Responsibilities
- Employ and leverage NLP and open-source Large Language Models (LLM) such as LLama2, Mixtral, Qwen, BERT, etc., to extract, process, and interpret unstructured medical data from diverse sources like EHRs, medical notes, and laboratory reports.
- Collaborate with clinical scientists and data scientists to create efficient NLP models for healthcare, exhibiting an understanding of both the technical and medical aspects of the data.
- Conduct data cleaning, preprocessing, and validation to maintain the accuracy and reliability of insights gathered from NLP processes.
- Validate and present data findings to stakeholders, exhibiting clear and effective communication skills.
- Master's or Ph.D. degree in Computational Biology, Computer Science, Data Science, Computational Linguistics, Machine Learning, or a related analytical field.
- Deep understanding and direct experience (2+ years) in handling and interpreting either Electronic Health Records (EHR) and laboratory tests results or genetic test results is a must.
- Proven experience (2+ years) in NLP with a strong knowledge of NLP techniques such as Named Entity Recognition (NER), text summarization, topic modeling, etc. and their applied use in healthcare.
- Expert-level understanding and practical experience (1+ years) with open-source Large Language Models (Llama2/3, Mixtral etc.), e.g., prompt engineering, inference, and fine-tuning.
- Proficient in Python and SQL, with strong experience in NLP libraries such as NLTK, spaCy, Hugging face Transformers, and deep learning libraries such as PyTorch, TensorFlow.Familiarity with common data science and ML practices, e.g., version control systems, agile methodologies, and documentation.
- Experience in working with AWS cloud environment and large databases (e.g., AWS redshift).
- Experience in managing ML lifecycle using open-source tools (e.g., MLflow).Detail-oriented with strong analytical and problem-solving abilities.
- Excellent verbal and written communication skills, with ability to present complex data to non-technical audience.
- Experience dealing with protected health information (PHI) and familiarity with healthcare-related data privacy laws such as HIPAA.
- Familiarity with standard healthcare codes and terminologies such as ICD-10, CPT, LOINC, and SNOMED CT.
- Experience in RAG (Retrieval-Augmented Generation) and vector store in the context of storing large volume of healthcare unstructured documents and querying those.
Benefits
- Medical and prescription drug benefits
- Health savings accounts or flexible spending accounts
- Dental plans and vision benefits
- Basic life and AD&D Benefits
- 401k retirement plan
- Short and Long-Term Disability
- Paid parental leave
- Open vacation policy
Norstella is an equal opportunities employer and does not discriminate on the grounds of gender, sexual orientation, marital or civil partner status, pregnancy or maternity, gender reassignment, race, color, nationality, ethnic or national origin, religion or belief, disability or age. Our ethos is to respect and value people's differences, to help everyone achieve more at work as well as in their personal lives so that they feel proud of the part they play in our success. We believe that all decisions about people at work should be based on the individual's abilities, skills, performance and behavior and our business requirements. Norstella operates a zero-tolerance policy to any form of discrimination, abuse or harassment.
Sometimes the best opportunities are hidden by self-doubt. We disqualify ourselves before we have the opportunity to be considered. Regardless of where you came from, how you identify, or the path that led you here- you are welcome. If you read this job description and feel passion and excitement, we're just as excited about you.
Norstella is an equal opportunity employer. All job applicants will receive equal treatment regardless of race, creed, color, religion, alienage or national origin, ancestry, citizenship status, age, physical or mental disability or handicap, medical condition, sex (including pregnancy and pregnancy-related conditions), marital or domestic partner status, military or veteran status, gender, gender identity or expression, sexual orientation, genetic information, reproductive health decision making, or any other protected characteristic as established by federal, state, or local law.
If you have questions about this posting, please contact [email protected]
Key Skills
Ranked by relevance
machine learning
aws
deep learning
pytorch
python
hipaa
cloud
sql
ai
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Data Scientist_ML (India)
2026-05-21
Full-time
Mid-Senior
India
Internet Publishing
Engineering
View Job Details
Related
Data Scientist
2026-05-20
Full-time
Mid-Senior
India
Internet Publishing
Engineering
View Job Details
Related
Python Developer
2026-05-23
Full-time
Mid-Senior
United States
Internet Publishing
Engineering
Login to Apply
- Posted
- Aug 25, 2025
- Type
- Full-time
- Level
- Entry
- Location
- Boston
- Company
- Lensa
Industries
Internet Publishing
Categories
Engineering
Information Technology
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Data Scientist_ML (India)
2026-05-21
Full-time
Mid-Senior
India
Internet Publishing
Engineering
View Job Details
Related
Data Scientist
2026-05-20
Full-time
Mid-Senior
India
Internet Publishing
Engineering
View Job Details
Related
Python Developer
2026-05-23
Full-time
Mid-Senior
United States
Internet Publishing
Engineering