-
Barrington James

Senior ML Data Engineer

Barrington James
Netherlands · Full-time · Mid-Senior

This fast-growing, well-funded health-tech company is on a mission to transform how clinicians understand and treat complex diseases. By combining advanced multimodal AI with real-world clinical data, they are building next-generation tools to help medical professionals see the full picture of a patient’s condition — from imaging to genomics to treatment history.


Working in close collaboration with leading hospitals and research institutes across Europe, the company has doubled in size over the past year to 80+ team members from more than 25 nationalities, with offices in Zurich and Amsterdam.


About the Role

As a Senior Data Engineer, you’ll be at the heart of building large-scale, high-quality datasets that power state-of-the-art foundation models for healthcare. You’ll design and implement advanced data pipelines for sourcing, generating, and curating complex multimodal datasets at massive scale.

You will:

  • Build high-throughput pipelines to ingest multimodal data at petabyte scale.
  • Develop systems for synthetic data generation to enhance model training.
  • Create filtering and rating systems for topic relevance, quality, and compliance.
  • Partner closely with ML researchers to steer the development of cutting-edge AI models.


What We’re Looking For

Essential Skills & Experience

  • Excellent Python programming skills.
  • Strong experience with distributed computing frameworks (Ray, Spark, or similar).
  • Proven track record in designing and operating large-scale data pipelines.
  • Hands-on experience with synthetic-data pipelines for LLMs.
  • Deep familiarity with modern data architectures (Delta, Iceberg) and columnar formats (Parquet, ORC).
  • Expertise in core data-processing techniques (hashing, deduplication, chunking) and related performance trade-offs.
  • Strong communication skills for presenting technical concepts and experimental results


If you have a passion for building world-class data infrastructure and want to work on AI that genuinely helps people, I'd love to hear from you.


Following your application, Jay Robins, a specialist AI recruiter will discuss the opportunity with you in detail.


He will be more than happy to answer any questions relating to the industry and the potential for your career growth.


The conversation can also progress further to discussing other opportunities, which are also available right now or will be imminently becoming available.


This position has been highly popular, and it is likely that it will close prematurely. We recommend applying as soon as possible to avoid disappointment.

Key Skills

Ranked by relevance

ai distributed computing python spark
Login to Apply
Posted
Aug 08, 2025
Type
Full-time
Level
Mid-Senior
Location
Amsterdam Area

Industries

Biotechnology Research Pharmaceutical Manufacturing Research Services

Categories

Science Information Technology Engineering

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
OptiComm.ai
Related

Artificial Intelligence Engineer

2026-06-17

Full-time
Mid-Senior
Romania
Artificial Intelligence
Engineering
View Job Details
Barrington James
Related

Machine Learning Engineer

2026-06-17

Full-time
Mid-Senior
Switzerland
Biotechnology Research
Engineering
View Job Details
Bitdefender
Related

Senior Software Engineer

2026-06-16

Full-time
Mid-Senior
Romania
Construction
Engineering