Agilyti

Data Engineer

Poland · Full-time · Mid-Senior

Data Engineer – Databricks


The Role

We are hiring a Data Engineer with strong Databricks expertise to design and scale our clients' Lakehouse architecture, which supports AI-driven clinical and real-world data workflows. You will work closely with AI engineers, biostatisticians, and client-facing strategy teams to build robust, high-performance data pipelines that power agentic AI systems, simulation environments, and advanced analytics. This is infrastructure for AI, not just reporting.


Key Responsibilities

  • Design and implement scalable data pipelines using Databricks (Delta Lake) and PySpark.
  • Architect medallion-style (Bronze/Silver/Gold) workflows optimized for AI and analytics.
  • Build ingestion pipelines for structured and unstructured clinical datasets.
  • Optimize Spark clusters for performance, cost efficiency, and reliability.
  • Partner with AI engineers to structure data for RAG pipelines, simulations, and agentic workflows.
  • Implement best practices using Unity Catalog for governance, lineage, and access control.
  • Build data validation, monitoring, and observability into pipelines.
  • Deploy and manage infrastructure within AWS (S3, Glue, IAM, Redshift).
  • Monitor Spark jobs and troubleshoot performance bottlenecks.


Required Qualifications

  • Strong hands-on experience with Databricks (Delta Lake).
  • Deep knowledge of Spark / PySpark.
  • Experience with AWS cloud services.
  • Strong SQL and modern data modeling experience.
  • Experience building scalable Lakehouse architectures.
  • Experience working with large, complex datasets.


Preferred Experience (Life Sciences Context)

  • Experience with clinical trial data, RWD/RWE, or healthcare datasets.
  • Familiarity with IQVIA, Veeva, or pharma commercial data ecosystems.
  • Experience working in regulated environments (GxP).
  • Exposure to ML workflows or AI platform engineering.


Preferred Certifications

  • Databricks Certified Data Engineer Associate.
  • Databricks Certified Data Engineer Professional.
  • Databricks Certified Machine Learning Associate (nice to have).
  • AWS Certified Data Analytics – Specialty (nice to have).


Education

Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related technical field.


Why This Role Matters

You will help build the data backbone for clinically grounded AI systems used in global life sciences environments. This is an opportunity to work on modern Lakehouse architecture that powers agentic AI and simulation systems, not just dashboards.

Key Skills

Ranked by relevance

AI, Spark, simulation, AWS, machine learning, Unity, cloud, SQL, S3
Posted: Mar 30, 2026
Type: Full-time
Level: Mid-Senior
Location: Poland
Company: Agilyti
Industries: Software Development
Categories: Information Technology
