-
Vox AI

Data Engineer - LLM Pipeline & Data Infrastructure

Vox AI
Netherlands · Full-time · Associate

We're building an AI-powered conversational system for drive-thru automation. As our Data Engineer, you'll design and implement the infrastructure that powers our multi-stage LLM pipeline, from data capture to processing, model training, and deployment.


Tasks

  • Build scalable real-time data pipelines for audio processing, LLM interactions, and model training

  • Design comprehensive data storage solutions across object storage, NoSQL, and analytical databases

  • Implement data quality management with filtering, normalization, and enrichment capabilities

  • Create automated processes for data preparation, model evaluation, and continuous improvement

  • Develop observability systems with monitoring, alerting, and performance dashboards

  • Establish data security and compliance protocols, including privacy protection measures

  • Build resilient data systems with error recovery, backup, and integrity verification


Requirements

What You'll Need



  • Experience designing data pipelines for AI/ML applications

  • Expertise with Apache Airflow for workflow orchestration

  • Strong knowledge of Apache Spark for large-scale data processing

  • Experience with Apache Kafka for real-time event streaming

  • Proficiency with object storage systems (S3/MinIO) and database technologies (Cassandra/ScyllaDB, ClickHouse)

  • Understanding of monitoring tools (OpenTelemetry) and observability platforms

  • Experience implementing data security and compliance measures

  • Advanced Python programming skills


Preferred Experience



  • Audio data processing and conversational AI systems

  • LLM training and fine-tuning pipelines

  • Data quality frameworks (Great Expectations) and versioning tools (LakeFS, DVC)

  • Kubernetes for container orchestration

  • Multi-region deployment and distributed systems


Benefits

  • Build cutting-edge conversational AI systems with real-world impact

  • Work with modern, open-source technology stack

  • Help shape the future of automated customer service

  • Competitive compensation and flexible work arrangements


If you're passionate about building robust data systems for AI applications and excited by complex real-time data challenges, we'd love to talk.

Key Skills

Ranked by relevance

ai storage python nosql
Login to Apply
Posted
Jul 02, 2025
Type
Full-time
Level
Associate
Location
Amsterdam
Company
Vox AI

Industries

IT Services IT Consulting

Categories

Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
Vox AI
Related

AI Engineer - LLM Systems & Alignment

2025-07-02

Full-time
Associate
Netherlands
IT Services
Information Technology
View Job Details
Accenture DACH
Related

Senior Data & Machine Learning Engineer (all genders)

2026-05-21

Full-time
Not Applicable
Austria
IT Services
Engineering
View Job Details
Vox AI
Related

Senior Python Developer: Systems & POS Integrations

2026-05-11

Full-time
Associate
Netherlands
IT Services
Information Technology