Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
About the role
In this role, you will help translate advanced AI research into production-ready solutions for edge environments. Your focus will be on optimizing large language models, improving system performance, and developing agentic AI capabilities that can run efficiently on resource-constrained hardware.
Main responsibilities
- Optimize LLMs and multimodal models for deployment on edge and embedded devices.
- Apply model optimization techniques such as quantization, pruning, and distillation to improve performance and efficiency.
- Improve inference performance through system-level optimizations and efficient decoding strategies.
- Develop and implement agentic AI capabilities, including tool orchestration and function calling.
- Design secure and reliable agent workflows, incorporating guardrails and safe tool invocation mechanisms.
- Deploy optimized models using inference engines and frameworks such as llama.cpp, ONNX Runtime, TFLite, and Ollama.
- Build benchmarking pipelines to evaluate the performance of generative and agentic AI systems on-device.
- Develop proofs of concept and demonstrators for edge AI use cases.
- Translate research innovations into production-ready implementations and collaborate with engineering teams to integrate them into products.
What you bring
- MSc, EngD, or PhD in Computer Science, AI, or a related technical field.
- 5+ years of experience in software or AI engineering with strong exposure to LLMs, VLMs, and performance optimization.
- Experience with model optimization techniques such as quantization, pruning, and efficient inference strategies.
- Strong experience with AI frameworks such as PyTorch or TensorFlow.
- Experience with agentic AI frameworks (e.g., LangChain or similar ecosystems).
- Understanding of safety and security mechanisms for AI agents, including guardrails and secure function calling.
- Experience with AI deployment toolchains and inference engines (e.g., CUDA, TensorRT, ONNX, TFLite).
- Experience working with embedded systems, NPUs, or edge AI hardware.
- Strong programming skills in Python, C/C++, and Linux environments.
- Familiarity with MLOps environments, build systems, and cross-compilation workflows is a plus.
- Strong communication skills and experience working in international and cross-functional teams.
Key Skills
Ranked by relevanceReady to apply?
Join MCA Nederland and take your career to the next level!
Application takes less than 5 minutes

