Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We are looking for a versatile and hands-on AI Engineer / Developer to lead the development, deployment, and scaling of AI applications powered by large language models (LLMs) and related tooling. The ideal candidate combines strong software engineering fundamentals with a deep understanding of MLOps, LLMs, DevOps best practices, and GPU-based infrastructure. You'll play a pivotal role in shaping our clients AI stack-from prototype to production.
Key Responsibilities:
- Build and integrate AI-powered applications using LLMs, LangChain, and vector databases.
- Design and deploy scalable systems leveraging GPUs (on-premise or in the cloud).
- Collaborate with data scientists to transition models from research to production.
- Develop and manage CI/CD pipelines for AI workflows and AI APIs.
- Implement monitoring, logging, and observability for AI workloads (e.g., model drift, latency, resource consumption).
- Optimize model performance and cost-efficiency across inference endpoints.
- Ensure compliance with IT and security standards for model deployment and GPU workloads.
Required Skills & Experience:
- 3+ years of experience in software engineering, with hands-on exposure to AI/ML systems, web development, and building applications leveraging large language models (LLMs).
- Strong Python skills; experience with backend APIs (FastAPI, Flask) is a plus.
- Experience working with LLMs (OpenAI, Hugging Face Transformers, Anthropic, etc.).
- Hands-on with LangChain, vector stores (e.g., FAISS, Pinecone), and retrieval-augmented generation (RAG).
- Solid grasp of containerization and orchestration (Docker, Kubernetes).
- Familiarity with GPU infrastructure and deployment tools (e.g., NVIDIA Triton, CUDA, TensorRT).
- Proficient with cloud services (AWS, GCP, or Azure) and IaC (e.g., Terraform).
- Understanding of model lifecycle management and MLOps tooling (MLflow, Weights & Biases, etc.)
- Fluency in English & French
Additional information :
- Start Date: As soon as possible
- Location: Caen or Paris
- Work Arrangement: Hybrid (2 days remote per week)
- Duration: 6-month contract, renewable
- Contract Type: Freelance or Portage salarial
If you have the required skills and are available, please apply today!