AI Software Engineer

We are looking for a versatile, hands-on AI Engineer / Developer to lead the development, deployment, and scaling of AI applications powered by large language models (LLMs) and related tooling. The ideal candidate combines strong software engineering fundamentals with a deep understanding of MLOps, LLMs, DevOps best practices, and GPU-based infrastructure. You'll play a pivotal role in shaping our client's AI stack, from prototype to production.

Key Responsibilities:

Build and integrate AI-powered applications using LLMs, LangChain, and vector databases.
Design and deploy scalable systems leveraging GPUs (on-premise or in the cloud).
Collaborate with data scientists to transition models from research to production.
Develop and manage CI/CD pipelines for AI workflows and AI APIs.
Implement monitoring, logging, and observability for AI workloads (e.g., model drift, latency, resource consumption).
Optimize model performance and cost-efficiency across inference endpoints.
Ensure compliance with IT and security standards for model deployment and GPU workloads.

Required Skills & Experience:

3+ years of experience in software engineering, with hands-on exposure to AI/ML systems, web development, and building applications leveraging large language models (LLMs).
Strong Python skills; experience with backend APIs (FastAPI, Flask) is a plus.
Experience working with LLMs (OpenAI, Hugging Face Transformers, Anthropic, etc.).
Hands-on experience with LangChain, vector stores (e.g., FAISS, Pinecone), and retrieval-augmented generation (RAG).
Solid grasp of containerization and orchestration (Docker, Kubernetes).
Familiarity with GPU infrastructure and deployment tools (e.g., NVIDIA Triton, CUDA, TensorRT).
Proficient with cloud services (AWS, GCP, or Azure) and IaC (e.g., Terraform).
Understanding of model lifecycle management and MLOps tooling (MLflow, Weights & Biases, etc.).
Fluency in English and French.

Additional Information:

Start Date: As soon as possible
Location: Caen or Paris
Work Arrangement: Hybrid (2 days remote per week)
Duration: 6-month contract, renewable
Contract Type: Freelance or Portage salarial

If you have the required skills and are available, please apply today!

Post Date: 2025-06-07
Job Type: Remote
Employment Type: Contract