-
View all jobs
Licorne Society a été missionné par une startup IA en pleine croissance pour les aider à trouver leur Lead LLM Engineer.
What You Will Own
You will be responsible for one thing:
Make our AI outputs reliable, fast, and indispensable in real workflows.
Concretely
Most teams fail because:
You Will Turn
What Success Looks Like (first 90 Days)
What You Will Own
You will be responsible for one thing:
Make our AI outputs reliable, fast, and indispensable in real workflows.
Concretely
- Design and evolve our LLM / agent architecture
- Own output quality across key use cases (emails, document analysis, etc.)
- Build evaluation systems (datasets, metrics, regression detection)
- Drive fast iteration loops from production data
- Improve retrieval, reasoning, and tool usage
- Ensure production reliability (latency, failure modes, fallback)
- Work directly with product + founders on what to build and why
Most teams fail because:
- they don’t know what “good output” means
- they don’t have evals
- they iterate randomly
- they overuse agents
You Will Turn
- vague user problems
- → into structured AI systems
- → with measurable performance
- → that improve every week
- Shipping real LLM systems
- You’ve built systems used in production (not demos)
- You understand RAG, tools, agents, structured outputs
- You can design full pipelines, not just prompts
- Evaluation-driven development
- You know how to define quality metrics
- You build datasets from real usage
- You run continuous evals to prevent regressions
- Debugging complex failures
- You can trace issues across:
- retrieval
- prompts
- model behavior
- You don’t guess — you isolate and fix
- Speed of iteration
- You move from problem → improvement in hours or days, not weeks
- You use logs, traces, and data — not intuition alone
- Strong judgment
- You know when to:
- use an agent vs a pipeline
- add complexity vs simplify
- You optimize for reliability and user value, not novelty
- Number of years of experience
- Whether you’ve used a specific framework
- Fancy research credentials
What Success Looks Like (first 90 Days)
- Clear eval framework for core use cases
- Measurable improvement in output quality
- Faster iteration cycles across the team
- Reduced hallucinations / failures
- Stronger system architecture decisions
- Python (FastAPI)
- Postgres
- Google Cloud
- LangGraph / LangChain (evolving)
- PostHog (product analytics)
- Langfuse (LLM traces)
- LLM APIs (Azure OpenAI)
Key Skills
Ranked by relevance
ai
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Backend Engineer
2026-05-18
Full-time
Associate
France
Technology
Engineering
View Job Details
Related
Full-Stack Developer (senior)
2026-05-22
Full-time
Associate
France
Technology
Engineering
Login to Apply
- Posted
- May 22, 2026
- Type
- Full-time
- Level
- Associate
- Location
- Paris
- Company
- Leonar
Industries
Technology
Information
Internet
Categories
Engineering
Related Jobs
3 roles aligned with this opportunity
View Job Details
Related
Backend Engineer
2026-05-18
Full-time
Associate
France
Technology
Engineering
View Job Details
Related
Full-Stack Developer (senior)
2026-05-22
Full-time
Associate
France
Technology
Engineering