Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
About the Role:
The Applied AI Engineer will design, develop, build and deploy on-premise AI solutions, integrating large language models (LLMs), retrieval-augmented generation (RAG), and intelligent agents into enterprise systems. This role combines machine learning engineering, generative AI development, and prompt design to deliver secure, scalable, and business-aligned AI applications.
Key Responsibilities:
- Model Engineering :
- Fine-tune and optimize open-source LLMs using techniques such as LoRA/QLoRA.
- Quantize and deploy models efficiently using frameworks like vLLM, Triton, or TensorRT.
- Evaluate model performance with domain-specific benchmarks and metrics.
- RAG & AI Development :
- Implement embeddings, vector databases, and retrieval-augmented generation pipelines.
- Develop AI agents and workflows using frameworks such as LangChain, LangGraph, or Semantic Kernel.
- Connect AI systems securely with enterprise platforms (ERP, CRM, IoT, PLM, file systems).
- Prompt & Workflow Design :
- Engineer prompts, structured templates, and chaining logic to improve reliability.
- Collaborate with business teams to tailor AI outputs to specific use cases and compliance requirements.
- Implement guardrails, grounding, and refusal rules to reduce hallucinations and ensure safe outputs.
- Collaboration & Delivery :
- Work with Data Engineers and MLOps teams to integrate clean data pipelines.
- Partner with domain SMEs to align AI solutions with business processes.
- Document workflows, best practices, and provide internal training on AI usage.
Required Skills:
- Strong proficiency in Python and ML/AI libraries (PyTorch, HuggingFace, Transformers).
- Experience with LLM fine-tuning, optimization, and inference serving (LoRA, vLLM, Triton).
- Hands-on expertise with vector databases (Qdrant, Milvus, pgvector) and RAG pipelines.
- Knowledge of agent frameworks (LangChain, LangGraph, Semantic Kernel) and tool integration.
- Practical experience in prompt engineering and structured output design.
- Familiarity with on-prem AI deployment, containerization (Docker/Kubernetes), and API integrations.
- Strong problem-solving mindset and ability to translate business needs into AI solutions.
Key Skills
Ranked by relevanceReady to apply?
Join Calidus, LLC and take your career to the next level!
Application takes less than 5 minutes