What will you do?
- Integrate LLMs/SLMs into apps and backend services, including prompt design and chaining logic.
- Design and manage API orchestration across AI and non-AI components.
- Build and deploy AI systems using containerized environments (e.g., Docker, Kubernetes).
- Optimize inference performance for low-latency and resource-constrained scenarios.
- Develop and maintain speech pipelines, including STT (Speech-to-Text) and TTS (Text-to-Speech).
- Design and orchestrate AI agents integrating outputs from NLP pipelines (intent classification, entity recognition) and voice interfaces.
- Collaborate on the integration of ASR/TTS services and support conversational flow design.
- Collaborate with platform engineers to ensure the reliable and secure deployment of AI services.
- Contribute to architecture decisions and continuously improve system performance and cost.
What are we looking for?
- 4 to 7 years of experience as an AI Engineer, Machine Learning Engineer, or similar.
- Experience integrating LLMs or SLMs (OpenAI, Mistral, Azure OpenAI, Hugging Face, among others).
- Strong experience with containerized deployment and orchestration using Docker and Kubernetes.
- Proficiency in API design and orchestration of multi-component systems.
- Solid knowledge of inference optimization strategies (quantization, batching, caching).
- Experience with speech technologies (e.g., Whisper, Azure Speech, Google TTS/STT).
- Strong coding skills in Python and familiarity with modern AI frameworks.
- Familiarity with AI agents, tools, frameworks, or multi-modal AI.
- Experience working in production-grade environments and CI/CD workflows.
- Experience with vector databases, semantic search, or retrieval-augmented generation (RAG).
- Exposure to automotive, IoT, or other embedded/edge AI scenarios.
- Understanding of data privacy, on-device inference, or model compression techniques.
What can you expect from us?
- A permanent job contract for a long-term project;
- Tech equipment + SIM Card + personal smartphone;
- Health and Life Insurance;
- Social events and team-building activities;
- A commitment to helping you grow with us and be rewarded accordingly;
- A dynamic, young team that will always be there to support you;
- Training in the latest technologies;
- Coffee, fruit, snacks, and a warm welcome whenever you stop by the office.
Join Caixa Mágica Software and take your career to the next level!