Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
*Native/Bilingual English is required for this role (read/written/spoken)
Please upload your CV Resume in English.
Monthly salary: $2,000 - $3,000 USD
Along with our partner, we're looking for a talented Jr. ML Engineer who will be the main touchpoint for our partner's client. You will have to leverage technical knowledge, soft skills, and strong problem-solving skills to deliver great experiences to clients.
Key Responsibilities:
- Guide new users through onboarding, setup, and best practices
- Document technical learnings, common patterns, and solutions derived from real customer interactions
- Transform lessons learned into clear, actionable knowledge for both internal teams and external users
- Create and maintain documentation, tutorials, and sample projects
- Provide technical support via Slack, email, and meetings, helping users troubleshoot and resolve issues quickly
- Collaborate with engineering to debug production issues and prioritize fixes
- Help users architect scalable and efficient deployments using FriendliAI
- Lead customer-facing technical sessions, demos, and Q&As
Qualifications:
- 1+ years of hands-on experience with deploying & debugging LLMs on GPUs for production
- 1+ years in a developer-facing or technical support role (Customer Success, Solutions Engineering, etc.)
- Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
- Have deep knowledge of how LLMs and related technologies/features work
- Familiarity with Kubernetes and cloud infrastructure
- Excellent written and verbal communication skills
- Proficient in Python and familiar with LLM ecosystems (e.g., Hugging Face, LangChain)
- Experience working with APIs, CLI tools, and deployment workflows
- Ability to explain complex technical concepts clearly and concisely
Preferred Experience:
- Strong technical writing skills with experience authoring user-facing or internal technical content.
- Experience working with open-source model inference in production.
- Experience writing developer documentation or educational content.
- Prior experience working with enterprise customers or managing technical escalations.
- Contributions to developer communities or open source.
- Familiarity with model serving platforms and inference workflows.
- Hands-on experience building with agentic or autonomous AI frameworks.
Work Schedule: rotating through Monday - Sunday, 6 AM - 6 PM PST.
Commitment: 180 hours per month
Benefits:
- A fully remote position with a structured schedule that supports work-life balance.
- The opportunity to work with our partner at the cutting edge of generative AI infrastructure and model serving.
- Two weeks of paid vacation per year.
- 10 paid days for local holidays.
*Please note our partner is only looking for full-time dedicated team members who are eager to fully integrate within their team.
Key Skills
Ranked by relevanceReady to apply?
Join Tecla and take your career to the next level!
Application takes less than 5 minutes

