This role sits at the intersection of cutting-edge AI and scalable production systems, focusing on building the infrastructure that powers high-performing, reliable AI applications. You will play a foundational role in shaping how AI systems are evaluated, monitored, and continuously improved in real-world environments. Working in a highly autonomous and distributed team, you’ll contribute to developing intelligent features such as AI agents and copilots while ensuring their quality, speed, and accuracy at scale. The position offers a unique opportunity to influence a growing AI platform from the ground up, combining experimentation with robust engineering practices. If you thrive in fast-paced, async environments and enjoy turning data into actionable insights, this role offers both ownership and impact.
Accountabilities
- Design and build evaluation frameworks to measure AI system performance, including both offline development metrics and real-time production monitoring
- Develop observability tools and dashboards to track system quality, latency, and accuracy over time
- Investigate and diagnose performance issues across AI pipelines, identifying root causes such as retrieval, ranking, or prompt design flaws
- Experiment with different models, architectures, and agent configurations to optimize system outcomes
- Prototype and implement improvements to retrieval-augmented generation (RAG) pipelines, including chunking, retrieval, and re-ranking strategies
- Analyze user interactions with AI features to uncover opportunities for enhancement and innovation
- Collaborate closely with engineering teams to validate improvements and ensure system reliability
- Stay up to date with emerging AI tools, frameworks, and best practices to continuously evolve platform capabilities
Requirements
- 6+ years of experience in software engineering, data science, or machine learning, with exposure to domains such as NLP, search, or recommendation systems
- Proven experience building and evaluating production-grade AI systems, including LLM-based applications and RAG pipelines
- Strong proficiency in Python for experimentation, prototyping, and data analysis
- Solid understanding of AI system architecture, including embeddings, retrieval mechanisms, and context management
- Experience designing evaluation pipelines, A/B testing frameworks, and performance measurement systems
- Ability to build internal tools and infrastructure such as dashboards and data processing pipelines
- Willingness to work with and learn additional technologies (e.g., Ruby on Rails) for system integration
- Strong analytical mindset with the ability to interpret complex data and communicate insights clearly
- Comfortable working in fast-paced, ambiguous environments with high autonomy and accountability
- Excellent collaboration and communication skills in a distributed, async team
- Full professional fluency in English
Benefits
- Competitive salary package ranging from $130,000 to $140,000 USD annually, with regular compensation reviews
- Equity participation and performance-based bonus opportunities
- Fully remote work environment with flexibility to work from anywhere
- Generous paid time off (35 days annually) plus sabbatical opportunities after long-term tenure
- Comprehensive healthcare coverage for employees and their families (or equivalent reimbursement options)
- Parental leave to support growing families
- Home office setup allowance
- Learning and development stipend for continuous skill growth
- Biannual company retreats in international locations
- High-trust, outcome-driven culture with strong emphasis on autonomy and impact
Why Apply Through Jobgether?
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

