Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
What You'll Be Doing
- Develop an MCP tool to analyse microservices logs.
- Build and extend log-analysis capabilities for large-scale microservice deployments.
- Create an AI Agent to orchestrate troubleshooting across multiple microservices.
- Integrate the MCP log-analysis tool into an autonomous agent.
- Automate triage, root cause analysis, and cross-service incident workflows.
- Implement observability and feedback loops for safe operation in production.
- Evaluate and compare ReFRAG, R2R, and other retrieval/agent methods.
- Design experiments to measure performance, cost, and reliability of competing approaches.
- Propose new architectures and approaches.
- Currently pursuing a Bachelor's or Master's degree in Computer Science, Engineering, AI, Data Science, or related field.
- Strong programming fundamentals in at least one of: Python (preferred for AI frameworks), or Go.
- Solid understanding of data structures, algorithms, and API design.
- Basic knowledge of machine learning concepts, especially retrieval-based or agent-based architectures.
- Familiarity with Git, databases (SQL/NoSQL/Vector) and data pipelines.
- Strong problem-solving skills and eagerness to learn new technologies.
- Excellent communication and ability to work effectively in a team environment.
- Experience with RAG pipelines, vector databases, embeddings, or LLM orchestration.
- Exposure to MCP tools or multi-tool agent frameworks (Lang* stack preferably).
- Knowledge of containerization (Docker) and orchestration (Kubernetes).
- Familiarity with infrastructure automation.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Applications will be accepted until: November 1, 2025.
Please note: We will be reviewing applications on a rolling basis as they are submitted. Strong candidates may be contacted for next steps before the application deadline. We encourage you to apply early.
JR2003999
Key Skills
Ranked by relevanceReady to apply?
Join NVIDIA and take your career to the next level!
Application takes less than 5 minutes