Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
We have positions available for enthusiastic and experienced software developers for the development of Kubernetes related functionality in our Linux-based cluster software environment. NVIDIA's Base Command Manager is used to power thousands of Linux clusters around the world, varying from a few nodes to several thousands of nodes. Base Command Manager clusters can run on-premises, completely in the cloud, or in a hybrid environment.
What You'll Be Doing
- Development of Kubernetes integration in our Linux-based cluster management software product. You will allow customers to set up, manage and monitor Kubernetes deployments on their BCM clusters.
- Integrating other NVIDIA components into Base Command Manager.
- Ensuring that various types of workload can easily utilize GPUs through Kubernetes or other workload management systems such as Slurm.
- Development of various Kubernetes operators to facilitate different types of workload in Kubernetes.
- Following the latest developments in the area of Kubernetes.
- Assisting the support team with Kubernetes specific support tickets that require specific expertise.
- Working with the latest hardware (e.g. GPUs, AI accelerators, high-speed interconnects such as InfiniBand and Spectrum X) and software technologies such as parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO), Jupyter, various ML frameworks and tools, and Ceph.
- Degree in Computer Science or related field.
- Fluency in C++ and/or Python
- Experience with concurrent programming techniques
- 7+ years of relevant experience, ideally in the area of systems programming
- In-depth knowledge of Linux and Kubernetes
- Experience with high-performance computing and system administration would be an asset
- Experience with Slurm
- Background with GoLang would be beneficial
JR2005848
Key Skills
Ranked by relevanceReady to apply?
Join NVIDIA and take your career to the next level!
Application takes less than 5 minutes