Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
In this role you will be part of our team responsible for the development of libraries that provide groundbreaking functionality and performance. The internship may include extending the capabilities of existing as well as building new libraries that will be used in various AI and HPC applications. It will involve working with senior software engineers who will provide mentorship and guidance. The project will include implementing new algorithms, defining APIs, analyzing performance, finding appropriate solutions for difficult numerical corner cases, and other general software engineering work.
What You’ll Be Doing
- Collaborate with team members to understand software use cases and requirements
- Analyze the performance of GPU or CPU implementations and find opportunities for improvements
- Prototype and develop algorithms for single node and multi GPU clusters
- Studying towards a MS or PhD degree in Computational Science, Computer Science, Applied Mathematics, Engineering, or a related field.
- Programming skills (C/C++, Python)
- Parallel or GPU programming experience (AVX, NEON, OpenMP, MPI, SHMEM, CUDA or OpenCL)
- Exposure to floating-point arithmetic and numerical error analysis.
- Knowledge of GPU/CPU and network hardware architecture.
- Understanding of composability and fusions, compilers, and implementation of programming languages
- Experience implementing sparse or dense linear algebra algorithms.
- Experience with domain-specific language design and compiler optimizations, in particular sparse compilers (MLIR or TACO)
JR2007027
Key Skills
Ranked by relevanceReady to apply?
Join NVIDIA and take your career to the next level!
Application takes less than 5 minutes

