We are seeking a highly skilled Performance Optimization Engineer with a strong understanding of CPU and GPU architectures to join our team. The candidate will be responsible for optimizing software performance through low-level optimization techniques.
Key Responsibilities
- Analyze and optimize code performance on various CPU and/or GPU architectures to drive improvements in speed and efficiency.
- Apply low-level optimization techniques that account for hardware architecture details, such as the memory hierarchy, and for performance trade-offs.
- Collaborate with cross-functional teams to understand project requirements and deliver optimized software solutions.
- Conduct performance testing and profiling to identify bottlenecks and propose improvements.
Required Qualifications
- Strong knowledge of CPU and/or GPU architectures.
- Proficiency in low-level optimization techniques, with a solid understanding of memory hierarchy and instruction scheduling.
- Experience in C++ and/or Python programming.
- Excellent problem-solving skills and the ability to work independently and collaboratively.
- Strong background in developing high-performance computing applications and scientific software in C++.
- Familiarity with GPU software development using, for example, CUDA or OpenCL.
- Experience with deep learning frameworks such as PyTorch or TensorFlow.
- Experience in high-performance computing (HPC) environments.
Join Unikie and take your career to the next level!

