Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
About the role
Biological AI models (protein folding, protein design, and large foundation models) are powerful but heavy and expensive to run. This project focuses on making them faster and more efficient at inference time without significantly hurting biological quality.
You will work on profiling bottlenecks, applying model compression and architectural optimizations, and building efficient inference pipelines. The goal is to make these models practical for real-world research and production use.
Your responsibilities will include:
- Profile inference bottlenecks in selected biological models
- Implement and test optimization techniques (quantization, pruning, distillation)
- Explore efficient attention and architecture-level improvements
- Build and benchmark optimized inference pipelines
- Evaluate speed, memory, and accuracy trade-offs
- Write clean, well-documented experimental code
- Share results and practical deployment recommendations
We expect you to have:
- Enrollment in or completion of a degree in computer science, AI, or a related field
- Strong knowledge of computer science and machine learning fundamentals
- Experience with Python and familiarity with deep learning frameworks
- Ability to write clean, efficient code
- Strong problem-solving skills and a willingness to learn quickly
- Interest in life sciences
Nice to have:
- Familiarity with large language models and transformer architectures
- Experience profiling GPU workloads and optimizing deep learning systems
- Experience with model compression techniques (quantization, pruning, distillation)
- Experience working with large-scale models or distributed inference
- Contributions to open-source ML projects
What we offer
- Competitive salary and comprehensive benefits package
- Opportunities for professional growth within Nebius
- A dynamic and collaborative work environment that values initiative and innovation
We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!
Key Skills
Ranked by relevanceReady to apply?
Join Nebius and take your career to the next level!
Application takes less than 5 minutes

