Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Minimum qualifications:
- PhD degree in Computer Science, a related field, or equivalent practical experience.
- One of more scientific publication submissions for conferences, journals, or public repositories (such as CVPR, ICCV, NeurIPS, ICML, ICLR, etc.).
- Experience in a university or industry labs, with primary emphasis on AI research.
- Understand of transformer architecture internals.
- Ability to drive new research ideas from problem abstraction, designing solution, experimentation, to productionisation in a rapidly shifting landscape.
- Excellent technical leadership and communication skills to conduct multi-team cross-function collaborations.
- Passionate for deep/machine learning, computational statistics, and applied mathematics.
As an organization, Google maintains a portfolio of research projects driven by fundamental research, new product innovation, product contribution and infrastructure goals, while providing individuals and teams the freedom to emphasize specific types of work. As a Research Scientist, you'll setup large-scale tests and deploy promising ideas quickly and broadly, managing deadlines and deliverables while applying the latest theories to develop new and improved products, processes, or technologies. From creating experiments and prototyping implementations to designing new architectures, our research scientists work on real-world problems that span the breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more.
As a Research Scientist, you'll also actively contribute to the wider research community by sharing and publishing your findings, with ideas inspired by internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.
Google Research Singapore is the very latest addition to the Google Research presence around the globe!
As a Research Scientist, you will be making significant breakthroughs towards Computational Efficiency of large-scale Generative AI Models (LLMs, Diffusion Models, Generative Videos).
Through foundational research, the team will deliver research on algorithmic efficiency, model compression, and inference acceleration, directly impacting how next-generation AI models will be deployed to billions of people.Google Research is building the next generation of intelligent systems for all Google products. To achieve this, we’re working on projects that utilize the latest computer science techniques developed by skilled software developers and research scientists. Google Research teams collaborate closely with other teams across Google, maintaining the flexibility and versatility required to adapt new projects and foci that meet the demands of the world's fast-paced business needs.
Responsibilities
- Advance algorithms, sampling techniques and large-scale optimization to make serving and inference of generative AI models more efficient and flexible.This includes model compression, knowledge distillation and quantization strategies.
- Innovate algorithms and large language model architectures that improve computation efficiency and generalization of training deep learning models.
- Improve the end-to-end model deployment pipeline that includes entirely new formulations of pretraining, instruction tuning, reinforcement learning, thinking and reasoning.
- Collaborate with hardware and software teams to optimize kernels and inference engines, across different hardware and model architectures.
- Optimize latency, memory bandwidth, workloads.
Key Skills
Ranked by relevanceReady to apply?
Join Google and take your career to the next level!
Application takes less than 5 minutes

