Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
As a Senior Software Engineer in the Azure HPC/AI team, you will play a critical role in designing and delivering the next generations of our platform by solving technical problems at all levels of the stack, contributing to our codebases to enable new features on our VMs, working on architectural proposals, and collaborating with our industry partners.
This position involves deep technical work that primarily focuses on hardware/software interactions, device virtualization, and performance analysis of GPU workloads in VMs. Since our team is also responsible for the vertical integration of our VM offerings, you will also have the opportunity to work with upper layers of the Azure infrastructure software.
It is an exciting time for the team as we are working on expanding the capacity and range of supported scenarios to fuel the next growth wave. This position offers a unique opportunity to have a huge impact on Microsoft’s AI infrastructure and AI initiatives.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. #azurehpcai
Responsibilities
We are looking for a Senior Software Engineer who is committed to quality, wants the customer to succeed and gets things done. You will join a phenomenal team of hardworking engineers with deep experience in high performance computing, cloud infrastructure, operating systems, machine learning, and software engineering in general.
- Analyzes functionality, integration, and performance issues at various levels of the hardware/software stack on current and future generations of AI training platforms.
- Designs and codes solutions that improve functional correctness, stability and performance of AI training oriented VM offerings and related services. When appropriate, drives internal partner teams or industry partners to implement such solutions.
- Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices.
- Holds accountability as a Designated Responsible Individual (DRI), and collaborates with other engineers across products/solutions, working as on-call to monitor system/product/service for degradation, downtime, or interruptions.
- Develops a playbook for the team to resolve issues.
- Maintains communication with key partners across the Microsoft ecosystem of engineers.
- Your mission will be to help ensure Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads. You will help build a test-driven engineering culture to reduce regressions and bugs in production and will set a higher bar for infrastructure quality.
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Experience in HPC or Machine Learning
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Master's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Familiarity with Operating Systems fundamentals and virtualization technologies
- Experience on High Performance Computing / Machine Learning middleware
- Experience on Profiling and Performance Analysis Tools
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Key Skills
Ranked by relevanceReady to apply?
Join Microsoft and take your career to the next level!
Application takes less than 5 minutes