Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Minimum qualifications:
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 1 year of experience with software development in one or more programming languages during coursework/projects, research, internships, or practical experience in school, work, or Open Source projects.
- 1 year of experience with data structures or algorithms.
- Master's degree in Computer Science or Engineering.
- Ability to debug, optimize code, and automate routine tasks.
- Excellent verbal and written communication skills, and leadership in a distributed team structure.
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.
Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
With your technical expertise, you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
Responsibilities
- Design, launch, and enhance the reliability of Home products and Assistant services utilizing Google's advanced production infrastructure.
- Engage in software engineering to build and maintain services, while providing expert troubleshooting to rapidly resolve complex production issues.
- Identify opportunities and implement innovative solutions to continuously improve the reliability, performance, and development velocity of Home products.
- Provide technical leadership and mentorship to team members, guiding engineering decisions and upholding best practices in Site Reliability Engineering (SRE).
- Lead the initiative to accelerate the migration of legacy services from platforms like GCP to Google's internal systems, driving the phase-out of outdated architecture.
Key Skills
Ranked by relevanceReady to apply?
Join Google and take your career to the next level!
Application takes less than 5 minutes