We are HCLTech, one of the fastest-growing large tech companies in the world and home to 225,000+ people across 60 countries, supercharging progress through industry-leading capabilities centered around Digital, Engineering and Cloud. Our people, the driving force behind that work, are diverse, creative, and passionate, raising the bar for excellence on a regular basis. We, in turn, work hard to bring out the best in them as we strive to help them find their spark and become the best version of themselves. If all this sounds like an environment you’ll thrive in, then you’re in the right place.
As a Site Reliability Engineer (SRE) at HCLTech, you will play a critical role in maintaining the reliability, scalability, and availability of cloud-based production systems. You will be responsible for proactive incident management, infrastructure automation, and performance optimization across AWS and multi-cloud environments. Your expertise will directly contribute to the seamless operation and ongoing improvement of mission-critical services, ensuring that HCLTech continues to deliver industry-leading solutions to its global clientele.
Responsibilities
Incident Management
- Provide 24x7 support for “Keep the Lights On” (KTLO) operations.
- Respond promptly to incidents, alerts, and escalations affecting production systems.
- Perform thorough root cause analysis and implement corrective actions to prevent recurrence.
AWS Cloud Infrastructure Management
- Assist in building and configuring new AWS cloud instances and environments.
- Support code deployments and rollouts across multiple AWS environments.
- Maintain and update infrastructure-as-code (IaC) templates for consistent, repeatable deployments.
Patching and Deployment
- Execute critical system patching using CrowdStrike’s internal tools and protocols.
- Automate deployment pipelines utilizing Jenkins, ArgoCD, or similar CI/CD tools.
- Validate deployment outcomes and execute rollback strategies as necessary.
Monitoring and Observability
- Set up and maintain observability tests leveraging Grafana, Prometheus, and proprietary tools.
- Proactively monitor service health, latency, and overall availability.
- Optimize alert thresholds and reduce false positives to ensure actionable monitoring.
Performance Optimization
- Conduct performance tuning for applications and databases to maximize efficiency.
- Analyze logs and metrics to identify and address system bottlenecks.
- Recommend architectural improvements to enhance scalability and resilience.
Documentation and Knowledge Transfer
- Maintain comprehensive runbooks, standard operating procedures (SOPs), and architectural diagrams.
- Lead knowledge transfer sessions with internal teams to foster collaboration.
- Document incident postmortems and share lessons learned across teams.
Required Skills and Qualifications
- Minimum 5 years of experience in SRE, DevOps, or cloud infrastructure roles.
- Deep expertise in AWS services including EC2, RDS, S3, VPC, IAM, CloudWatch, and CloudTrail.
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform and AWS CloudFormation.
- Experience with AWS Control Tower, EKS, and exposure to multi-cloud environments (GCP, OCI).
- Hands-on experience with CI/CD tools and practices, including Jenkins, ArgoCD, Codefresh, and GitOps workflows.
- Advanced scripting skills in Python, Shell, or Go for automation tasks.
- Familiarity with monitoring and observability tools such as Grafana, Prometheus, AppDynamics, Splunk, and AWS CloudWatch.
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
How You’ll Grow
At HCLTech, we offer continuous opportunities for you to find your spark and grow with us. We want you to be happy and satisfied with your role and to learn what type of work best sparks your brilliance. Throughout your time with us, we offer transparent communication with senior-level employees, learning and career development programs at every level, and opportunities to experiment in different roles or even pivot industries. We believe that you should be in control of your career, with unlimited opportunities to find the role that fits you best. Explore more career paths with us at www.hcltech.com.
Why Us
- We offer end-to-end digital transformation expertise that helps clients from strategy through execution
- We work with the biggest brands, offering the opportunity to be a part of industry-leading work
- We are invested in your growth, offering learning and career development opportunities at every level to help you find your spark
- We offer freedom and flexibility on the job, empowering our employees to make decisions
- We offer a virtual-first work environment, promoting a good work-life balance and real flexibility
- Our company is extremely diverse, with representation from 165 nationalities
- We offer the opportunity to work with colleagues across the globe
- We offer comprehensive benefits for all employees
- We are a certified great place to work and a top employer in 25 countries, offering a positive work environment that values employee recognition and respect
HCLTech is committed to protecting and securing the privacy and confidentiality of the Personal Data which it collects directly or indirectly from you when applying for a job at HCLTech either directly or through a third-party human resources agency. This notice (the “Notice”) outlines and explains how HCL Technologies Limited including its subsidiaries, local employing entities, associates, and affiliated companies [collectively referred to as “HCLTech”, “us,” “our”, or “we”] will process your Personal Data in accordance with applicable privacy legislation(s).
https://www.hcltech.com/candidate-privacy-notice