Zoho
Cloud Operations Engineer
Zoho · India · 6 days ago
Full-time · Information Technology

Zoho is one of the world's most prolific software companies. With 55+ applications in nearly every major business category, including sales, marketing, customer service, accounting and back office operations, and an array of productivity and collaboration tools built from the ground up, Zoho has the depth and breadth to solve even the most complex business challenges.


Zoho has more than 130 million users and over 18,000 employees across the globe, and hundreds of thousands of companies, including Zoho itself, rely on it every day to run their businesses. After 29 years as a private, bootstrapped, and profitable company, we understand what it takes to run a sustainable, resilient business.


Job Role: Cloud Operations Engineer

Work Experience: 1 - 5 Years

Work Location: Chennai


Job Overview:

We are looking for motivated and technically proficient individuals to join our Cloud and Infrastructure Operations team. The role involves managing Zoho’s global data centers, ensuring high availability, performance, and reliability across our cloud environments.


Key Responsibilities:

1. Linux & Cloud Infrastructure Management - Strong knowledge of Linux operating systems and cloud infrastructure components such as virtualization, load balancers, and storage systems.


2. Data Center Operations - Participate in the management of Zoho’s global data centers, covering the complete lifecycle of servers, storage systems, and supporting infrastructure.


3. Monitoring & Incident Management - Work in 24x7 rotational shifts to monitor servers and applications. Conduct root cause analysis (RCA) for incidents and implement preventive measures.


4. Server Configuration & Provisioning - Perform automated Linux OS installations and configurations for large-scale deployments using industry-standard tools and methods.


5. Infrastructure Component Management - Administer DNS, DHCP, NTP, Proxy, OS Image Servers and Load Balancers.


6. Disaster Recovery Validation - Conduct periodic validation and testing of Disaster Recovery (DR) data centers across multiple global locations.


7. Automation & Tools - Utilize open-source automation tools such as Puppet or Ansible for configuration management and package automation.


8. Scripting & Development - Develop and maintain shell and Python scripts to automate operational tasks.


9. Networking Fundamentals - Possess foundational knowledge of networking concepts including VLAN, NAT, ARP, firewalls and switch configuration.


10. Database & SaaS Components - Basic understanding of SaaS ecosystem components such as RDBMS, NoSQL, and Hadoop.


11. Hardware Knowledge - Familiarity with server hardware components, including CPU, drive types, and internal architecture.


12. Collaboration & Communication - Work effectively with cross-functional teams, including network, DevOps, framework, and application development teams.


13. Work Management - Capable of handling multiple priorities independently with minimal supervision.


14. Continuous Learning - Demonstrate a strong interest in learning new technologies related to operating systems, hardware, and cloud infrastructure.


Required Skills:

· Linux system administration

· Cloud infrastructure management

· Automation using Ansible / Puppet

· Shell & Python scripting

· Networking fundamentals

· Strong communication and coordination skills
