Site Reliability Engineer

As a Site Reliability Engineer, you will assist in the operation and maintenance of LANDI Global infrastructures. Your responsibilities will include supporting the platform's reliability and performance while learning from senior engineers.


Key Responsibilities:

  • Help build and maintain platform infrastructures across various environments.
  • Collaborate with the R&D team to ensure platform availability and scalability.
  • Assist in implementing monitoring and alerting systems for timely issue resolution.
  • Support the maintenance of Disaster Recovery plans for business continuity.
  • Analyze performance metrics and contribute to cost optimization strategies.
  • Participate in automated testing, CI/CD processes, and deployment efficiency.
  • Help manage incident reporting and change management processes.
  • Provide operational support for platforms and assist with production issues.
  • Participate in a 24/7 standby rotation.
  • Support environment deployments for new client onboarding.


EXPERIENCES

  • At least 3 years or more experience in similar capacity
  • Excellent oral and written communication in English.


PREFERRED SKILLS

Candidates should ideally have experience in some of the following technologies:

  • Experience in various cloud technologies (e.g. AWS, Azure)
  • Experience in distributed Linux/Unix operating systems
  • Experience in high-level programming or scripting languages
  • Experience in monitoring tools (e.g. Prometheus, Grafana, Zabbix)
  • Experience in configuration management tools (e.g. Ansible, Chef, Puppet)
  • Experience in SQL databases (e.g. Postgres, MySQL)
  • Experience in load balancing and reverse proxies (e.g. Nginx)
  • Experience in CI/CD tools (e.g. Jenkins, GitLab)
  • Experience in Containerization (e.g. Dockers, K8s)

Post Date
2025-06-17
Job Type
-
Employment type
Full-time
Category
Engineering, Information Technology
Level
Associate
Country
Singapore
Industry
IT Services , IT Consulting ,
LANDI Global*******