Site Reliability Engineer

LANDI Global

Singapore · Full-time · Associate

As a Site Reliability Engineer, you will assist in the operation and maintenance of LANDI Global infrastructures. Your responsibilities will include supporting the platform's reliability and performance while learning from senior engineers.

Key Responsibilities:

Help build and maintain platform infrastructures across various environments.
Collaborate with the R&D team to ensure platform availability and scalability.
Assist in implementing monitoring and alerting systems for timely issue resolution.
Support the maintenance of Disaster Recovery plans for business continuity.
Analyze performance metrics and contribute to cost optimization strategies.
Participate in automated testing, CI/CD processes, and deployment efficiency.
Help manage incident reporting and change management processes.
Provide operational support for platforms and assist with production issues.
Participate in a 24/7 standby rotation.
Support environment deployments for new client onboarding.

EXPERIENCES

At least 3 years or more experience in similar capacity
Excellent oral and written communication in English.

PREFERRED SKILLS

Candidates should ideally have experience in some of the following technologies:

Experience in various cloud technologies (e.g. AWS, Azure)
Experience in distributed Linux/Unix operating systems
Experience in high-level programming or scripting languages
Experience in monitoring tools (e.g. Prometheus, Grafana, Zabbix)
Experience in configuration management tools (e.g. Ansible, Chef, Puppet)
Experience in SQL databases (e.g. Postgres, MySQL)
Experience in load balancing and reverse proxies (e.g. Nginx)
Experience in CI/CD tools (e.g. Jenkins, GitLab)
Experience in Containerization (e.g. Dockers, K8s)

Key Skills

Ranked by relevance

cicd configuration management containerization prometheus jenkins ansible grafana cloud sql aws

Related Jobs

3 roles aligned with this opportunity

View all jobs