Temu
Site Reliability Engineer
TemuIreland16 hours ago
Full-timeInformation Technology, Engineering

Responsibilities:

  1. Responsible for the deployment, configuration, operations, monitoring, and troubleshooting of production environment components including servers, networks, and storage.
  2. Collaborate with development teams to design highly available and scalable system architectures.
  3. Participate in the design and development of operations-related platforms and tools.
  4. Assist in the daily management and maintenance of operations-related platform systems.


Job Requirements:

  1. Minimum of 3 years of operations experience in a medium to large-scale internet company.
  2. Familiarity with mainstream cloud platform services and their operational management, including compute instances, load balancers, networking, and object storage.
  3. Proficient in high-availability technologies with strong capabilities in fault identification and troubleshooting.
  4. Expertise in Linux system operations and proficient scripting skills.
  5. Solid understanding of computer networking fundamentals, including TCP/IP protocols and common application-layer protocols such as HTTP/HTTPS.
  6. Proactive, strong sense of responsibility, team-oriented, with excellent communication and learning abilities.
  7. Experience with large-scale containerized production environments (e.g., Docker, Kubernetes) is a plus.
  8. Experience in developing or maintaining monitoring systems, workflow automation tools, or DevOps/O&M platforms is preferred.

Key Skills

Ranked by relevance