SmartChoice International Limited

Site Reliability Engineer

Turkey · Full-time · Mid-Senior

Job Title: Site Reliability Engineer

Location: Türkiye (Remote); candidates based in Türkiye are preferred

Contract Duration: 6 months + extensions


Job Description:

We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our remote team based in Türkiye. This role is instrumental in maintaining, improving, and scaling our systems to ensure high reliability and performance. You will collaborate closely with cross-functional teams to enhance system stability, manage deployments, and drive operational excellence.


Key Responsibilities:

  • Monitoring and Incident Management: Implement and maintain OpenSearch logging, alerting systems, and monitoring tools like Prometheus, Grafana, and Instana to ensure system health.
  • Infrastructure Management: Build, deploy, and manage cloud infrastructure using AWS services (e.g., CDK, Lambda, serverless computing) and CI/CD pipelines.
  • Performance Optimization: Define and track service level indicators (SLIs), service level objectives (SLOs), service level agreements (SLAs), and latency metrics to keep systems reliable and efficient.
  • Container Orchestration: Deploy and manage Kubernetes clusters, Docker containers, ingress configurations, and image management with Harbor.
  • Micro-frontend Development: Support micro-frontend architectures and create scripts using Bash or Python to automate processes.
  • Data Streaming and APIs: Work with Kafka frameworks and GraphQL to ensure seamless data integration and efficient API management.


Qualifications:


Basic Proficiency in:

  • OpenSearch logging and alerting systems.
  • Node.js and TypeScript programming.


Intermediate Proficiency in:

  • Micro-frontend development and scripting (Bash, Python).
  • Kubernetes (k8s), Docker, ingress configurations, and Harbor image management.
  • Kafka framework for data streaming.
  • Monitoring tools like Prometheus, Grafana, and Instana.
  • GraphQL API development and management.
  • SRE concepts such as SLA, SLO, SLI, and latency metrics.


Expert Proficiency in:

  • AWS services and infrastructure-as-code (IaC) practices.
  • CI/CD pipeline setup and infrastructure management.


Preferred Skills and Attributes:

  • Strong problem-solving and analytical skills.
  • Excellent communication and teamwork abilities.
  • Proactive and able to work effectively in a remote environment.
  • A passion for system reliability and automation.


Why Join Us?

  • Opportunity to work remotely with a talented global team.
  • Be part of innovative projects leveraging cutting-edge technology.
  • Competitive compensation and professional growth opportunities.


Application Process:

Interested candidates are invited to send a CV detailing their relevant experience and qualifications for the "Site Reliability Engineer" position to the email address below:


[email protected]


Note: Only shortlisted candidates will be contacted for further steps in the selection process.

Posted: Dec 09, 2024
Type: Full-time
Level: Mid-Senior
Location: Türkiye
Industries: IT Services, IT Consulting
Categories: Information Technology, Engineering
