-
GiGa-Ops Global Solutions

Site Reliability Engineer (SRE) with EKS Expertise

GiGa-Ops Global Solutions
Lithuania · Contract · Mid-Senior

Job Summary

We are seeking a highly skilled Site Reliability Engineer (SRE) with experience in building and managing EKS (Elastic Kubernetes Service) environments. The ideal candidate will be responsible for designing, deploying, and maintaining reliable systems while supporting our DevOps practices. A background in observability tools such as ELK (Elasticsearch, Logstash, Kibana) and Grafana is highly preferred.

Key Responsibilities

  • EKS Build and Run:
  • Design, implement, and manage EKS clusters to ensure high availability and scalability.
  • Automate provisioning, deployment, and scaling of EKS environments.
  • Monitor and maintain the health and performance of Kubernetes workloads in EKS.
  • Site Reliability Engineering:
  • Enhance system reliability through the development of monitoring, automation, and fault-tolerant solutions.
  • Build tools and automation to streamline infrastructure management and operational tasks.
  • Respond to incidents, troubleshoot performance issues, and conduct root cause analysis.
  • DevOps Collaboration:
  • Support CI/CD pipelines, including integrating EKS into the DevOps lifecycle.
  • Ensure seamless collaboration with development teams to deliver infrastructure as code (IaC) and automate deployments.
  • Observability & Monitoring:
  • Implement and optimize observability solutions using tools like ELK Stack and Grafana.
  • Establish robust logging, monitoring, and alerting frameworks to improve system transparency and uptime.

Required Skills & Experience

  • Kubernetes/EKS Expertise: Strong experience in deploying and managing Kubernetes clusters, specifically on AWS EKS.
  • Cloud Platforms: Advanced knowledge of AWS services and infrastructure.
  • DevOps Tools: Familiarity with DevOps practices and tools like Terraform, Ansible, Jenkins, or GitLab CI/CD.
  • Observability: Hands-on experience with ELK Stack (Elasticsearch, Logstash, Kibana) and Grafana.
  • Automation & Scripting: Proficiency in scripting languages (e.g., Python, Bash) and automation frameworks.
  • System Administration: Solid understanding of Linux/Unix systems and networking.

Preferred Qualifications

  • Background in building observability pipelines and frameworks.
  • Experience with Prometheus, Loki, or other observability tools is a plus.
  • Certification in AWS (e.g., AWS Certified Solutions Architect or DevOps Engineer) is an advantage.

Soft Skills

  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and teamwork abilities.
  • A proactive approach to learning and adopting new technologies.

Skills: kubernetes,building,gitlab ci/cd,eks,ci,grafana,aws,infrastructure,automation,elk stack,reliability,networking,jenkins,ansible,bash,terraform,unix,skills,linux,devops,python

Key Skills

Ranked by relevance

c eks devops ui kubernetes aws ai grafana elk esp elasticsearch gitlab ci terraform jenkins ansible python gitlab linux bash unix git ux ha infrastructure as code system administration high availability prometheus scala cloud excel loki nist lan
Login to Apply
Posted
Nov 21, 2024
Type
Contract
Level
Mid-Senior
Location
Lithuania

Industries

Technology Information Internet

Categories

Engineering Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
Alignerr
Related

Software Engineer (AI Training)

2026-06-15

Contract
Not Applicable
Argentina
Technology
Engineering
View Job Details
Alignerr
Related

Software Engineer (AI Training)

2026-06-15

Contract
Not Applicable
Argentina
Technology
Engineering
View Job Details
Profound
Related

Design Engineer

2026-06-17

Full-time
Not Applicable
Argentina
Technology
Engineering