Intellias
Senior SRE DevOps Engineer (AWS)
Intellias | Ukraine | 2 days ago
Full-time | Information Technology

Dive deep into Digital! For 20 years, Intellias has been developing top-tier digital solutions for the world’s leading companies, keeping them in line with the latest technology trends. Join in and provide innovations for the future!


The Senior DevOps Engineer will play a key role in designing, maintaining, and scaling the infrastructure and automation systems that ensure the reliability, availability, and performance of the company's critical applications and services. This position requires deep expertise in cloud-native platforms, infrastructure as code (IaC), CI/CD, and modern observability practices. The role involves a blend of software engineering and systems engineering skills to build resilient, secure, and scalable infrastructure.


Requirements

Experience:

  • Minimum 5 years of professional experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.

Technical Skills:

  • Strong proficiency with AWS (EKS, RDS, S3, IAM, Lambda, CloudFormation/Terraform).
  • Prior experience in SRE teams or initiatives.
  • Proficient in Terraform, Helm, ArgoCD, Kubernetes, and CI/CD automation.
  • Solid understanding of networking, DNS, TLS, load balancers, and container orchestration.
  • Experience with monitoring/alerting tools (e.g., Datadog, Prometheus, Grafana).
  • Strong scripting skills in Python, Bash, or Go.

Preferred Qualifications

  • Master’s degree in Computer Science or related field.
  • AWS Certified Solutions Architect or DevOps Engineer certifications.
  • Experience with Crossplane, OpenSearch, or multi-cloud architecture.

Working hours: availability from 15:00 to 23:00 (EET/EEST).


Responsibilities

  • Design and implement scalable infrastructure solutions using AWS cloud services (e.g., EC2, EKS, RDS, MSK, Lambda, CloudFront).
  • Develop and maintain infrastructure automation using Terraform, Helm, and ArgoCD within GitOps workflows.
  • Architect and manage multi-region, highly available systems to ensure business continuity and disaster recovery.
  • Lead incident response, postmortems, and root cause analysis efforts to improve system reliability and performance.
  • Define and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
  • Build and maintain CI/CD pipelines using GitHub Actions, ensuring efficient and secure software delivery.
  • Implement and manage observability stacks including Datadog, CloudWatch, and Prometheus/Grafana.
  • Ensure compliance and security best practices, including IAM policies, secrets management, and audit logging.
  • Collaborate with software engineering, security, and infrastructure teams to define and implement a reliable architecture.
  • Conduct cost optimization, capacity planning, and performance tuning for cloud workloads.
  • Mentor junior engineers and contribute to knowledge sharing and process improvements.


Why this position: The client makes smart cutting machines that work with an easy-to-use app, an ever-growing collection of materials, and crafting essentials to help you design and personalize almost anything: custom cards, unique apparel, everyday items, and so much more.
