Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
From our start as a small mobile games company founded in Israel to our current position as a publicly traded company and industry leader, we continue to be a dominant force in interactive entertainment. With a diverse portfolio of award-winning, category-leading Casual and Social Casino-themed games, including nine of the top 100 highest-grossing mobile games in the US, we're setting the standard for excellence.
Our success story is co-authored by a dynamic team of storytellers, strategists, creators and data scientists who thrive on innovation. We are home of the best, advancing an inclusive culture that embraces our core values and reflects our agile DNA.
With a strong financial foundation, disciplined operations, unwavering player-focused approach and relentless can-do spirit, we're well-positioned for sustained growth. If you're ready to join the driving force behind the evolution of interactive entertainment, we invite you to come play with us.
SRE / OPS / DevOps Engineer to join our infrastructure team. In practice, this is a cross-functional role that combines Site Reliability Engineering, Operations, and DevOps disciplines. You will be embedded in a team that owns and operates complex multi-datacenter infrastructure supporting multiple high-traffic gaming studios at Playtika.
You will work alongside experienced engineers across SRE, DBA, and Platform teams to keep our services reliable, scalable, and observable.
Responsibilities:
- Operate and maintain Kubernetes-based infrastructure (SpectroCloud / Cloudstack) across multiple datacenters in the US.
- Support and troubleshoot a wide range of stateful workloads running in k8s: Kafka, Redis / KeyDB / RedisLabs, MariaDB / Galera, Aerospike, Singlestore, Elasticsearch / OpenSearch
- Participate in cloud migration projects — prepare environments, perform data sync and failover, execute final cutover
- Manage and maintain monitoring, alerting, and observability stacks: Prometheus, VictoriaMetrics, Grafana, AlertManager, PagerDuty
- Maintain and improve logging infrastructure: ELK / OpenSearch stack, Filebeat, Logstash — including configuration, performance tuning, and index lifecycle management
- Configure and maintain load balancers (Nginx+, MaxScale, internal PLB / IPVS-based solutions) and manage DNS records
- Manage SSL certificate lifecycle automation (Sectigo + Prometheus / Grafana)
- Administer and maintain secrets management systems (HashiCorp Vault, External Secrets Operator)
- Participate in on-call duty rotation and respond to production incidents; follow and improve SRE alert handling guidelines
- Contribute to GitOps workflows: maintain infrastructure-as-code in Git, work with Flux CD, deployment packages, and Ansible playbooks
- Review and extend automation scripts and tooling (primarily Bash and Python)
- Provide SRE-level support to multiple game studios: investigate production issues, handle ChatOps requests, and collaborate with development teams
- Write and maintain operational runbooks, SOPs, migration plans, and other technical documentation in Confluence
- Perform capacity planning reviews and resource utilization analysis for datastores and cluster nodes
- Participate in cross-team initiatives and contribute to platform-level improvements
- Solid hands-on experience with Linux systems (primarily Ubuntu 22.04 / 24.04 LTS)
- Practical knowledge of Kubernetes administration: workloads, operators, resource management, node maintenance, cluster upgrades
- Experience operating and troubleshooting stateful services in k8s: at least one of — Kafka, Redis / KeyDB, MariaDB / Galera, Elasticsearch / OpenSearch, Aerospike
- Familiarity with GitOps approach and tools: Git, Flux CD, Helm, Kustomize
- Monitoring and observability experience: Prometheus ecosystem, VictoriaMetrics, Grafana, AlertManager
- Practical experience with ELK / OpenSearch stack (Filebeat, Logstash, index management)
- Solid scripting skills (Bash); ability to read and modify Python
- Understanding of networking fundamentals: TCP/IP, DNS, load balancing, ports and protocols, VIPs, VLANs
- Experience with HashiCorp Vault or similar secrets management solutions
- Familiarity with Ansible for infrastructure automation
- Ability to troubleshoot complex distributed system issues under production pressure
- Strong communication skills: ability to collaborate across SRE, DBA, RnD, and NOC teams
- Experience working with Jira-based workflow and documenting in Confluence
- Experience with cloud migration projects (datacenter-to-cloud or cloud-to-datacenter)
- Knowledge of additional datastores: Singlestore (SingleStore / MemSQL), Aerospike, Couchbase, KeyDB
- Familiarity with HashiCorp Boundary for secure remote access and Ansible dynamic inventory
- Experience with load balancer solutions: Nginx+, F5, IPVS
- Understanding of high-availability and DR patterns: active-active, active-passive, failover procedures
- Exposure to SSL certificate lifecycle automation (cert-manager, Sectigo)
- Knowledge of PagerDuty, or similar on-call and incident management platforms
- Experience with AWS (IAM, S3, EC2) in the context of infrastructure operations
- Understanding of SLO/SLA concepts and how they apply to infrastructure reliability
- Familiarity with Python-based monitoring collectors or custom exporters for Prometheus
- Experience with capacity planning, performance analysis, and resource optimization in production environments
- Annual company bonus
- Competitive salary & flexible working hours
- Daily breakfast, lunch, and office refreshments
- Private health insurance, dental coverage, and psychological counseling
- 20 paid vacation days and 5 sick days per year
- 6 long weekends and 1 day off for your birthday
- Gifts for special occasions (birthday, Easter, Christmas, Women’s and Men’s Day, weddings, childbirth, etc.)
- Technical library with the option to order books
- Access to our educational platform with courses, training programs, and certifications
- Career development through coaching and reviews
- Internal mobility & referral program
- Corporate celebrations, team-building events, and fun activities
- Personal care in the office (nails, eyebrows, and barber services)
- Modern offices (Kyiv, Vinnytsia) with organized shuttle buses to and from the office
By applying to this job, you acknowledge that you have read and understood our Candidates Privacy Notice, including with respect to the use of AI tools, and consent to the processing of your information as described therein.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
If you feel the above describes you perfectly - Apply now!
Employee at Playtika? Click here
Key Skills
Ranked by relevanceReady to apply?
Join Playtika and take your career to the next level!
Application takes less than 5 minutes

