Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
From our start as a small mobile games company founded in Israel to our current position as a publicly traded company and industry leader, we continue to be a dominant force in interactive entertainment. With a diverse portfolio of award-winning, category-leading Casual and Social Casino-themed games, including nine of the top 100 highest-grossing mobile games in the US, we're setting the standard for excellence.
Our success story is co-authored by a dynamic team of storytellers, strategists, creators and data scientists who thrive on innovation. We are home of the best, advancing an inclusive culture that embraces our core values and reflects our agile DNA.
With a strong financial foundation, disciplined operations, unwavering player-focused approach and relentless can-do spirit, we're well-positioned for sustained growth. If you're ready to join the driving force behind the evolution of interactive entertainment, we invite you to come play with us.
Responsibilities:
- Maintain and improve existing monitoring configurations (alerts, dashboards, service discovery, scrape configs, etc.)
- Implement and enhance alerting logic, including threshold tuning and dynamic alert conditions
- Troubleshoot monitoring and metrics-related issues (e.g., missing data, false alerts, broken dashboards)
- Support and improve self-developed metrics collectors and Python-based monitoring services
- Assist NOC and SRE teams with alert deduplication, escalation rules, and alert quality improvements
- Participate in design and implementation of observability improvements for new services and infrastructure components
- Review, modify, and extend existing scripts and plugins (primarily Python and Bash)
- Provide monitoring-related guidance to development, infrastructure, and operations teams
- Ensure monitoring tools and services operate reliably within Kubernetes clusters and Linux systems
- Maintain monitoring configuration in Git and follow internal version control best practices
- Participate in cross-team initiatives to improve the overall monitoring and incident response ecosystem
- Strong hands-on experience with Linux systems (primarily Ubuntu)
- Practical knowledge of Prometheus ecosystem, VictoriaMetrics, Grafana, and Zabbix
- Experience supporting monitoring systems in Kubernetes-based infrastructure
- Solid scripting skills (Bash)
- Familiarity with Git and common version control workflows
- Good understanding of networking and infrastructure concepts (ports, protocols, DNS, etc.)
- Ability to troubleshoot metric collection, alert firing, and data visualization issues
- Basic knowledge of SQL (e.g., for querying time-series or metadata stores)
- Strong communication skills for cross-functional collaboration
- Understanding of high-availability and failover patterns in observability systems
- Experience working with SLO/SLA-based alerting or anomaly detection mechanisms
- Exposure to automation and CI/CD pipelines for monitoring infrastructure
If you feel the above describes you perfectly - Apply now!
Employee at Playtika? Click here
Key Skills
Ranked by relevanceReady to apply?
Join Playtika and take your career to the next level!
Application takes less than 5 minutes