Grid Dynamics
Site Reliability Engineer (SRE)
Grid DynamicsArgentina7 hours ago
Full-timeRemote FriendlyEngineering, Information Technology
We are looking for a Site Reliability Engineer to join a new team at one of our clients, a major American pet care retailer offering supplies, services, and care solutions. This is an opportunity to join a large, well-established organization that combines retail, services, and digital solutions to improve the lives of pets and their owners, in a collaborative environment with the chance to work on impactful, customer-facing products at scale.

Responsibilities

  • Ensure high availability, reliability, and performance of retail systems (e-commerce, checkout, inventory), especially during peak sales events.
  • Monitor systems using SLIs/SLOs, lead incident response, and perform root cause analysis to reduce downtime and customer impact.
  • Design and maintain scalable, fault-tolerant infrastructure using cloud platforms, containers, and Infrastructure as Code.
  • Automate deployments, testing, and operational tasks through CI/CD pipelines and self-healing systems.
  • Implement robust monitoring, logging, and alerting to proactively detect and resolve issues.

Requirements

  • Strong experience with Linux/Unix systems and cloud platforms (GCP).
  • Proficiency in at least one programming/scripting language (Python, Bash, Node).
  • Hands-on experience with containers and orchestration (Docker, Kubernetes).
  • Solid understanding of monitoring, logging, and alerting tools and SRE concepts (SLIs, SLOs, SLAs).
  • Experience building or supporting high-traffic, customer-facing systems, preferably in e-commerce or retail environments.
  • Knowledge of CI/CD pipelines, Infrastructure as Code (Terraform), and reliability best practices.

Nice to have

  • Experience with Observability
  • Experience with Ecommerce

We offer

  • Flexible working hours (full-time).
  • One "Flex Day" off per month – eligible after six months with the company.
  • 10 business days of vacation.
  • Swiss Medical health coverage.
  • 100% remote work.
  • Permanent contract with salary review every four months (in ARS).
  • Access to Udemy and Platzi for professional training.
  • Employee Assistance Program (financial, nutritional, psychological support, etc.).
  • Fully covered English classes during working hours.
  • Discounts on Club de Beneficios and Samsung products.
  • Birthday day off.

About Us

Mobile Computing is joining Grid Dynamics (NASDAQ: GDYN), a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.

Key Skills

Ranked by relevance