C-Serv
Principal Cloud Operations Engineer
C-Serv, Ireland, 17 days ago
Full-time, Remote Friendly, Other
Our client's Cloud Operations team is a group of talented engineers passionate about building highly reliable, scalable, and secure solutions in public and private cloud environments. We are looking to hire a highly motivated Cloud Operations engineer with strong working experience in production operations, as well as in cloud infrastructure design and implementation. Together, we will design, develop, and implement the best public, private, and local cloud solutions for our customers. You will also be expected to participate in continuous cloud service operations and to troubleshoot and resolve complex issues in production.

Responsibilities:

  • Manage and maintain the client's cloud infrastructure in AWS, GCP, and Azure
  • Provide technical leadership in cloud infrastructure design and implementation
  • Ensure secure and reliable communication across different regions and cloud service providers
  • Deploy and configure middleware services, such as SQL and NoSQL databases and message queue systems
  • Evaluate, recommend, and implement CloudOps / DevOps technology and solutions
  • Participate in continuous cloud service operations with the US and remote teams
  • Troubleshoot and follow up on production infrastructure / application related issues
  • Drive root cause analysis and resolution
  • Communicate with Dev/QA as well as external carriers to resolve and prevent issues
  • Design and implement a deployment automation platform for Kubernetes-based microservices
  • Improve service availability and scalability through tuning, automation, tooling, and process improvements
  • Analyze service performance, identify bottlenecks, and provide actionable improvement plans

Requirements:

  • BS level technical degree required; Computer Science or Engineering background preferred
  • 8+ years of experience in a CloudOps / DevOps role
  • Hands-on experience with AWS or another public cloud (Azure, GCP, etc.)
  • Knowledge of Linux, security and networking fundamentals
  • Working knowledge of container-based architecture and deployment (Docker, Kubernetes)
  • Working knowledge of deployment automation development (Terraform, Helm, ArgoCD)
  • Experience in diagnosing and resolving complex application problems
  • Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Flink, Kafka, and RabbitMQ
  • Experience with monitoring tools (Nagios, Grafana, Prometheus)
  • Experience with cloud security and compliance implementation is a plus
  • Strong follow-through and initiative to stay with issues until they are resolved
  • Comfortable working within a distributed team located in multiple time zones
