Job Description
Key Responsibilities:
- Infrastructure Automation: Design, implement, and manage CI/CD pipelines to automate development, testing, and deployment processes.
- System Reliability: Ensure high availability and reliability of production systems and infrastructure by proactively monitoring and resolving incidents.
- Monitoring & Incident Management: Utilize monitoring tools (such as Prometheus, Grafana, New Relic, etc.) to track system health and performance. Quickly respond to incidents and participate in root cause analysis.
- Configuration Management: Automate infrastructure provisioning and configuration management using tools such as Terraform, Ansible, Puppet, or Chef.
- Cloud Infrastructure Management: Manage and optimize cloud infrastructure, including AWS, Azure, GCP, or others, ensuring scalability, security, and cost efficiency.
- Collaboration: Work closely with development teams to ensure application performance, reliability, and scalability requirements are met.
- Security & Compliance: Help ensure infrastructure is secure by integrating security best practices and tools into the DevOps pipeline.
- Continuous Improvement: Continuously improve systems and processes, sharing knowledge and expertise with the team to drive efficiencies.
Required Skills & Qualifications:
- Experience: Minimum 6+ years of experience in DevSecOps & Site Reliability Engineering.
- Programming/Scripting: Proficiency in one or more languages such as Python, Bash,Shell.
- CI/CD Tools: Strong experience with CI/CD tools such as Jenkins, GitLab CI.
- Cloud Platforms: Hands-on experience with cloud infrastructure (atleast one AWS, Azure, GCP) and cloud-native technologies like Kubernetes, OpenShift and Docker.
- Automation & Configuration: Expertise in automation tools like Terraform, Ansible, Puppet, or Chef.
- Monitoring & Logging: Experience with monitoring tools (Prometheus, Grafana, Datadog) and log management tools (ELK Stack, Splunk).
- Version Control: Familiarity with Git for version control.
- Infrastructure as Code (IaC): Experience writing and maintaining infrastructure as code (e.g., CloudFormation, Terraform).
- Incident Management: Proven experience in managing incidents, troubleshooting, and performing root cause analysis.
Key Skills
Ranked by relevance
Related Jobs
3 roles aligned with this opportunity
DevSecOps Expert
2026-05-28
AI Software Engineer (m/f/d) - Berlin
2026-05-21
Network Engineer
2026-05-27
- Posted
- Mar 13, 2025
- Type
- Contract
- Level
- Entry
- Location
- Canada
- Company
- Avance Consulting
Industries
Categories
Related Jobs
3 roles aligned with this opportunity
DevSecOps Expert
2026-05-28
AI Software Engineer (m/f/d) - Berlin
2026-05-21
Network Engineer
2026-05-27