Avance Consulting
Site Reliability Engineer
Australia, posted 1 day ago
Contract, Information Technology

Role Overview

We are seeking a skilled Site Reliability Engineer (SRE) with expertise in automation and observability. The ideal candidate will have strong proficiency in PowerShell scripting for automation, infrastructure management, and operational efficiency, as well as Power BI experience for building dashboards, metrics, and insights to support reliability engineering practices.

Key Responsibilities

Design, build, and maintain automated solutions using PowerShell for system administration, configuration management, and operational tasks.

Develop and maintain Power BI dashboards to visualize service health, SLAs, incident metrics, and operational performance.

Implement monitoring, alerting, and observability solutions to improve system reliability and uptime.

Support and optimize CI/CD pipelines and infrastructure automation.

Collaborate with development and operations teams to implement SRE best practices, including error budgets, SLIs, SLOs, and SLAs.

Troubleshoot complex incidents, conduct root cause analysis (RCA), and contribute to continuous improvement.

Participate in the on-call rotation to ensure high availability and quick recovery from failures.

Document automation processes, runbooks, and reliability standards.

Required Skills & Qualifications

Proven experience as an SRE, DevOps Engineer, or Systems Engineer.

Strong expertise in PowerShell scripting for automation and system operations.

Hands-on experience with Power BI (dashboard creation, DAX, data modeling).

Good understanding of monitoring/observability tools (e.g., Splunk, Grafana, AppDynamics, Prometheus).

Familiarity with cloud platforms (Azure, AWS, or GCP).

Knowledge of CI/CD tools (Azure DevOps, Jenkins, GitHub Actions, etc.).

Strong troubleshooting skills in Windows/Linux environments.

Understanding of networking, load balancing, and system performance tuning.
