Agile Lab is a company founded in 2014 with the mission to create value for its customers in data-intensive environments through customisable solutions that establish performance-driven processes, sustainable architectures and automated platforms based on data governance best practices.
Having delivered over 100 successful Elite Data Engineering initiatives, we have used this experience to create Witboost: a modular, technology-agnostic platform that enables modern organisations to discover, value and produce their data in both traditional environments and fully compliant Data Mesh architectures.
With a highly skilled team of over 260 data engineers based in Europe, Agile Lab helps organisations with their data-driven transformation.
Take a look at our handbook to discover our core values and processes.
💼 The opportunity:
We are looking for a Site Reliability Engineer II (SRE II) to join our growing team. You will play a key role in maintaining the reliability, observability, and operational efficiency of enterprise-level distributed systems.
In this role, you’ll coordinate a small technical team (3–4 people) in managing microservices in complex production environments. You will be involved in monitoring, incident management, release coordination, and performance tuning, with a strong focus on OpenShift platforms.
You’ll also work closely with multiple cross-functional teams to ensure high availability and performance of our cloud-native services.
This role includes on-call availability.
💰 Gross annual salary (RAL):
38.5K-48.5K
💻 Responsibilities:
- Ensure high reliability of microservices running in OpenShift environments
- Lead and coordinate a technical team of 3–4 engineers for operational excellence
- Manage incident resolution and ticketing workflows via ServiceNow
- Collaborate with development teams to drive performance optimization and tuning
- Design, configure and maintain monitoring dashboards (Grafana, Prometheus, etc.)
- Coordinate with Service Control Room to maintain effective alerting and response
- Oversee release processes of new features, hotfixes, and updates in production
Requirements:
- Degree in Computer Engineering, Computer Science, or a related field
- Proven experience in Application Maintenance Services (AMS): minimum 2 years
- In-depth knowledge of OpenShift and microservices in cloud-native environments
- Ability to technically and operationally lead a team of 3–4 people
- Experience in release management, monitoring, and incident resolution
- Excellent communication and cross-functional coordination skills
- Strong initiative, operational autonomy, and results-oriented mindset
- Fluency in Italian (mandatory requirement)
Tech stack:
- Monitoring & Observability: Grafana, Prometheus, Kibana, Jaeger, Datadog, OpenTelemetry
- Cloud/DevOps: OpenShift, GitLab, Jenkins
- Data & Messaging: Kafka, MongoDB, Ignite
- Ticketing & ITSM: ServiceNow
What we offer:
- Fully remote or hybrid work from our offices in Milan, Turin, Padua, Bologna, Catania and Rende;
- Real work-life balance;
- Monthly training budget (time and money);
- Support from a buddy during your first week of work;
- Benefits and corporate welfare programs: company prizes and a welcome pack with all the equipment you need to work;
- Agile Nomads Experience: the opportunity to work abroad for 2 weeks;
- Referral bonus, if you bring in people as talented as you;
- The opportunity to attend one conference per year;
- A company rated 4.8 out of 5 for employee satisfaction on Glassdoor and certified as a Great Place to Work;
- An inclusive environment where you can be who you really are;
- Stimulating environment oriented to growth, both professional and personal.
Our values:
- We don't like hierarchies: we work as a team;
- We don't like bureaucracy: we prefer a sense of responsibility;
- We like data, and therefore anything that is measurable;
- We want to make a positive change in our industry;
- Empathy, humility, collaboration, and willingness to challenge ourselves are the basis of our work.
Only candidates based in European time zones (CEST or similar) will be considered for this position.
- Posted: Dec 20, 2025
- Type: Full-time
- Level: Entry
- Location: Milan
- Company: Agile Lab
- Industries: Software Development
- Categories: Engineering, Information Technology