DevOps Engineer
My European client is seeking a DevOps Engineer with strong knowledge of Observability for a contract assignment in Stockholm, Sweden. The role follows a hybrid working model, mainly offsite with some onsite work. Due to the nature of the project, candidates need to be EU citizens, and preference will be given to candidates with previous experience in EU institutions.
What will your duties as a specialist DevOps Engineer be?
- Observability Platform Design & Architecture:
- You will design and implement scalable and robust observability architectures for complex distributed systems, including microservices, cloud-native environments (Kubernetes, serverless), and traditional infrastructure.
- You will define and enforce standards and best practices for telemetry data (metrics, logs, traces) collection, processing, storage, and visualization.
- You will also evaluate, select, and integrate new observability tools and technologies into the existing ecosystem.
- Instrumentation & Data Collection:
- You will be working directly with development and Back Office teams to implement pervasive instrumentation within applications and infrastructure, using frameworks like OpenTelemetry.
- You will develop custom exporters, agents, or integrations to collect specific telemetry data from various sources.
- You will also configure and optimize data pipelines for efficient ingestion and routing of metrics, logs, and traces.
- Tooling Implementation & Management:
- You will be involved in the hands-on deployment, configuration, and administration of observability platforms.
- Support the automation of the deployment and management of observability infrastructure using Infrastructure as Code (IaC) tools (e.g. Terraform).
- Develop custom dashboards, alerts, and reports to provide actionable insights into system health and performance.
- Performance Analysis & Troubleshooting:
- You will support and coach teams to identify performance bottlenecks, anomalies, and potential issues by analysing observability data.
- Support and coach teams to leverage observability tools to quickly pinpoint root causes and facilitate rapid resolution.
- Support and coach teams to conduct in-depth performance analysis, capacity planning, and resource optimization based on collected telemetry.
- Support and coach teams to implement anomaly detection and predictive analytics to anticipate and prevent issues.
- Automation & Scripting:
- You will need to develop scripts and automation tools to streamline observability workflows, integrate systems, and enhance operational efficiency.
- Mentoring & Knowledge Sharing:
- You will act as a subject matter expert, providing technical guidance and mentorship to teams on observability best practices, tools, and troubleshooting techniques.
- Create detailed technical documentation, runbooks, and playbooks.
- Conduct training sessions and workshops to upskill development and operations teams on observability concepts and tools.
- Collaboration & Cross-functional Support:
- You will collaborate closely with development and operations teams to embed observability throughout the software development lifecycle (SDLC).
- Work with stakeholders to define Service Level Indicators (SLIs) and Service Level Objectives (SLOs) and build dashboards to track adherence.
Required Skills and Experience
- Bachelor's degree in Computer Science, Software Engineering, DevOps, or a related technical discipline.
- 5-8+ years of progressive experience in a hands-on technical role, with a significant focus on observability, monitoring, or DevOps.
- 3-5 years of dedicated experience in designing, implementing, and managing observability solutions in production environments.
- Proven track record of architecting and delivering scalable and resilient observability platforms.
- Extensive experience with incident response and post-mortem analysis.
- Expert-level understanding of Observability Principles: Deep knowledge of the "three pillars" (metrics, logs, traces), distributed tracing, event correlation, and their application in complex systems.
- Deep Hands-on Expertise with Observability Tools: Proven proficiency in deploying, configuring, and optimizing multiple leading observability platforms (e.g., Prometheus/Grafana, ELK Stack, Jaeger/OpenTelemetry).
- Cloud-Native & Distributed Systems Expertise: In-depth understanding and hands-on experience with cloud platforms (Azure), containerization (Docker, Kubernetes), service mesh, and microservices architectures.
- Infrastructure as Code (IaC): Proficient in using tools like Terraform for automating infrastructure provisioning and configuration related to observability.
- Linux System Administration & Networking: Strong grasp of Linux operating systems, networking protocols, and system-level troubleshooting.
- Database Knowledge: Familiarity with time-series databases (e.g. Prometheus, InfluxDB) and other relevant data stores for observability data.
- Troubleshooting & Root Cause Analysis: Exceptional analytical and problem-solving skills, with a systematic approach to diagnosing complex technical issues.
- Relevant industry certifications in cloud platforms, Kubernetes, or specific observability tools are highly valued.
- A strong command of the English language (spoken and written) is mandatory.
- Posted: Jul 24, 2025
- Type: Contract
- Level: Mid-Senior
- Location: Stockholm