NLB Services
Site Reliability Engineer
NLB ServicesIreland5 hours ago
ContractRemote FriendlyInformation Technology

Role: Senior Telemetry Engineer

Location: Dublin, Ireland

Mode of working: Hybrid (2-3 days/week onsite)


End client: Banking

Primary skills: Splunk, Observability Cloud, Open Telemetry.


Role Overview:

We are seeking a highly skilled Senior Telemetry Engineer to lead the design and implementation of telemetry pipelines across diverse environments including Microservices, VM-based applications, cloud-native platforms, and on premise systems. The ideal candidate will have deep expertise in Open Telemetry architecture and implementation, and a strong background in observability, distributed systems, and performance monitoring.


Key Responsibilities:

· Architect and implement end-to-end telemetry pipelines for applications deployed across cloud, on-prem, and hybrid environments.

· Lead the installation, configuration, and optimization of Open Telemetry components including SDKs, Collector, and exporters.

· Collaborate with application, infrastructure, and DevOps teams to define telemetry standards and integrate observability into CI/CD workflows.

· Design scalable and resilient data collection strategies for metrics, logs, and traces.

· Develop and maintain instrumentation libraries for microservices and legacy applications.

· Ensure telemetry data is efficiently routed to observability platforms (e.g., Splunk, Prometheus, Grafana, Datadog).

· Conduct performance tuning and troubleshooting of telemetry pipelines.

· Provide architectural guidance and best practices for telemetry adoption across teams.

· Stay current with OpenTelemetry releases and contribute to internal tooling and automation.


Required Skills & Qualifications:

· Proven experience in setting up telemetry pipelines from scratch across multiple environments.

· Strong hands-on expertise with OpenTelemetry (Collector, SDKs, OTLP protocol).

· Deep understanding of distributed tracing, metrics collection, and log aggregation.

· Experience with observability platforms such as Splunk, Prometheus, Grafana, Jaeger, Zipkin, Datadog, etc.

· Proficiency in one or more programming languages (e.g., Python, Go, Java, Node.js) for instrumentation.

· Familiarity with cloud platforms (AWS, Azure, GCP) and VM/on-prem infrastructure.

· Knowledge of container orchestration (Kubernetes), service meshes (Istio), and CI/CD pipelines.

· Excellent communication and documentation skills.

Key Skills

Ranked by relevance