akeno
Staff / Senior DevOps Engineer
akenoGermany1 day ago
Full-timeRemote FriendlyEngineering
We're looking for a Staff Platform/DevOps Engineer to join forces with our DevOps Lead and push our platform's delivery, automation, and observability across diverse deployment environments to the next level.We build B2B software for industrial customers. Reliability, traceability, and maintainability matter across the full lifecycle.You'll work closely with engineering to design and operate the pipelines, runtimes, and tooling that connect development to production. You'll take ownership of the CI/CD architecture, defining how code moves from commit to production safely and consistently. That includes managing the pipelines, shaping how and when to apply GitOps workflows and principles, and setting clear guardrails for rollback and change control.You'll also lead the observability program: operating and improving our current stack, defining what good telemetry means, and ensuring every service has actionable visibility and meaningful SLOs.As the architecture evolves, you'll help evaluate and introduce modern tools or practices that improve delivery speed, reliability, or traceability—always with a clear understanding of operational constraints and trade-offs.

What You'll Achieve:

CI/CD: Design, implement, and operate pipelines using GitHub Actions, Terraform, and Ansible. Shorten feedback cycles, ensure reproducibility, and codify rollback policies.

Lifecycle integration: Ensure the full software lifecycle—from development to operations is supported with automation, validation, observability, and continuous improvement built-in.

GitOps: Standardize repository topology, environment promotion, and secrets/drift handling. Document where GitOps fits best and where exceptions apply.

Runtime & deployments: Operate and improve the runtime stack. Choose and implement rollout strategies (blue/green, canary, feature flags) according to risk and environment type.

Observability: Collaborate with engineering teams to embed logging and metrics by default across services and environments.

Own Fluentd, Loki, and Grafana; deliver structured logs, meaningful metrics, tuned alerting, and SLOs that drive action—built in, not bolted on.

Architecture evolution: Drive and support platform evolution by evaluating and adopting modern DevOps practices and tools aligned with real team needs.

Security & resilience: Improve secrets handling, image hygiene/SBOMs, backup and restore procedures, and baseline hardening.

Reliability in practice: Diagnose and fix issues with well-tested updates; introduce tools and practices only when they meet actual operational requirements and constraints.

Requirements

  • Solid experience designing and operating CI/CD pipelines using GitHub Actions, Terraform, and Ansible, with an openness to evolving the stack
  • Strong understanding of how pipelines support the entire lifecycle—from local development to production operations—with traceability, monitoring, and validation at every step
  • GitOps maturity: repo layouts, promotion models, secrets management, and drift/exception handling; aware of limits and edge cases
  • Kubernetes and service discovery/management literacy; able to discuss trade-offs, operational overhead, and when a lightweight approach is better
  • Clear understanding of deployment strategies (blue/green, canary, feature flags) and how to choose per context, with rollback criteria tied to metrics
  • Deep understanding of observability — logs, metrics, and traces, cost/cardinality control, alert fatigue. Management and hands-on experience running Fluentd, Loki, and Grafana to deliver actionable, reliable telemetry.Strong scripting and automation skills using Python and Bash (PowerShell a plus)
  • Confident with Git-based workflows in daily development and operations.Solid foundation in Linux-based environments, containers, and core networking (DNS, TCP/IP, firewalls, OSI model)
  • Proven problem-solving and troubleshooting skills—systematic, persistent, and outcome-oriented.Strong communication and collaboration skills—clear documentation, proactive sync, and team alignment.English proficiency (C1+)

Nice To Haves:

  • Experience with air-gapped delivery (registry/Helm/OCI mirrors, artifact signing)
  • Cloud exposure (especially Azure or GCP) and hybrid networking/VPN/S2S
  • Experience with modern or emerging infra/deployment tools (Pulumi, OpenTelemetry, etc.)
  • Postgres HA/backup tooling and restore drills.Exposure to MLOps, data-heavy pipelines, or infra for AI systems. German skills are a plus

Benefits

At our company, we believe people do their best work when they feel supported, connected, and empowered. Here's what you can look forward to:

  • Generous Annual Compensation
  • 35-40 Hours Working Week - Supporting work-life balance without compromising impact
  • Relocation Package - Full support, including relocation assistance and accommodation guidance
  • Visa Sponsorship - Comprehensive support throughout the process
  • Wellpass Gym Membership - Access to hundreds of fitness and wellness options for just €10/month
  • Job Ticket - Subsidized public transport to keep your commute affordable and stress-free
  • German Language Courses - Learn or improve your German with our supported language program
  • Snacks, Drinks & Great Coffee - Always available to keep you fuelled and focused
  • Modern Office with a View - Work from our beautiful Hamburg office with panoramic city views
  • Supportive & International Team - Join a warm, open-minded group of colleagues from around the world
  • Real Ownership - You'll have space to take initiative and make a tangible impact
  • Flexible Working Hours - Because we understand that life and studies don't always fit a 9-5
  • Regular Team Events - From lunches to offsites—we celebrate the wins (big and small) together

Key Skills

Ranked by relevance