Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Job Title: Senior DevOps / SRE Engineer
Location: Romania / Czech Republic / Ukraine (Remote – WFH)
Experience Required: 7+ years (5+ years relevant)
Duration: Long-term contract
Working Hours: CET Timings
Job Summary
We are looking for an experienced Senior DevOps / Site Reliability Engineer (SRE) to join our team. The ideal candidate will have deep expertise in Kubernetes at scale, CI/CD automation, infrastructure as code, and observability tools. You will be responsible for building and maintaining highly reliable, resilient, and secure cloud infrastructure and pipelines, while supporting development teams in delivering scalable applications.
Key Responsibilities
- Define standard release automation patterns for infrastructure and applications.
- Design and implement CI/CD pipelines with integrated DevOps and SecOps tools.
- Develop reports and metrics to ensure DevSecOps compliance.
- Optimize redundancies, monitoring, and alerting practices.
- Build highly available cloud patterns and manage resilient infrastructure.
- Control and execute code promotion through all environments.
- Implement Infrastructure as Code (Terraform/Helm).
- Develop and maintain build/release pipelines.
- Manage secrets and configuration securely.
- Provide incident and emergency response to resolve issues.
- Improve application ecosystem for performance, resiliency, and reliability.
- Create documentation, runbooks, and knowledge articles.
- Collaborate with dev teams, ensuring adherence to security guidelines and policies.
Mandatory Skills
- SRE & DevOps expertise with focus on automation and reliability.
- Advanced Kubernetes at scale (GKE, AKS, EKS, or RKE). Strong skills in Kubectl & Helm.
- Containers: Deploying Java (Spring Boot) microservices in Docker.
- Observability Tools: Prometheus/Grafana, Datadog, AppDynamics, Splunk, with APM, logging, and alerting (PagerDuty/OpsGenie).
- CI/CD Tools: Jenkins, Azure DevOps, GitHub Actions, ArgoCD, Artifactory, Azure/GCP registries.
- SCM: GitHub/GitLab with branching strategies (GitFlow, trunk-based).
- Strong troubleshooting skills across infrastructure and code.
- Excellent communication, documentation, and collaboration skills.
Qualifications
- Bachelor’s degree in Computer Science or related field (or equivalent experience).
- 7+ years of IT experience with 5+ years relevant in DevOps/SRE.
- Strong problem-solving and root cause analysis skills.
- Ability to work effectively with cross-functional Agile/Scrum teams.
Why Join Us?
- Exciting opportunity to work on large-scale, cloud-native platforms.
- Flexible remote working model (WFH).
- Work with cutting-edge technologies in Azure DevOps, Kubernetes, and CI/CD.
Key Skills
Ranked by relevanceReady to apply?
Join BlueRose Technologies and take your career to the next level!
Application takes less than 5 minutes