Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Remotely
ELK GitLab Linux AWS Grafana Kafka nosql
We're seeking an exceptional Senior DevOps Engineer with specialized expertise in RabbitMQ clusters and Node.js environments to join our innovative team. Your primary mission will be conducting a comprehensive system audit and implementing critical infrastructure improvements to resolve performance bottlenecks and enhance system reliability.
Key Responsibilities
- Conduct exhaustive system audits to pinpoint infrastructure vulnerabilities and performance constraints.
- Architect and deploy solutions for RabbitMQ cluster optimization—ensuring unparalleled throughput and steadfast reliability.
- Craft and maintain sophisticated message queue systems capable of processing high-volume data with minimal latency.
- Devise and refine CI/CD pipelines for Node.js applications, leveraging cutting-edge DevOps methodologies.
- Implement robust Infrastructure as Code (IaC) solutions for consistent, reproducible environment management.
- Establish comprehensive monitoring frameworks and alerting mechanisms that safeguard system integrity.
- Diagnose and resolve intricate infrastructure challenges while implementing preemptive measures.
- Collaborate with development teams to streamline application deployment workflows and enhance efficiency.
- Create meticulous documentation for system architecture, configurations, and operational procedures.
- Provide mentorship to junior engineers, imparting DevOps best practices and infrastructure expertise.
- 5+ years of demonstrable experience in DevOps or Site Reliability Engineering roles.
- Profound expertise with RabbitMQ 3.11+ cluster management—including configuration optimization, high-availability setup, and performance tuning.
- Substantial experience with Node.js 18+ application deployment, scaling, and infrastructure design.
- Mastery in designing and orchestrating message queue architectures for mission-critical systems.
- Hands-on proficiency with container orchestration platforms—particularly Kubernetes 1.27+ or Docker Swarm.
- Comprehensive understanding of Infrastructure as Code principles, with practical expertise using Terraform or CloudFormation.
- Demonstrable experience with configuration management tools (Ansible, Chef, or Puppet).
- Advanced Linux/Unix server administration capabilities and shell scripting proficiency.
- In-depth knowledge of CI/CD orchestration using Jenkins, GitLab CI, or GitHub Actions.
- Extensive experience with major cloud platforms (AWS, Azure, or GCP).
- Thorough comprehension of network architecture principles and security best practices.
- Practical experience with alternative message brokers—such as Kafka, ActiveMQ, or Amazon SQS.
- Adeptness with monitoring ecosystems like Prometheus, Grafana, or the ELK stack.
- Insight into microservices architecture patterns and implementation strategies.
- Familiarity with contemporary observability frameworks and methodologies.
- Industry-recognized certifications in cloud platforms or relevant technologies.
- Demonstrated success in performance tuning for high-load, mission-critical systems.
- Working knowledge of both relational and NoSQL database systems.
- Expertise in sophisticated log management and analysis techniques.
Become part of our dynamic team and tackle complex infrastructure challenges while working with state-of-the-art technologies! We offer a flexible remote work environment, competitive compensation, and the opportunity to make a significant impact on our systems' performance and reliability. You'll collaborate with exceptionally talented professionals who value innovation and continuous improvement—in an atmosphere that fosters both professional growth and technical excellence.