-
Bloq.it

Senior Site Reliability Engineer

Bloq.it
Portugal · Full-time · Mid-Senior

At Bloq.it, we've created the world's leading smart locker solution.

Solving online deliveries by enabling everyone to participate easily, reducing delivery costs, and making them more sustainable.

We're quickly expanding, and after growing at 1000% for three years in a row, we're now the fastest-growing Smart Locker company in the world and one of the fastest growing scale-ups in Europe.

We are searching for a Site Reliability Engineer to join our innovative Team.

This person is a key player in maintaining the health, stability, and performance of our production systems. This role is designed for a highly technical engineer who thrives in troubleshooting complex issues, collaborating with cross-functional teams, and building observability and monitoring services. As part of the 3rd level support team, you will be responsible for investigating and resolving escalated issues that affect system availability, performance, and reliability. What will be your responsibilitiesProvide expert-level troubleshooting and incident management for escalated production issues, including performance degradation, outages, and system anomalies.Diagnose and resolve complex issues across infrastructure, applications, and services, working closely with development teams to identify root causes.Collaborate with operations, development, and security teams to drive proactive improvements in the reliability, scalability, and availability of systems.Maintain and enhance system observability tools, ensuring proper monitoring, alerting, and logging to detect issues early and respond to incidents effectively.Contribute to the creation and refinement of runbooks, incident response protocols, and other technical documentation for internal teams.Automate repetitive tasks to improve operational efficiency and reduce toil.Define and implement incident response processes, including root cause analysis and post-mortems.What are the requirements to join us in this positionAt least 3 years of proven professional experience as Site Reliability Engineer (SRE) or a similar role.Strong expertise in monitoring and observability tools (Prometheus, Grafana, Datadog, Elasticsearch, Kibana, New Relic, OpenTelemetry, etc.

).Experience with NoSQL Databases (MongoDB or Elasticsearch are a nice to have).Deep understanding of incident management, post-mortem analysis, and on-call best practices.Experience with AWS cloud platform.Experience creating automations and tooling.Expertise in Unix/Linux console debugging, using commands and tools such as grep, awk, sed, strace, tcpdump, lsof, journalctl, and others.A problem-solving mindset with a data-driven approach to resilience engineering.Prior experience setting up SRE practices from scratch.Experience in products combining both hardware and software.Experience in high-growth, product-driven startups.Familiarity with ITIL or incident management frameworks.Experience implementing error budgets and reliability SLAs.Experience with Kubernetes and containerization.What will you get if you join us in this positionA dynamic and fast-paced work environment with a culture of innovation, collaboration, and continuous learning;Competitive salary and benefits package, tailored to your experience and skills, including performance-based bonus and Portuguese health insurance;Flexible work conditions, including a remote-friendly policy and a flexible schedule that allows you to balance your work and personal life;Monthly meetings in-person at our Offices in Portugal (Lisbon or Porto), giving you the chance to connect with the team and immerse yourself in our company culture;Make a tangible impact by contributing to the continuous improvement of our core solutions, actively supporting our mission to provide affordable and sustainable solutions.Got questions or curious if this role is the right fit?

Reach out directly to Pedro Calado at ****** — happy to chat. Join our team of #bloqstars and help us redefine the last-mile delivery experience!

#J-18808-Ljbffr

Key Skills

Ranked by relevance

incident response elasticsearch kubernetes prometheus grafana datadog nosql cloud itil aws
Login to Apply
Posted
Jun 15, 2025
Type
Full-time
Level
Mid-Senior
Location
Porto
Company
Bloq.it

Industries

Technology Information Internet

Categories

Engineering Information Technology

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
EPAM Systems
Related

DevOps Engineer

2026-05-27

Full-time
Associate
Argentina
Software Development
Engineering
View Job Details
Bloq.it
Related

Senior Backend Engineer

2026-05-13

Other
Not Applicable
Portugal
Technology
Engineering
View Job Details
Bloq.it
Related

Fullstack Engineer

2026-02-17

Full-time
Entry
Portugal
Technology
Engineering