Site Reliability Engineer

Doghouse RecruitmentRotterdam, NL
Published on

About the Role

Are you an observability expert looking to take full ownership in a highly critical, large-scale environment? This is your opportunity. As a Senior Site Reliability Engineer in Rotterdam, you will manage a complex observability landscape.

Your Role

  • 50% Operations: Maintain a massive, business-critical observability landscape.
  • 50% Engineering: Build, automate and evolve the observability stack, working with a team that builds for engineers, by engineers. You'll enable reliability across the entire company.

About the Candidate

To be successful in this role, you should have:

  • At least 7 years of experience as a Site Reliability Engineer.
  • Deep expertise with custom implementations of Prometheus, OpenTelemetry, and the LGTM stack (Loki, Grafana, Tempo), customized using Helm Charts.
  • Strong understanding of Kubernetes, with experience in building Custom Operators.
  • Solid programming skills in Python and/or Golang.
  • Proven experience operating in large-scale, multi-tenant environments.

About the Company

My client is a leading international telecom company with a state-of-the-art DevOps hub in Rotterdam, home to over 450 engineers, looking to strengthen their Observability team managing over 350 Kubernetes clusters.

Company Culture and Benefits

This role offers high-impact engineering work in a business-critical environment, a true engineering culture focused on performance, customization, and scale, and the opportunity to collaborate with highly skilled professionals in one of Europe’s largest DevOps hubs.

Are you ready to take your observability expertise to the next level and make a real impact? Apply now and be part of something big.