Quick Summary

We are looking for a Staff Site Reliability Engineer to join our team and help us deliver high-quality products and services to our customers.

Required Skills

Site Reliability Engineering, Cloud Services, Kubernetes, Go Programming, Automation, Observability, Incident Management, Mentoring, Knowledge Sharing

Job Description

Our mission and customers: We are creating the freedom for SMEs to succeed by delivering Europe's leading finance workspace with banking at its core, augmented by financial tools. We are proud to be rated 4.8 on Trustpilot, based on 55,000+ reviews. Our culture puts customer satisfaction at the core of what we do, as proven by our Net Promoter Score of 75.

Our journey: Founded in 2017 by Alexandre and Steve, Qonto has grown to 1,600+ Qontoers serving 600,000+ customers across 8 European countries. We have been profitable since 2023, and we are just getting started.

Our beliefs: We hire for skills and potential. With 80+ nationalities, 45% women overall, and 56% women in our leadership team, diversity isn't a program; it's who we are. We've built a discrimination-free hiring process because the best teams are built on merit.

AI at Qonto: AI is deeply embedded in how we work. Every Qontoer gets unlimited access to the best AI tools. We want people who experiment without waiting for permission, push AI beyond the obvious, and know when to trust it and when to question it.

------------------------------------------------------------------------------------------------------

⭐ Mission: Join us as a Staff Site Reliability Engineer to be the strongest technical voice on our Platform Reliability team and help us scale a reliable infrastructure as Qonto grows toward 1 million customers across Europe.

⚡ Impact: As a Staff SRE, you will play a key role in shaping how our platform evolves: framing complex infrastructure challenges, driving architectural decisions, and enabling the entire tech department to ship faster and more reliably. You'll be a key technical reference, a mentor for junior engineers, and an active contributor to our knowledge-sharing culture.

Our SRE department is divided into 2 teams (18 talented engineers): Platform and Storage. You will join the Platform Reliability team, who believe that reliability is built before problems happen, not after.

👩‍💻🧑‍💻 As a Site Reliability Engineer at Qonto, you will

• Think big picture: Frame complex infrastructure problems, propose clear solutions, and drive projects end-to-end on your own initiative

• Build solid things: Work with backend, data, security, and engineering efficiency teams to design, deploy, and maintain our infrastructure

• Write real code: Spend 20 to 40% of your time writing Go services, tools, and APIs, to the same standards as our backend engineers

• Cut out toil: Automate what can be automated; we aim for as little repetitive work as possible

• Own observability: Keep the platform visible and debuggable through logs, metrics, and tracing

• Be part of on-call: Join the rotation, lead post-incident reviews, and turn incidents into lasting fixes

• Help others grow: Share knowledge, challenge ideas, and be someone your teammates can learn from

🛠 What stack you can expect

Cloud services: AWS, EKS

Container technology: Kubernetes, Docker

CI/CD: GitLab CI & ArgoCD

Monitoring: Metrics with Prometheus & Thanos, traces with OpenTelemetry, on-call with OpsGenie, and logs with Elasticsearch & Loki

DB and messaging: AWS RDS PostgreSQL, SQS, Redis, and Kafka

Programming languages: Go & Python

Infrastructure as Code: Terraform

🧠 What you can expect

• Design robust solutions at real scale: 25k pods, 86 microservices, 1,300 deployments per month

• Full autonomy to propose and drive your own ideas using lean methodologies and a bottom-up approach

• Modern ways of working: GitOps, AI-assisted engineering, microservices development, and the freedom to challenge the technical status quo

• Deep cross-team collaboration: spec reviews, brainstorming sessions, and problem-solving with backend, data, security, and more

🏅 About You

• Experience: You have strong hands-on experience with cloud-native infrastructure in production, including managing Kubernetes clusters at scale

• Programming skills: You have solid Go experience (or equivalent) and are comfortable building tools, services, and automation, not just config files

• AI-driven engineering: You actively use AI tools to move faster, write better code, and solve infrastructure problems more efficiently, and you're curious about what's coming next

• Problem solver: You understand the full system (dependencies, trade-offs, long-term impact) before proposing a solution. You anticipate problems rather than just reacting to them

• Team player: You share knowledge naturally, give clear feedback, and help less experienced engineers grow

"At Qonto we understand that true diversity isn't just about ticking boxes on a hiring checklist. Apply regardless of the boxes you tick! Who knows? You may have the missing piece of the puzzle we've been searching for all along"

------------------------------------------------------------------------------------------------------

On average, our hiring process lasts 20 working days. More information is available on our candidate journey page.

------------------------------------------------------------------------------------------------------

🔒 Your security matters to us

Recruitment scams are on the rise. Keep in mind, we will never work with third-party platforms or agencies that request payment from candidates.

If you receive a suspicious message claiming to be from Qonto, please report it right away ([email protected])

Staff Site Reliability Engineer (Platform Reliability)
