Lead Site Reliability Engineer (Service Bus/Terraform)
Nix
Posted: February 16, 2026
Job Description
We are looking for a Lead Site Reliability Engineer to join the microservices framework team, focused on building highly scalable, available, and reliable distributed systems.
Working hours: 15:00-23:00 CET.
Our clients create, maintain, and operate scalable technology and data solutions that deliver an exceptional experience for their customers.
You will be responsible for designing high-performance, resilient architectures, developing microservices frameworks, and implementing cloud-based data platform systems to support global-scale business needs. You will also work on infrastructure automation, CI/CD pipelines, and Terraform-based deployments.
Responsibilities:
• Think about how to solve problems at scale and build automation to manage complex software systems
• Develop testable, high-quality, and ship-ready code with ample test coverage
• Work with Product Management and other developers to understand and translate engineering requirements into design and architectural solutions
• Work as part of a cross-site development team to drive design, implementation, testing, and release of microservices platforms
• Design, build, and maintain CI/CD pipelines to automate builds, testing, and deployments
• Manage infrastructure as code using Terraform to deploy and maintain cloud environments
• Implement best practices for cloud security, performance, and cost optimization
• Collaborate with cross-functional teams to define technical architecture and cloud strategies
• Participate in on-call rotations and contribute to improving system reliability and incident response
• Design, build, and maintain robust data platform solutions leveraging modern tools
• Own and optimize disaster recovery on Service Bus: build DRI dashboards that show service owners where they hold too many resources, rightsize those resources, and maintain the related Terraform code
Basic Qualifications:
• 5+ years of hands-on software development experience in an object-oriented programming language such as C#, C++, or Java
• 5+ years of experience with cloud deployment and configuration tools, including scripting and configuration platforms
• Hands-on experience with system architecture, API design, and distributed systems
• Experience designing, deploying, and maintaining CI/CD pipelines to automate application builds, tests, and deployments
• Proficiency in managing data platform infrastructure as code (IaC) using Terraform
• Experience participating in an on-call rotation
• Familiarity with Databricks and Snowflake
We offer*:
• Flexible working format: remote, office-based, or flexible
• A competitive salary and good compensation package
• Personalized career growth
• Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
• Active tech communities with regular knowledge sharing
• Education reimbursement
• Memorable anniversary presents
• Corporate events and team-building activities
• Other location-specific benefits
*not applicable for freelancers