Site Reliability Engineering Manager
Okta
Posted: February 9, 2026
Quick Summary
We're looking for a Site Reliability Engineering Manager to join our team and help us build a world where identities are secure and accessible.
Job Description
Get to know Okta
Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.
At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.
Join our team! We’re building a world where Identity belongs to you.
As a Site Reliability Engineering Manager, you will champion all things reliability at Okta for our Auth0 product. Working closely with product engineers, quality engineers, platform engineers, and architecture teams, you will focus primarily on keeping production systems operational at all times while continually setting and achieving long-term performance, reliability, and scalability goals for a platform that is planned to grow over the coming years.
You will play a key role in Auth0’s dedication to ensuring customers’ uninterrupted access to business-critical enterprise and consumer applications. This is a hands-on role in which you will directly operate, troubleshoot, and scale our production systems by responding to monitoring alerts and managing incidents as part of the team's 24/7 on-call rotation. Your work is critical to meeting the demands of ever-increasing traffic and user growth for customers who rely on us to provide a reliable product experience.
At Auth0, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.
What you’ll do
Drive the technical direction of the team, working with SRE leadership to translate the organizational vision into an actionable technical roadmap.
Participate in a global on-call rotation featuring a follow-the-sun model on weekdays and a lower-frequency, shared rotation for weekends to remediate incidents on critical systems.
Lead and drive complex, cross-functional initiatives that require partnership with external platform and product teams.
Use existing monitoring tools to identify problems, then resolve them or escalate to service teams.
Implement changes to enable or improve infrastructure resilience, monitoring, and alerting.
Develop and continuously refine SRE tools and processes to improve software delivery, observability, reliability, and operational efficiency.
Optimize existing systems and eliminate toil through simplification and automation.
Define, document, and advocate reliability best practices and policies.
Represent SRE as a senior technical expert in architectural reviews and strategic planning, ensuring reliability is a primary consideration in all major engineering efforts.
Mentor other SREs through pair programming, design discussions, and code reviews to level up the team's technical capabilities.
What you'll need to be successful
3+ years of experience managing SRE or SWE teams, ideally in a cloud native environment.
Strong leadership, communication, and project management skills.
Have 8+ years of industry experience, with a proven track record of leading complex, cross-functional technical projects.
Believe in the SRE mindset: you are data-driven, embrace a blameless culture, and apply software engineering practices to operational problems.
Have demonstrable experience participating in a 24/7 on-call rotation.
Possess deep expertise in a major cloud provider (Azure, AWS).
Have demonstrable experience managing infrastructure as code with Terraform at scale.
Have a strong understanding of cloud-native architecture, including containers (Docker, Kubernetes), microservices, modern networking concepts, and various database technologies (SQL, NoSQL, etc.).
Demonstrate strong proficiency in Go or Python with proven experience building and maintaining production-grade software, tools, and automation.
Have a systematic problem-solving approach, coupled with a strong sense of ownership and the drive to see complex issues through to resolution.
Have strong interpersonal and collaboration skills, with a proven ability to build relationships and work effectively in a globally distributed, remote-first team.
Are passionate about acting as a force multiplier, mentoring senior engineers and elevating the technical capabilities of the entire team.
Have a strong interest in shaping the team's technical vision and actively contributing to its strategic direction and leadership decisions.
What you can look forward to as a Full-Time Okta employee!
Amazing Benefits
Making Social Impact
Developing Talent and Fostering Connection + Community at Okta
Okta cultivates a dynamic work environment, providing the best tools, technology, and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility it offers, so that all employees can be their most creative and successful selves, regardless of where they live. Find your place at Okta today! https://www.okta.com/company/careers/
Some roles may require travel to one of our office locations for in-person onboarding.
Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws.
If a reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding, please use this Form to request an accommodation.
Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please
Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at https://www.okta.com/legal/personnel-policy/.
Okta
The foundation for secure connections between people and technology
Okta is the leading independent provider of identity for the enterprise. The Okta Identity Cloud enables organizations to securely connect the right people to the right technologies at the right time. With over 7,000 pre-built integrations to applications and infrastructure providers, Okta customers can easily and securely use the best technologies for their business. More than 19,300 organizations, including JetBlue, Nordstrom, Slack, T-Mobile, Takeda, Teach for America, and Twilio, trust Okta to help protect the identities of their workforces and customers.