ARCHIVED
This job listing has been archived and is no longer accepting applications.

Lead Site Reliability Engineer

Health Catalyst

Remote

Posted: December 8, 2025

Job Description

Join one of the nation’s leading and most impactful healthcare performance improvement companies. Over the years, Health Catalyst has achieved and documented clinical, operational, and financial improvements for many of the nation’s leading healthcare organizations. We are also increasingly serving international markets. Our mission is to be the catalyst for massive, measurable, data-informed healthcare improvement through:
Data: integrate data in a flexible, open & scalable platform to power healthcare’s digital transformation
Analytics: deliver analytic applications & services that generate insight on how to measurably improve
Expertise: provide clinical, financial & operational experts who enable & accelerate improvement
Engagement: attract, develop and retain world-class team members by being a best place to work
Role: Lead Site Reliability Engineer
Team: Technology
Location: US remote
Travel: none anticipated
**This position is currently not eligible for visa sponsorship**
Job Summary
As a DevOps / Site Reliability Engineer, you’ll help shape and sustain the infrastructure behind Armus, a core Health Catalyst platform that drives outcomes for clinicians and patients across the country. You’ll work closely with software engineers and product teams to design, automate, and operate the cloud environments that power our clinical registries and analytics solutions.
This is a high-visibility, high-expectation role for someone who thrives on accountability, loves solving complex system problems, and wants to grow within a cross-product SRE group that spans multiple technologies and teams. You’ll ship improvements weekly, automate relentlessly, and end each day knowing your work improves healthcare outcomes.
If You Love
Building reliable, scalable systems that stay up and perform under pressure
Taking a half-defined problem and driving it to a clean, measurable solution
Balancing speed and safety through automation, testing, and disciplined process
Mentoring others, reviewing code, and strengthening DevOps culture
Working across application, infrastructure, and security boundaries to make systems better every week
Then this role will fit you perfectly.
What You’ll Own
Cloud Infrastructure (Google Cloud Platform Focus)
Design, implement, and operate scalable, secure, and resilient infrastructure on Google Cloud Platform (GCP), with a heavy focus on Google Kubernetes Engine (GKE)
Apply best practices in container orchestration, networking, IAM, and workload identity
Lead cloud cost optimization, capacity planning, and efficient scaling initiatives
Manage infrastructure as code using Terraform or similar tools
While GCP is the core environment, equivalent experience with AWS or Azure will be considered
CI/CD and Automation
Build and maintain CI/CD pipelines using Jenkins or GitLab CI/CD
Ensure reliable deployment flows across development, staging, and production environments
Implement automated checks and rollback mechanisms for safe, repeatable releases
Reliability, Monitoring, and Incident Response
Implement and refine observability using Sentry, Sumo Logic, and GCP Cloud Monitoring and Logging
Participate in the on-call rotation, respond quickly to operational issues, and drive long-term fixes
Collaborate with customer success and support teams to quantify and resolve production impact
Identify reliability risks early, automate detection and recovery, and reduce manual toil
Security and Compliance
Apply and maintain least-privilege IAM policies and secure configuration baselines
Partner with InfoSec to remediate vulnerabilities and support HIPAA and SOC 2 audit readiness
Contribute to incident response readiness and disaster recovery testing
Collaboration and Continuous Improvement
Engage with the cross-product SRE squad to learn and contribute across multiple Health Catalyst platforms
Help standardize SRE best practices, tooling, and documentation
Mentor teammates and continuously raise the bar for reliability and automation
What You Bring
5–7 years of experience in DevOps, SRE, or Cloud Infrastructure Engineering
Deep expertise in GCP, especially GKE
Experience with other major clouds (AWS or Azure) is a plus
Strong working knowledge of Kubernetes and containerized deployments
Proven experience with CI/CD tools such as Jenkins or GitLab
Scripting experience in Python, Bash, or similar languages
Solid understanding of networking, security, and performance fundamentals
Hands-on experience with cloud cost management and optimization
Calm under pressure with strong troubleshooting and communication skills
Nice to Have
Exposure to healthcare data or interoperability standards such as FHIR, HL7, or CDA
Familiarity with healthcare security and compliance frameworks like HIPAA and SOC 2
Experience in Agile or Scrum software development environments
Background supporting SaaS or multi-tenant systems

What Success Looks Like
Platform uptime consistently meets or exceeds SLA targets
Deployments are automated, low-risk, and frequent
Reliability metrics improve quarter over quarter
Infrastructure costs are measured, optimized, and trending down
The Armus platform is seen as a model for SRE practices across Health Catalyst

Why This Role Matters
This role is central to the continued growth and reliability of the Armus platform. In this seat, you won’t just maintain systems; you’ll shape how Health Catalyst operates cloud infrastructure at scale. You’ll drive uptime, automation, and cost efficiency while influencing SRE practices company-wide.
If you want to do meaningful engineering work with real impact and high expectations, this is the opportunity.
Information Security and Compliance Responsibilities:
Maintain compliance with training directives required by the organization pertaining to Information Security, Acceptable Use Policy and HIPAA Privacy and Security.
Adhere to and comply with the organization’s Acceptable Use Policy.
Safeguard information system assets by identifying and reporting potential and actual security events to the organization’s Security and Compliance Officers.
The above statements describe the general nature and level of work being performed in this job function. They are not intended to be an exhaustive list of all duties, and indeed additional responsibilities may be assigned by Health Catalyst.
Studies show that candidates from underrepresented groups are less likely to apply for roles if they don’t have 100% of the qualifications shown in the job posting. While each of our roles has core requirements, please thoughtfully consider your skills and experience and decide if you are interested in the position. If you feel you may be a good fit for the role, even if you don’t meet all of the qualifications, we hope you will apply. If you feel you are lacking the core requirements for this position, we encourage you to continue exploring our careers page for other roles for which you may be a better fit.
At Health Catalyst, we appreciate the opportunity to benefit from the diverse backgrounds and experiences of others. Because of our deep commitment to respect every individual, Health Catalyst is an equal opportunity employer.
