Senior DevOps Engineer (Secure On-Prem & MLOps Focus)
Confidential
Posted: February 27, 2026
Interested in this position?
Create a free account to apply with AI-powered matching
Quick Summary
Design, deploy, and operate high-security on-premise infrastructure supporting mission-critical and data-sensitive systems.
Required Skills
Job Description
Job Description:
We are seeking a highly skilled DevOps / Platform Engineer to design, deploy, and operate high-security on-premise infrastructure supporting mission-critical and data-sensitive systems. This role emphasizes hardened environments, Kubernetes-based orchestration,and scalable MLOps platforms supporting machine learning and Large Language Model (LLM) workloads.
The ideal candidate combines deep expertise in Infrastructure as Code, Kubernetes, networking, and security best practices with hands-on experience supporting ML pipelines and AI-driven applications. Strong ownership, systems thinking, and cross-functional collaboration are essential.
Key Responsibilities:
Design and maintain hardened on-prem infrastructure with strong security,
reliability, and compliance standards
Deploy and operate production-grade Kubernetes clusters
Implement IaC using Terraform
Enforce container security best practices (image scanning, RBAC, secrets
management, encryption)
Design secure networking architectures (segmentation, firewalls, controlled
ingress/egress)
Build and maintain CI/CD pipelines with integrated security controls (DevSecOps)
Support MLOps workflows
Deploy and optimize LLM inference workloads, including GPU-based environments
Implement monitoring, logging, and incident response processes
Document infrastructure and ensure auditability
Qualifications:
Bachelor’s degree in Computer Science, Information Technology, or a related field.
Strong experience managing secure on-premise environments.
Must have strong expertise in Infrastructure as Code (Terraform).
Must have strong expertise in Kubernetes and Docker.
Experience working with CI/CD pipelines (e.g., GitHub Actions).
Experience supporting ML/LLM workloads and MLOps practices.
Strong version control experience with Git.
Familiarity with monitoring tools, incident management, and troubleshooting
practices.
Understanding of cybersecurity standards and regulatory compliance requirements.
Required Behaviors:
Ability to translate product requirements into actionable technical solutions.
Demonstrates independent ownership of tasks and projects with quality delivery.
Communicates effectively across teams, providing clear updates on progress.
Strong understanding of wider product development and team interdependencies.
Articulates complex technical details clearly to both technical and non-technical stakeholders.
Focus on clear, effective documentation of infrastructure and processes.
Proactively solves problems with a creative and analytical approach.
Thrives in uncertain, rapidly evolving environments while maintaining results orientation.
Demonstrates motivation to contribute to the success of the team and larger business goals.