Reliability Engineer

Systems Engineering Solutions Corporation

Hanscom Air Force Base, Massachusetts, United States (Hybrid, Permanent)

Posted: March 31, 2026

Quick Summary

This Reliability Engineer role at Hanscom Air Force Base in Massachusetts focuses on ensuring the availability, performance, scalability, and resiliency of mission-critical systems through software engineering principles, automation, monitoring, incident response, and continuous reliability improvement.

Job Description

We currently have an opening for a Reliability Engineer supporting the U.S. Air Force Cloud One Architecture and Common Shared Services contract. The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission-critical systems. The role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement. The Reliability Engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities.

Location: This position is hybrid remote. Candidates will be required to work onsite as needed. Candidates located near Hanscom AFB (Boston, MA) are preferred.


Requirements:

System Reliability & Availability

• Design, implement, and maintain highly available, fault-tolerant systems in cloud and hybrid environments
• Define, measure, and report Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets
• Identify reliability risks and implement mitigation strategies across the system lifecycle
• Conduct capacity planning and performance modeling to ensure systems scale to meet demand

Monitoring, Observability & Alerting

• Implement and manage monitoring, logging, and tracing solutions to provide full system observability
• Define actionable alerting thresholds that minimize noise and enable rapid incident detection
• Analyze trends and metrics to proactively identify potential reliability issues

Incident Response & Problem Management

• Participate in on‑call rotations and lead incident response activities for production systems
• Coordinate troubleshooting efforts across development, infrastructure, and security teams
• Conduct post‑incident reviews (PIRs) and develop corrective and preventive action plans
• Track recurring issues and ensure root causes are resolved

Automation & Engineering Excellence

• Automate operational tasks to reduce manual intervention and operational risk
• Develop scripts, tools, and services that improve system reliability and reduce mean time to recovery (MTTR)
• Promote “automation over toil” and standardize operational workflows

Reliability‑Focused Engineering

• Participate in architecture and design reviews with an emphasis on reliability, resiliency, and recoverability
• Validate disaster recovery (DR) and business continuity plans; test failover mechanisms
• Support chaos engineering, fault injection testing, and resilience validation where appropriate

Collaboration & Governance

• Partner with DevOps, Platform, and Security teams to ensure reliability aligns with delivery and compliance objectives
• Document system reliability standards, runbooks, and operational procedures
• Support compliance and audit activities (e.g., FedRAMP, FISMA, internal operational controls)

Required Skills:

· Bachelor's degree and eight (8) or more years of experience, or Master's degree and six (6) or more years of experience. Additional experience may be accepted in lieu of a degree.

· Active Secret clearance (at a minimum) required to start

· US citizenship required

· Experience with cloud platforms (AWS, Azure, OCI, or GCP), including managed services

· Experience with containerized environments (Docker, Kubernetes)

· Familiarity with CI/CD pipelines and deployment automation

· Experience with SLOs and error budgets

· Experience with capacity modeling and performance testing

· Strong understanding of:

    · Distributed systems and high-availability architectures

    · Linux/Windows system administration

    · Networking fundamentals (DNS, TCP/IP, load balancing)

· Hands-on experience with:

    · Monitoring and observability tools (e.g., Prometheus, Grafana, ELK/Elastic, Datadog, Azure Monitor)

    · Infrastructure as Code (Terraform, ARM, CloudFormation)

    · Scripting or programming languages (Python, Bash, Go, PowerShell, or similar)

· Experience supporting incident management and on‑call operations

Preferred Skills

• Experience with USAF Cloud One or Platform One
• Experience with Zero Trust Architecture
• Cloud certifications in AWS, Azure, Google, or Oracle clouds


Benefits:

SES provides a competitive salary and the following benefits:

• Medical
• Dental
• Vision
• AD&D
• STD
• LTD
• Company paid Life Insurance
• 401k with employer contribution
• Paid Time Off
• Pet Insurance
