MisuJob - AI Job Search Platform

Jobs

Browse 250+ jobs updated daily

Latest Job Openings

Data Site Reliability Engineering (m/f/d)

Company name visible to EXPERT members

Remote
GitOps, Containerization and container management, Packaging of applications, Customization of deployments

Data Site Reliability Engineering (m/f/d) is a 100% remote position that involves ensuring the reliability of business-critical applications in Frankfurt, 50% remote....

April 10, 2026
San Francisco, California, United States permanent
Production Reliability, Incident Response, On-Call, Platform Engineering, Infrastructure as Code, Capacity Planning, Performance Optimization, Security, Reliability Culture

Who We Are We are an applied AI lab building end-to-end software agents. We're the team behind Devin, the first AI software engineer, and Windsurf, an AI-native IDE. These products represent our visi...

October 13, 2025
Bengaluru, INDIA, India Hybrid permanent
Docker, Kubernetes, Cloud Technologies, Automation, Best Practices, Problem-Solving, Mentorship, Technology Trends

Founded by experienced entrepreneurs and engineers in 2016, Pismo is a technology company that provides a comprehensive processing platform for banking, card issuing and financial market infrastructur...

March 18, 2026
Pune, Maharashtra, India permanent
Site Reliability Engineering, Cloud, Kubernetes, Docker, Terraform, Infrastructure as Code, Monitoring, Datadog, Incident Management

About us: Metro Global Solution Center (MGSC) is the internal solution partner for METRO, a €29.8 billion international wholesaler with operations in 32 countries through 625 stores & a team of 91...

March 16, 2026
New York, NY, United States of America (US - CA - Bay Area - Remote) Remote permanent
AWS, Docker, Kubernetes, PostgreSQL, React, Terraform, Observability, Incident Management, AI-Driven Tooling

Block is one company built from many blocks, all united by the same purpose of economic empowerment. The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Secur...

April 9, 2026
Bay Area, CA, United States of America (US - CA - Bay Area - Remote) Remote permanent
Cloud, Docker, Kubernetes, PostgreSQL, Monitoring, Observability, Alerting, Incident Management, Leadership

Block is one company built from many blocks, all united by the same purpose of economic empowerment. The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Secur...

April 9, 2026
Somerset, United Kingdom Agency contract
GCP, DevOps, Google Cloud Platform, Apigee, Networking, Cloud Armor, Load Balancer, GKE, Storage, Terraform, Vault, Harness

GCP DevOps Engineer with experience in GCP infrastructure and security is required for this role....

April 9, 2026
Remote Remote permanent
Site Reliability Engineering, Cloud Infrastructure, Distributed Systems, Linux, Observability, Incident Management, Collaboration, Knowledge Sharing, Automation

What’s in it for you? Ready to make a serious impact? Millions of people already rely on Calendly, and we’re still in the midst of exciting product growth — it’s a fantastic time to join us. Everythi...

April 9, 2026
Toronto, ON, CAN permanent
Site Reliability Engineering, Cloud Services, AWS, Software Development, Reliability, Automation, Observability, Resilience, Developer Background, SRE Standards

Application Deadline: 05/30/2026 Address: 4100 Gordon Baker Road Job Family Group: Technology Hybrid role (2 days/week in Scarborough office). Out of province candidates should consider relocati...

April 9, 2026

Site Reliability Engineer II

Sonyinteractiveentertainmentglobal

United States, Aliso Viejo, CA (USA-CA-Aliso Viejo) Remote permanent
Software Development, Linux Systems Administration, API Gateway, Service Mesh, Traffic Management, mTLS, Incident Response, Production Readiness, Cloud Computing

Why PlayStation? PlayStation isn’t just the Best Place to Play — it’s also the Best Place to Work. Today, we’re recognized as a global leader in entertainment producing The PlayStation family of prod...

April 9, 2026
Remote (SSV Labs) Remote permanent
Cloud, Kubernetes, Docker, LLM, Automation, Incident Management, AI, Production Deployments

About SSV Labs SSV Labs is the core team behind the SSV Network - pioneering decentralized infrastructure for Ethereum staking. We’re building tools, protocols, and standards to make staking more sec...

April 9, 2026
New York, NY / Bellevue, WA (Bellevue, WA, New York, NY) Remote permanent
Site Reliability Engineering, Kubernetes, Containerized Services, Argo CD, GitHub Actions, Production Systems, High Availability, Incident Response, DevSecOps, Multi-region Deployment

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

April 9, 2026
Seattle, Washington, United States (Georgia-Atlanta Office) Hybrid permanent
Cloud, Go, Python, Kubernetes, AWS, Azure, DevOps, Incident Response, Documentation

Join Axon and be a Force for Good. At Axon, we’re on a mission to Protect Life. We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and cloud sof...

April 9, 2026
Atlanta, Georgia, United States (Georgia-Atlanta Office) Hybrid permanent
Cloud, Go, Python, Kubernetes, AWS, Azure, DevOps, Incident Response, Code Quality, Documentation

Join Axon and be a Force for Good. At Axon, we’re on a mission to Protect Life. We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and cloud sof...

April 9, 2026
Boston, Massachusetts, United States (Georgia-Atlanta Office) Hybrid permanent
Cloud, Go, Python, Kubernetes, AWS, Azure, DevOps, Incident Response, Code Quality, Documentation

Join Axon and be a Force for Good. At Axon, we’re on a mission to Protect Life. We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and cloud sof...

April 9, 2026
Singapore, Singapore, Singapore permanent
Reliability Engineering, Incident Response, Disaster Recovery, High Availability, Observability, Monitoring, Configuration Management, High-Traffic Apps, Global Architecture, Multi-region Deployment

Site Reliability Engineer (SRE) – Globalization Location: Singapore Function: Infrastructure / SRE / Platform Engineering About the Role Our client is a rapidly scaling global consumer internet pla...

April 9, 2026
Dubai, Dubai, United Arab Emirates Hybrid permanent
Leadership, Team Management, SRE Leadership, Reliability Engineering, Incident Management, Incident Response, Capacity Planning, Scalability, Disaster Recovery, Automation, SLOs, SLIs

At Sana Commerce we're committed to an inclusive environment and recognize that our diverse workforce is one of our greatest strengths. It all started in 2007, with a pizza and a plan. Sana Com...

April 1, 2026
Remote – India Remote permanent
Troubleshooting Support, Code Fixes, Consultative solutioning, Safety Incident Response

We are seeking an experienced Site Reliability Engineer (SRE) with expertise in Infrastructure as Code tools like Terraform, core CI/CD tools such as Azure DevOps, and monitoring tools including DataD...

April 9, 2026
Maplewood, New Jersey, United States Remote permanent
AWS, Java, MySQL, JavaScript, Unix, Performance, Customer Service, Communication, Time Management

It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market le...

April 8, 2026
United States (Spain) Remote permanent
Cloud, Kubernetes, Terraform, Automation, Incident Response, Customer Focus, Highly Collaborative, Cloud Platforms

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI P...

April 8, 2026
San Jose, CA, USA (United States of America) Remote permanent
Cloud, Docker, Kubernetes, IaC, Azure, Observability, Incident Response, Documentation, Reliability

Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As t...

April 8, 2026
United States (New York City, Remote North America, Toronto) Remote permanent
Networking, Distributed Systems, Kubernetes, AWS, Azure, GCP, Service Mesh, Load Balancing, IPv6, DNS, TLS

The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that support the broader engineering organization. Amon...

April 8, 2026
Toronto; Vancouver (New York City, Remote North America, Toronto) Remote permanent
Networking, Distributed Systems, Cloud Platforms, Service Mesh, Load Balancing, TLS, IPv6, DNS, VPN, CDNs, On-Call Support

The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that support the broader engineering organization. Amon...

April 8, 2026
Montreal; Toronto (New York City) Remote permanent
6+ years of experience, Python, Go, Stateful Storage, customer-focused mindset, process automation, Kubernetes, AWS, cloud infrastructure, Linux networking

MongoDB’s Storage Layer Services (SLS) team is re-architecting the MongoDB cloud storage layer and sits at the heart of our next-generation cloud storage architecture. This relatively new team is buil...

April 8, 2026
Boston; Miami; New York City; Pittsburgh; Raleigh; United States (New York City) Remote permanent
Software Development, Distributed Systems, Python, Go, Database Systems, Reliability, Durability, Consistency, Recovery, Customer-Focused, Efficiency, Stateful Storage

MongoDB’s Storage Layer Services (SLS) team is re-architecting the MongoDB cloud storage layer and sits at the heart of our next-generation cloud storage architecture. This relatively new team is buil...

April 8, 2026
Seattle, Washington, United States (Washington, DC (909)) permanent
Cloud, Docker, Kubernetes, Systems Integration, Root Cause Analysis, Post-Mortem Analysis, Cloud Architecture, Robotics, Mesh Networking, Scalable Solutions

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model...

April 8, 2026
Washington, District of Columbia, United States (Washington, DC (909)) permanent
Site Reliability, Networking, Autonomy, Systems Integration, Robotics, Cloud, Sustainability, Scalable, Fault Tolerant, Customer Relationships

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model...

April 8, 2026
Colorado Springs, Colorado, United States permanent
Reliability, Automation, Monitoring, Alerting, Runbooks, Incident Management, Security Clearance, Top Secret Clearance, Collaboration

About Onebrief Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. By transforming this work, Onebrief makes the staff as a whole superhuman - meanin...

April 8, 2026
Chantilly, Virginia, USA permanent
Site Reliability, AWS, Security Clearance, Incident Management, Monitoring, Alerting, Runbooks, Documentation, Automation, Collaboration

About Onebrief Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. By transforming this work, Onebrief makes the staff as a whole superhuman - meanin...

April 8, 2026
Austin, TX and/or Miami, FL (Austin, TX, Miami, FL - HQ) Remote permanent
Site Reliability Engineering, Hybrid Cloud, Cloud-Enabled Systems, Terraform, Kubernetes, Helm, Ansible, Observability, Monitoring, Incident Response, Reliability, Security

Who We Are Core Scientific is a leading provider of infrastructure for high-performance compute in North America. Our mission is to accelerate digital innovation by scaling high-value compute rapidly...

April 8, 2026
Novi Sad, South Bačka, Serbia, EMEA (SRB - Novi Sad) Hybrid permanent
AWS, Azure, GCP, Terraform, Ansible, Buildkite, Pulumi, PostgreSQL, Python, Managed Kubernetes

From Fivetran’s founding until now, our mission has remained the same: to make access to data as simple and reliable as electricity. With Fivetran, customer data arrives in their warehouses, canonical...

April 8, 2026
Vancouver, BC, Canada Remote permanent
Site Reliability Engineering, Kubernetes, Elixir, Infrastructure, Monitoring, Observability, Incident Response, AI Systems, Security, Compliance

Hiive is redefining how private companies and their shareholders access liquidity. Through its institutional-grade platform, Hiive brings together buyers, sellers, and issuers to facilitate secondary ...

April 8, 2026
Toronto, Ontario, Canada (Toronto) Remote permanent
Go, Docker, Kubernetes, AWS, Linux, Monitoring, Automation, Incident Response, Reliability-Centered Maintenance

Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embr...

April 8, 2026
Paris Remote permanent
Site Reliability Engineering, Cloud Services, Kubernetes, Go Programming, Automation, Observability, Incident Management, Mentoring, Knowledge Sharing

Our mission and customers: We are creating the freedom for SMEs to succeed by delivering Europe's leading finance workspace with banking at its core, augmented by financial tools. We are proud to be r...

March 14, 2022

Senior Site Reliability Engineer

LSEG (London Stock Exchange)

IND-BLR-Divyasree Technopolis permanent
AWS, Azure, Terraform, Docker, Kubernetes, AWS CloudWatch, Azure Monitor, Datadog, Incident Response, Observability, Infrastructure as Code

Senior Engineer, Site Reliability Engineering (Cloud Focus: AWS & Azure) Our Team We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in Site...

April 8, 2026
India, Bengaluru permanent
Computer Science, Kubernetes, IaC, Multi-Cloud, Observability, Incident Management, Performance Optimization, Data-Driven Operations

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 30 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. To...

April 8, 2026
Chicago, IL, United States Remote permanent
Cloud, Kubernetes, Docker, Terraform, Git, Incident Response, Production Excellence, Incident Management, Team Mentoring, Process Improvement

Are you ready to trade your job for a journey? Become a FlyMate! Passion, excitement & global collaboration are all core to what it means to be a FlyMate. At Flywire, we’re on a mission to deliver th...

April 7, 2026
Singapore, Singapore, Singapore permanent
Site Reliability, Production Reliability, Operational Excellence, Incident Management, Monitoring, Alerting, Root Cause Analysis, Post Event Reporting, System Resilience, Service Ownership, Production Readiness, Deployment Safety

About k-ID k-ID is the global leader in privacy-first compliance and age verification infrastructure. Recognized as one of TIME’s Best Inventions of 2025, named a Tech Pioneer by the World Economic F...

April 8, 2026
Singapore, Singapore, Singapore permanent
AWS, Kubernetes, Cloud, DevOps, Observability, Incident Response, Security, Automation, Scalability

About k-ID k-ID is the global leader in privacy-first compliance and age verification infrastructure. Recognized as one of TIME’s Best Inventions of 2025, named a Tech Pioneer by the World Economic F...

April 8, 2026
United States Remote permanent
AWS, Terraform, cloud operations, security, scalability, observability, software development, scripting

HomeVision is building products to modernize real estate valuation and create a more efficient, transparent, and equitable housing market. We leverage technologies like NLP, computer vision, and large...

April 8, 2026
Mexico City Hybrid permanent
Java, Continuous Integration, Continuous Delivery, DevOps, AWS, Git, Linux, Problem Solving, Security, Automation

Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billions of digital interactions th...

April 7, 2026
Remote (US) (Ann Arbor, Remote) Remote permanent
Kubernetes, Google Cloud Platform, Cloud-based environments, Helm, Crossplane, Resilient Networks, Developer Velocity, On-call Rotation, Container Images, Self-service Platform

Company Background Censys’ mission is to be the one place to understand everything on the internet. Frustrated by the lack of trustworthy Internet intelligence, we set out to create the industry’s mo...

April 7, 2026
Frederick (Axle Informatics LLC) Remote permanent
Cloud Platforms, Docker, Kubernetes, PostgreSQL, Monitoring, Observability, Grafana, Splunk, OpenTelemetry, AIOps, Alerting

(ID: 2025-1135) Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers a...

April 7, 2026
3 Locations permanent
Site Reliability Engineering, Cloud Services, AWS, Azure, Kubernetes, Containerization, REST APIs, API Gateways, Monitoring, Alerting, Data Debugging

The Opportunity The Ethos Network Reliability team is looking for a Sr Site Reliability Engineer to work with our group. We deploy Adobe-wide software and infrastructure technology focused on Adobes...

April 7, 2026
Remote, India (India) Remote permanent
Site Reliability, Disaster Recovery, Data Migration, Automation, Observability, GitLab, Dedicated, Hygiene, Cutover Model

GitLab is the intelligent orchestration platform for DevSecOps. GitLab enables organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, an...

April 7, 2026
Brentwood, Tennessee Hybrid permanent
Kubernetes, EKS, ElasticSearch, Observability, Terraform, GitHub Actions, ArgoCD, AWS, IAM, Cloud Security, Incident Management

Senior/Staff Site Reliability Engineer, Platforms. Location: Nashville, TN (Hybrid, 3 days in office). Compensation: $140,000 - $210,000 base salary. Eligibility: U.S. residents only. About Us At ...

April 7, 2026
North America (North America Region) Remote permanent
Cloud Platforms, Kubernetes, Docker, Terraform, AWS, Azure, GCP, Python, Monitoring, Observability

Do you want to help make the world safe from cyber attack? At Corelight, we believe that the best approach to cybersecurity risk starts with the network. Attackers can evade endpoint detection, firew...

April 7, 2026

Site Reliability Engineer

Motorola Solutions

Krakow, Poland permanent
Troubleshooting, System Architecture, Incident Command, Performance Tuning, Java, Angular, Cloud Platforms, Incident Response, Code-Level Reliability

Company Overview At Motorola Solutions, we believe that everything starts with our people. We’re a global close-knit community, united by the relentless pursuit to help keep people safer everywhere. ...

April 7, 2026