ARCHIVED
This job listing has been archived and is no longer accepting applications.
MisuJob - AI Job Search Platform MisuJob

Site Reliability Engineer - Systems (7 to 10 Years)

Phonepe

Bangalore (PhonePe Limited) Remote permanent

Posted: January 29, 2026

Interested in this position?

Create a free account to apply with AI-powered matching

Quick Summary

Site Reliability Engineer - Systems (7 to 10 Years)

Job Description

About PhonePe Limited:

Headquartered in India, its flagship product, the PhonePe digital payments app, was launched in Aug 2016. As of April 2025, PhonePe has over 60 Crore (600 Million) registered users and a digital payments acceptance network spread across over 4 Crore (40+ million) merchants. PhonePe also processes over 33 Crore (330+ Million) transactions daily with an Annualized Total Payment Value (TPV) of over INR 150 lakh crore.

PhonePe’s portfolio of businesses includes the distribution of financial products (Insurance, Lending, and Wealth) as well as new consumer tech businesses (Pincode - hyperlocal e-commerce and Indus AppStore Localized App Store for the Android ecosystem) in India, which are aligned with the company’s vision to offer every Indian an equal opportunity to accelerate their progress by unlocking the flow of money and access to services.

Culture:

At PhonePe, we go the extra mile to make sure you can bring your best self to work, Everyday!. And that starts with creating the right environment for you. We empower people and trust them to do the right thing. Here, you own your work from start to finish, right from day one. PhonePe-rs solve complex problems and execute quickly; often building frameworks from scratch. If you’re excited by the idea of building platforms that touch millions, ideating with some of the best minds in the country and executing on your dreams with purpose and speed, join us!

Site Reliability Engineer - System

Expeience: 7 to 10 Years

Summary

We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our team. The ideal candidate will have extensive experience in Linux systems administration, understanding of database management, and a proven track record of troubleshooting complex, system-level issues. You will be responsible for ensuring the reliability, performance, and scalability of our production environments, balancing system and database stability through robust monitoring, debugging, and automation practices.

Responsibilities:

• Lead incident response and resolution: Proactively troubleshoot, debug, and resolve complex system-level incidents and outages, encompassing Linux operating systems, applications, and database technologies.

• Conduct deep-dive root cause analysis: Perform thorough post-incident analysis to identify underlying issues in production environments, implementing sustainable solutions.

• Design and implement robust monitoring: Develop, maintain, and enhance comprehensive system and database monitoring, alerting, and observability solutions (e.g., Grafana, Prometheus, PMM).

• Drive automation and efficiency: Automate Linux system administration tasks, operational runbooks, and database maintenance to improve system reliability, consistency, and operational efficiency.

• Collaborate on resilient deployments: Partner with development and engineering teams to ensure seamless, reliable, and secure software deployments and infrastructure changes.

• Architect scalable infrastructure: Contribute to the architectural design and implementation of highly scalable, resilient, and performant infrastructure solutions.

• Enhance on-call effectiveness: Participate in and continuously improve on-call rotations, developing tools and processes to reduce alert fatigue and minimize human error.

• Foster technical growth: Mentor and guide junior Site Reliability Engineers (SREs), promoting knowledge sharing and skill development within the team.

Qualifications:

• Extensive Linux Expertise: Proven experience in advanced Linux systems administration, including deep understanding of file systems, kernel tuning (Sysctl), and performance optimization.

• Advanced Troubleshooting & Debugging: Exceptional ability to debug and rapidly resolve complex, distributed system-level issues in high-pressure production environments.

• Configuration Management: Hands-on experience with industry-standard configuration management tools (e.g., SaltStack, Ansible, Puppet).

• Load Balancing & Proxying: Practical experience with load balancing technologies (e.g., Nginx, HAProxy, LVS) and their configuration for high availability.

• Containerization & Orchestration: Strong understanding and practical experience with containerization (e.g., Docker) and container orchestration platforms (e.g., Kubernetes, Mesosphere).

• Monitoring & Alerting Tooling: Proficiency in implementing, maintaining, and leveraging system and database monitoring platforms (e.g., Grafana, Prometheus, PMM) and custom scripting for alerts.

• Automation & Scripting Mastery: Highly proficient in developing automation solutions using scripting languages (e.g., Python, Shell scripting, Go) for operational tasks.

• Networking Fundamentals: Solid understanding of core networking concepts and protocols (e.g., TCP/IP, DNS, DHCP, BGP, IPTables, IP & Routing protocols).

• Database Administration Fundamentals: Strong grasp of relational database concepts and practical experience with database administration principles.

Preferred Qualifications:

• Cloud Infrastructure Experience: Experience managing and troubleshooting private/on-premise cloud environments, with a focus on identifying and mitigating hardware-related issues and their impact.

• Relational Database Specialization: Deep practical experience with MariaDB, Percona Server, and/or MySQL, encompassing advanced database administration, performance tuning, and complex replication topologies.

• Backup & Recovery Expertise: Hands-on experience with robust backup and restore technologies, including ZFS.

• Message Queuing Systems: Familiarity with message queuing systems like RabbitMQ (RMQ).

PhonePe Full Time Employee Benefits (Not applicable for Intern or Contract Roles)

• Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance

• Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System

• Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program

• Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy

• Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment

• Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy

Our inclusive culture promotes individual expression, creativity, innovation, and achievement and in turn helps us better understand and serve our customers. We see ourselves as a place for intellectual curiosity, ideas and debates, where diverse perspectives lead to deeper understanding and better quality results. PhonePe is an equal opportunity employer and is committed to treating all its employees and job applicants equally; regardless of gender, sexual preference, religion, race, color or disability. If you have a disability or special need that requires assistance or reasonable accommodation, during the application and hiring process, including support for the interview or onboarding process, please fill out this form.

Read more about PhonePe on our blog.

Life at PhonePe

PhonePe in the news

Why Apply Through MisuJob?

AI-Powered Job Matching: MisuJob uses advanced artificial intelligence to analyze your skills, experience, and career goals. Our matching algorithm compares your profile against thousands of job requirements to find positions where you have the highest chance of success. This saves you hours of manual job searching and ensures you only see relevant opportunities.

One-Click Applications: Once you create your profile, applying to jobs is effortless. Your resume and cover letter are automatically tailored to highlight the most relevant experience for each position. You can apply to multiple jobs in minutes, not hours.

Career Intelligence: Beyond job matching, MisuJob provides valuable career insights. See how your skills compare to market demands, identify skill gaps to address, and understand salary benchmarks for your experience level. Make data-driven decisions about your career path.

Frequently Asked Questions

How do I apply for this position?

Click the "Register to Apply" button above to create a free MisuJob account. Once registered, you can apply with one click and track your application status in your dashboard.

Is MisuJob free for job seekers?

Yes, MisuJob is completely free for job seekers. Create your profile, get matched with jobs, and apply without any cost. We help you find your dream job without any hidden fees.

How does AI matching work?

Our AI analyzes your resume, skills, and experience to understand your professional profile. It then compares this against job requirements using natural language processing to calculate a match percentage. Higher matches mean better fit for the role.

Can I apply to jobs in other countries?

Absolutely. MisuJob features jobs from companies worldwide, including remote positions. Filter by location or look for remote opportunities to find jobs that match your preferences.

Ready to Apply?

Join thousands of job seekers using MisuJob's AI to find and apply to their dream jobs automatically.

Register to Apply