About the role
<h2><strong>About the Role:</strong></h2> <p><strong>Production Engineer</strong><br>The Production Engineer at Rubrik plays a critical role in operational excellence, managing alerts, responding to outages, and leading incident resolution as an Incident Manager. This role requires hands-on experience in maintaining highly available critical services across multi-cloud environments while driving continuous improvements through automation and intelligent monitoring.</p> <h2>What you’ll do:</h2> <ul> <li>Join a 24/7 Production Operations team responsible for managing and supporting critical infrastructure and services in multi-cloud environments.</li> <li>Oversee staging and production environments to ensure maximum uptime and reliability.</li> <li>Implement and maintain comprehensive observability solutions for real-time monitoring, alerting, and metrics collection.</li> <li>Lead incident management efforts by swiftly responding to alerts and outages, coordinating teams to drive timely resolution.</li> <li>Analyze recurring incidents to identify root causes, reduce toil, and improve system resilience.</li> <li>Design and develop automation tools to proactively detect, triage, and remediate production issues.</li> <li>Maintain and update runbooks to support incident response and recurring issues.</li> <li>Demonstrate strong decision-making skills under pressure, effectively managing critical situations with urgency and composure.</li> </ul> <h2>Experience you’ll need:</h2> <ul> <li>Solid understanding of distributed system concepts.</li> <li>Practical experience working with production systems and environments, preferably within public cloud infrastructures.</li> <li>Familiarity with container orchestration platforms, especially Kubernetes.</li> <li>Hands-on experience with infrastructure&nbsp; management tools like CloudFormation and Terraform.</li> <li>Strong analytical and problem-solving skills&nbsp; for diagnosing and resolving system and application issues.</li> <li>Proficient in data structures and algorithms, UNIX, networking, operating systems, and database systems such as MySQL.</li> <li>Proficient with Python programming skills.</li> <li>Excellent verbal and written communication skills.</li> </ul> <h4>Location: Bangalore, India<br>Work Shift: Rotation (24/7 coverage expected)&nbsp;<br><br></h4> <h2><strong>ABOUT RUBRIK</strong></h2> <p>Join Us in Securing the World's Data</p> <p>Rubrik (NYSE: RBRK) is on a mission to secure the world’s data. With Zero Trust Data Security™, we help organizations achieve business resilience against cyberattacks, malicious insiders