Site Reliability Manager

Macmillan Publishers

Job Location : New York, NY, USA

Posted on : 2025-08-15T17:13:50Z

Job Description :
The Site Reliability Manager (SRM) maintains the availability, reliability, and performance of internal applications and SaaS platforms. This role involves managing incidents, optimizing system performance, and ensuring operational excellence through automation and monitoring strategies.

What you'll do:
- Lead incident management processes, ensuring swift resolution and communication during outages.
- Conduct root cause analyses and implement preventive measures.
- Design and maintain robust monitoring systems for internal and third-party applications, establishing SLIs, SLOs, and SLAs.
- Automate operational tasks and develop self-healing systems to reduce manual intervention.
- Collaborate with cross-functional teams and vendors to maintain system performance and address potential reliability issues proactively.
- Provide leadership in system performance reporting, ensuring proactive communication with stakeholders on system health, ongoing initiatives, incident updates, and post-resolutio...