Site Reliability Engineer - MultiBank Group : Job Details

Site Reliability Engineer

MultiBank Group

Job Location : New York, NY, USA

Posted on : 2025-08-09T01:03:29Z

Job Description :

Welcome to MultiBank Group, a global financial pioneer established in 2005 in California and now proudly headquartered in Dubai, UAE. We specialize in delivering cutting-edge trading technology, unparalleled liquidity, and exceptional customer service. Our extensive range of financial products includes Forex, Metals, Shares, Indices, Commodities, and Cryptocurrency CFDs.

Join our thriving community of over 2 million clients across 100 countries, contributing to a daily trading volume exceeding US$35 billion. As a heavily regulated institution with oversight from 17+ financial regulators across 5 continents, and recipient of over 70 financial awards, MultiBank Group is devoted to innovation, excellence, and empowering our clients to achieve their financial goals.

Role Overview

The Site Reliability Engineer ideally comes from a strong software engineering background and has an automation mindset. The role is responsible for overseeing the reliability, scalability, and performance of the infrastructure and services. This involves supporting the team's day-to-day activities, defining strategies for improving system reliability, and adopting best practices in automation, incident response, and infrastructure management.

Key Responsibilities

SRE Practices

Implement SRE strategy, processes, and practices defined by the organization, ensuring that they are adhered to within the team.
Bridge the gap between development and operations, ensuring seamless collaboration that enhances reliability.
Apply a software engineering mindset to operational challenges.
Automate repetitive tasks to minimize manual effort and reduce toil.
Implement robust monitoring and alerting mechanisms for early issue detection and response.

System Reliability and Incident Management

Oversee system health, ensuring a high level of reliability, uptime, and performance across production environments.
Lead incident management efforts, including response, resolution, and post-mortem reviews, ensuring root causes are identified and mitigated.
Drive the development of incident response protocols and on-call rotations to ensure 24/7 support and quick resolution of critical issues.

Automation and Infrastructure Optimization

Drive the adoption and scaling of automation practices across the team, reducing manual tasks related to deployments, scaling, and monitoring.
Ensure the team implements Infrastructure as Code (IaC) and continuously refines CI/CD pipelines to support efficient, repeatable, and reliable infrastructure management.
Lead initiatives for optimizing cloud infrastructure and resource usage, ensuring performance meets business needs while optimizing costs.

Production Release Support

Oversee and support the deployment of new features and updates to production, ensuring minimal downtime and maximum reliability.
Collaborate with development and management teams to ensure a smooth and efficient release process, adhering to established release procedures.
Monitor production environments during and after releases, and be ready to address any issues or perform rollbacks if necessary.

Monitoring, Observability, and Performance Tuning

Oversee the development and maintenance of monitoring and observability systems, ensuring they provide real-time insights into system performance and reliability.
Ensure that system metrics are regularly reviewed and that performance-tuning efforts are prioritized based on system bottlenecks and resource usage patterns.
Work with development teams to ensure observability is integrated into the design and development of applications and services.

Cross-Functional Collaboration and Communication

Serve as the point of contact for reliability-related matters, providing regular updates on system health, incident trends, and improvement plans.
Foster a culture of shared responsibility between SRE and development teams, encouraging collaboration on building reliable, scalable, and performant systems.

Continuous Improvement and Innovation

Promote the adoption of new technologies, frameworks, and tools that enhance system resilience, scalability, and automation.
Regularly review and refine processes to increase the efficiency and effectiveness of incident response, system monitoring, and infrastructure management.

Security, Compliance, and Risk Management

Ensure that security best practices are integrated into all aspects of infrastructure management, including access control, vulnerability management, and data protection.
Collaborate with security teams to ensure compliance with industry standards and regulations while maintaining system availability and performance.
Proactively manage risks related to system reliability and availability, identifying and mitigating potential threats before they impact production environments.

Reporting and Metrics

Define, track, and report on key metrics related to system performance, uptime, and incident response, providing insights to both the engineering team and leadership.
Lead efforts to use data-driven insights for system improvements and to measure the impact of changes to reliability and performance.
Present regular reports on the state of system reliability, key incidents, and ongoing improvement initiatives to leadership and stakeholders.

Why Join Us?

Work with an industry-leading global financial institution.
Competitive salary and comprehensive employee benefits.
Opportunities for professional growth and career advancement.
Collaborative, inclusive, and dynamic work environment.
Commitment to innovation and professional excellence.

Become part of our international community at MultiBank Group, dedicated to excellence, innovation, and shaping the future of finance.

Seniority level : Not Applicable

Employment type : Full-time

Job function : Engineering, Information Technology, and Product Management

Industries : Financial Services and Capital Markets
