Base pay range
$200,000.00/yr - $205,000.00/yr
This range is provided by Insight Global. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
Required Skills & Experience
- 10+ years overall experience in application engineering
- 7+ years of SRE experience (architect or engineer) with SRE/observability toolsets such as Dynatrace, AppDynamics, New Relic, Splunk, or Elastic
- 3+ years of experience monitoring applications under various SDLC methodologies, preferably Agile
- 3+ years of technology design expertise including Performance, Security, Availability, Operations, Monitoring, and Support
- 2+ years of database management experience with SQL and systems such as MSSQL, MySQL, PostgreSQL, or MongoDB
- 2+ years of scripting experience in Unix Shell, Python, or PowerShell
- Experience with containerization, performance tuning, security, and system availability
- Experience in a regulated industry; financial services experience is ideal
- Bachelor's degree in MIS, computer science, math, or related field; an advanced degree is a plus
Job Description
Insight Global seeks a Site Reliability Engineer to join its client's team.
- Design, configure, and set up observability platform tools (Splunk and Dynatrace), both on-premises and in the cloud, to enhance application development and operational stability
- Collaborate with the Observability Manager and Architect to develop monitoring strategies and roadmaps
- Develop automation tooling and processes for monitoring and security compliance
- Integrate and configure tools/frameworks to automate monitoring activities enterprise-wide
- Analyze incidents and usage data to predict issues proactively
- Work across departments to evaluate system effectiveness and efficiency
- Promote adoption of observability tools across technology groups
- Partner with service owners to implement Service Level Metrics & Objectives
- Ensure enterprise platform stability, scalability, and maturity in DevOps practices
- Resolve system issues in line with SLAs for availability, performance, and service levels
- Translate monitoring requirements into engineering tasks
- Present findings and strategies to managers and stakeholders
- Mentor engineers through guidance and training
Inferred benefits
Medical insurance
Vision insurance
401(k)