Sr Site Reliability Engineer (Linux, UNIX, Reliability Engineering, Python, C, C++, Java, DevOps) in New York City
Location: New York City
Salary: Excellent Compensation with benefits + Interview Travel
SKILLS: Linux, UNIX, Windows, Reliability Engineering, Python, Perl, C, C++, SQL, Java, DevOps, Systems Administration, Application Programming, Networking
DESCRIPTION:
For our prestigious client, we are looking for a Sr Site Reliability Engineer with expertise in Linux, UNIX, Reliability Engineering, Python, C, C++, Java, and DevOps.
Their Deployment and Runtime Platform is responsible for corporate mission-critical production platforms including:
- Private and public Cloud computing platform
- Entitlements platform
- Security and critical infrastructure services
- Linux and Windows engineering
- Core data centre platforms including network and storage

You will be part of a diverse global technical team focusing on critical business problems and interacting with multiple business, operations, and technology teams. We are responsible for a critical client-facing function and aim to innovate and drive solutions through technology that will impact the bottom line for the firm.
HOW YOU WILL FULFILL YOUR POTENTIAL:
- Engage with application development teams to improve the whole lifecycle of services: from inception and design, through deployment, operation and refinement.
- Participate in system design consulting, platform management and capacity planning.
- Develop software and systems architectural frameworks and tooling.
- Maintain services by measuring and monitoring availability, latency and overall system health.
- Create and sustain scalable systems and services through automation and uplifts.

REQUIRED SKILLS AND EXPERIENCE:
- 5 to 7 years of experience in a relevant role (DevOps, Reliability Engineering, Systems Administration, Application Programming, etc.)
- Team player, eager to work in a global organization
- Energy, self-motivation and independence to manage multiple tasks
- Experience with distributed compute systems
- Highly knowledgeable of at least one of the Linux or Windows platforms running key business applications
- Knowledgeable of many other areas of technology (networking, hardware, etc.)
- Highly entrepreneurial and motivated
- Strong interpersonal skills: good client-facing skills as well as excellent oral and written communication

PREFERRED SKILLS:
- Exposure to a Linux / UNIX environment, with advanced knowledge and experience of automating / scripting tasks, is essential
- Hands-on experience in production deployment and release management
- Hands-on experience in troubleshooting and debugging application issues
- Knowledge of best practices and IT operations in a highly available and mission-critical service
- Knowledge of at least one programming language (Erlang, Python, Perl, C++, C, Java, SQL) beyond basic scripting is a key advantage
- Experience managing performance, availability and scale of mid to large sized systems (experience of building and running high availability systems is an advantage)
- Experience with all stages in the lifecycle of large distributed systems: inception, analysis, design, implementation, runtime, maintenance
- Good knowledge of at least one database product (like MSQL / PostgreSQL / DB2) (knowledge of NoSQL products like MongoDB would be an advantage)
- Good knowledge of different software systems, client/server architectures and various compatibility requirements
- Knowledge of software development lifecycles and methodologies, and of release management processes and products, is an added advantage
- Familiarity with open source automation / configuration tools such as Chef, Puppet, Ansible or SaltStack would be an advantage

FOLLOWING SKILLS ARE A MUST HAVE:
- Reliability Engineering
- Python

Our ideal candidate will have a mix of infrastructure skills with UNIX & Linux and software skills such as Python, C/C++ or Java.